Specialist, Site Reliability Engineer (SRE)

TNG Digital Lihat semua pekerjaan

  • Kuala Lumpur
  • Tetap
  • Sepenuh masa
  • 3 hari lepas
We fuel the ideas and ambitions of our people with an environment built on Our DNA of Love, Entrepreneurship, Agility, and Passion - LEAP! We are a culture that empowers everyone to innovate and create solutions that will leave a positive impact on our communities and our nation, Touch 'n Go will always be here to inspire our talents to grow as leaders and innovators giving you the power to make a difference. What would you do Network administration a) Design, implement and manage network infrastructure b) Monitor network performance and ensure security compliance c) Maintain and implement cross/multi cloud networking d) Implement network segmentation and access control to improve security e) Implement and maintain monitoring systems to proactively identify performance bottlenecks, security vulnerabilities, and other issues. Cloud infrastructure management, capacity planning and monitoring a) Maintain and optimize cloud-based infrastructure b) Deploy, configure, and manage Linux and Windows servers using automation tools c) Monitor and troubleshoot infrastructure performance, security, scalability d) Assess system capacity and performance requirement, implement scalable solutions that meet future growth needs Cloud FinOps a) Monitor and analyze cloud spending across multi-cloud environments (AWS, Azure, Alibaba Cloud) b) Work with finance teams to ensure accurate cloud budgeting, forecasting, and chargeback models c) Optimize storage, networking, and computing costs without impacting performance d) Leverage FinOps tools (AWS Cost Explorer, Azure Cost Management, Alibaba Cloud Cost Center, etc.) Security and compliance a) Collaborate with other SRE and Security team to manage and optimize infrastructure resources and ensure application of industry standard security b) Collaborate with development teams to design and implement secure infrastructure architectures, ensuring the confidentiality, integrity, and availability of systems and data c) Ensuring compliance with security, audits and regulatory requirements d) Plan and execute disaster recovery plans. Incident response and troubleshooting a) Collaborate with different functional teams to address incidents and restore services quickly. Investigate and resolve incidents, apply root cause analysis to prevent recurrence Who should join us Bachelor's degree in Computer Science, Network or related field Professional cloud certification Proven 5 experience in a Cloud Network or Cloud Infrastructure role Strong experience in site reliability engineering, infrastructure engineering or a similar role. Strong knowledge on network and protocols, network security and cloud networking Proven strong record of cloud cost optimisation Experience with cloud platforms (e.g., AWS, Azure, GCP, Alibaba Cloud) and infrastructure as code (IaC) tools (e.g., Terraform, CloudFormation). Basic or advanced cloud certification is a plus Experience with containerization technologies like Docker and container orchestration platforms such as Kubernetes is a plus Knowledge of networking principles and protocols Deep knowledge of Linux/Unix systems and administration Strong problem-solving skills and the ability to handle high-pressure situations calmly and effectively Strong attention to detail and a commitment to delivering high-quality results Our Perks & Benefits: Flexi clock-in hours. Monthly eWallet allowance. Additional 1% employer EPF contribution from your 1st to 3rd year of service, with further increases based on your continued years of service. Unlimited office pantry fruits, snacks and drinks. Mobile and broadband subscription reimbursement. Flexibility to opt dependants coverage (spouse, child, parents or parents-in-law) for outpatient medical benefits. Additional leave including family leave and paid care leave to care for family members. Medical coverage including dental, optometrist, mental care, maternity, registered Traditional Chinese Medicine ('TCM') and Chiropractic. Corporate membership discount and many more to explore. We believe that you have what it takes to fit into the Touch 'n Go family and help revolutionize the Fintech industry by paving the way to a cashless society. If you're ready to take the next step, apply now! Touch 'n Go is an organization that strives to provide Equal Opportunity Employment, based on merit, qualifications, capabilities, and calibre. It is Touch 'n Go's policy to not discriminate based on age, race, religion, colour or other personal status, identity or characteristics. Fair Opportunity is Our Value and Practice. Please advise us of any accommodations you may need by e-mailing: [HIDDEN TEXT] Note : Only shortlisted candidates will be contacted. Let's keep LEAP-ing forward together!

foundit

Pekerjaan yang sama

  • Site Reliability Engineer Lead

    FeedMe

    • Kuala Lumpur
    About Us FeedMe's Software Engineering team develops next-generation technologies that change lifestyles for millions of users. Our products handle transactions at a massive scale …
    • 3 hari lepas
  • Senior site reliability engineer

    BP

    • Kuala Lumpur
    Who You Will Work With A multi-disciplinary squad, engaging enterprise platform teams, data platform teams, vendors, third party resources in resilient and optimal operations of on…
    • 3 hari lepas