Senior Platform Operations Engineer

Singapore, Permanent, Work from Home or Hybrid
The role focuses on ensuring high availability, security, and performance of an Azure‑based AI platform running on Red Hat OpenShift. It also involves incident management, cybersecurity oversight, and collaborating with development teams to embed automation, monitoring, and best practices into AI/ML operations.
  • Innovative Technology and Industry Leadership
  • Embark on the next phase of AI Evolution

About Our Client

A leading regional technology and digital services enterprise, recognised for its large‑scale cloud, cybersecurity, and digital infrastructure capabilities.

Job Description

  • Monitor availability, detect outages, and optimize performance for the Azure‑based AI cloud platform.
  • Maintain continuous uptime, resilience, and operational efficiency of the RE:AI cloud environment running on Red Hat OpenShift.
  • Lead incident management, conduct root‑cause investigations, and implement disaster‑recovery measures to safeguard business continuity.
  • Oversee cybersecurity operations, including vulnerability remediation, threat monitoring, and access‑control enforcement.
  • Assist with security audits and compliance documentation, and ensure adherence to Singtel policies, regulatory requirements, and industry standards.
  • Work closely with development teams to embed monitoring, automation, and security best practices within AI/ML pipelines.
  • Drive ongoing operational improvements through enhanced automation, observability, and operational excellence initiatives.



The Successful Applicant

  • Bachelor's degree in Computer Science, Engineering, or a related field, with 4-6 years of cloud operations experience.
  • Strong expertise in Azure monitoring tools (Azure Monitor, Log Analytics, App Insights) and OpenShift observability/performance tuning.
  • Solid background in incident response, SRE practices, disaster recovery, and cloud security operations (IAM, SIEM/SOAR, vulnerabilities, firewalls, endpoint protection).
  • Proficient in IaC (Terraform, Bicep, ARM) and automation scripting (PowerShell, Python).
  • Familiar with AI/ML platform components such as AKS, GPU compute, pipelines, and model‑hosting environments.
  • Knowledge of ISO 27001, CIS, and NIST frameworks, along with strong problem‑solving and communication skills and the ability to anticipate and mitigate risks.



What's on Offer

Innovative Technology and Industry Leadership

Embark on the next phase of AI Evolution

Contact
Jayden Yap (Lic No: R22110369 / EA No: 18C9065)
Quote job ref: JN-032026-6966967
Phone number: +65 6416 9897

Job summary

Function: IT
Specialisation: Infrastructure
Area of specialisation: Technology & Telecoms
Location: Singapore
Contract Type: Permanent
Work from Home: Work from Home or Hybrid

Diversity & Inclusion at Michael Page

We don't just accept difference - we celebrate it. We encourage applicants from all backgrounds to apply for this role and are committed to building inclusive, diverse workplaces where everyone can thrive. If you require any support or reasonable adjustments during the recruitment process, please let us know.