AI Egineer_GPU Performance

Singapore Permanent View Job Description
This role focuses on designing and optimizing large‑scale AI and GenAI workloads across multi‑GPU systems, driving performance, scalability, and efficiency. You will build custom models, AI agents, and distributed training pipelines that power next‑generation manufacturing intelligence.
  • Work on GenAI, large‑scale model training, and GPU performance optimization
  • Exposure to multi‑node, multi‑GPU systems, low‑level optimization

About Our Client

The client is a global technology manufacturer recognized for innovation in advanced systems, AI, and high‑performance computing. With a strong commitment to research, sustainability, and engineering excellence, they provide an environment where highly technical engineers can solve complex, real‑world problems at scale.

Job Description

  • Architect and execute large‑scale model training and fine‑tuning on multi‑node, multi‑GPU clusters
  • Optimize training and inference performance using distributed strategies (DDP, FSDP, DeepSpeed, Megatron‑LM)
  • Design and develop autonomous AI Agents for complex, multi‑step manufacturing workflows
  • Profile and analyze GPU‑intensive workloads to identify compute, memory, and latency bottlenecks
  • Develop and optimize high‑performance GPU kernels using CUDA or related GPGPU frameworks
  • Partner with hardware architects to shape next‑generation accelerator features
  • Build performance regression testing frameworks for drivers, compilers, and runtime system



The Successful Applicant

  • At least 5 years' experience in GPU computing, performance optimization, or low‑level systems programming
  • Deep knowledge of GPU architectures, memory hierarchies, and interconnects
  • Strong hands‑on experience with PyTorch and distributed model training techniques
  • Expertise in LLM fine‑tuning, inference optimization, and GenAI application development
  • Advanced C++ skills and proficiency in CUDA or other GPGPU frameworks
  • Solid understanding of end‑to‑end ML systems, CI/CD pipelines, and cloud or on‑prem environments
  • Excellent analytical, communication, and cross‑functional collaboration skills



What's on Offer

  • Competitive compensation and performance‑linked incentives
  • Opportunity to work on industry‑leading AI and GPU technologies
  • Exposure to large‑scale, real‑world GenAI and manufacturing systems
  • Strong emphasis on learning, innovation, and technical growth
  • Collaborative, inclusive culture with long‑term career progression



Contact
Lydia Chen (Lic No: R22108104 / EA no: 18C9065)
Quote job ref
JN-042026-6992136
Phone number
+65 6416 9829

Job summary

Function
IT
Specialisation
Infrastructure
What is your area of specialisation?
Technology & Telecoms
Location
Singapore
Contract Type
Permanent
Consultant name
Lydia Chen (Lic No: R22108104 / EA no: 18C9065)
Consultant contact
+65 6416 9829
Job Reference
JN-042026-6992136

Diversity & Inclusion at Michael Page

We don't just accept difference - we celebrate it. We encourage applicants from all backgrounds to apply for this role and are committed to building inclusive, diverse workplaces where everyone can thrive. If you require any support or reasonable adjustments during the recruitment process, please let us know.