Save Job Back to Search Job Description Summary Similar JobsWork on GenAI, large‑scale model training, and GPU performance optimizationExposure to multi‑node, multi‑GPU systems, low‑level optimizationAbout Our ClientThe client is a global technology manufacturer recognized for innovation in advanced systems, AI, and high‑performance computing. With a strong commitment to research, sustainability, and engineering excellence, they provide an environment where highly technical engineers can solve complex, real‑world problems at scale.Job DescriptionArchitect and execute large‑scale model training and fine‑tuning on multi‑node, multi‑GPU clustersOptimize training and inference performance using distributed strategies (DDP, FSDP, DeepSpeed, Megatron‑LM)Design and develop autonomous AI Agents for complex, multi‑step manufacturing workflowsProfile and analyze GPU‑intensive workloads to identify compute, memory, and latency bottlenecksDevelop and optimize high‑performance GPU kernels using CUDA or related GPGPU frameworksPartner with hardware architects to shape next‑generation accelerator featuresBuild performance regression testing frameworks for drivers, compilers, and runtime systemThe Successful ApplicantAt least 5 years' experience in GPU computing, performance optimization, or low‑level systems programmingDeep knowledge of GPU architectures, memory hierarchies, and interconnectsStrong hands‑on experience with PyTorch and distributed model training techniquesExpertise in LLM fine‑tuning, inference optimization, and GenAI application developmentAdvanced C++ skills and proficiency in CUDA or other GPGPU frameworksSolid understanding of end‑to‑end ML systems, CI/CD pipelines, and cloud or on‑prem environmentsExcellent analytical, communication, and cross‑functional collaboration skillsWhat's on OfferCompetitive compensation and performance‑linked incentivesOpportunity to work on industry‑leading AI and GPU technologiesExposure to large‑scale, real‑world GenAI and manufacturing systemsStrong emphasis on learning, innovation, and technical growthCollaborative, inclusive culture with long‑term career progressionContactLydia Chen (Lic No: R22108104 / EA no: 18C9065)Quote job refJN-042026-6992136Phone number+65 6416 9829Job summaryFunctionITSpecialisationInfrastructureWhat is your area of specialisation?Technology & TelecomsLocationSingaporeContract TypePermanentConsultant nameLydia Chen (Lic No: R22108104 / EA no: 18C9065)Consultant contact+65 6416 9829Job ReferenceJN-042026-6992136