Principal Machine Learning Engineer, AI Platform – AI Infrastructure
Grab is Southeast Asia's leading superapp that delivers everything from meals to finances and local mobility.
We harness technology and AI to empower people economically while staying true to heart, hunger, honour, and humility.
As a Principal Machine Learning Engineer focused on AI Infrastructure, you will shape the backbone of Grab’s AI ecosystem.
You will design and evolve scalable platforms for model training, serving, and evaluation anchored on Ray and Kubernetes, enabling thousands of engineers and data scientists to innovate safely and efficiently.
You will report to the Head of Engineering, and the role will be onsite at Grab's office in Singapore.
Critical Tasks You Will Perform
Independently Lead and Execute: Demonstrate strength as a technical lead by taking full responsibility for project conception, planning, and execution.
Architect the Future of AI Infrastructure: Design and scale the next generation of distributed systems for model training, inference, and experimentation on Kubernetes and Ray.
Build Platforms for Scale: Develop core abstractions, APIs, and services that make AI experimentation, deployment, and monitoring seamless across Grab.
Enable Cost-Efficient AI at Scale: Drive initiatives to optimize GPU/CPU utilization, storage, and networking for large-scale AI workloads, delivering significant efficiency gains.
Integrate Research with Production Systems: Translate cutting-edge distributed training, scheduling, and serving techniques into production-ready systems that can handle Grab's scale.
Influence AI Platform Strategy: Partner with engineering and product leadership to set direction for Grab's AI infrastructure roadmap, balancing long-term vision with practical execution.
Mentor and Inspire: Provide deep technical mentorship, foster platform thinking, and cultivate a culture of excellence across engineering and research teams.
Qualifications
What Essential Skills You Will Need
Experience
6+ years of experience building large-scale AI/ML or distributed systems infrastructure.
At least 2 years in a technical leadership capacity, driving architectural decisions and mentoring teams.
Deep Infrastructure & Distributed Systems Expertise
Hands-on experience with Ray (Ray Train, Ray Serve, Ray Tune) and distributed data processing frameworks (e.g., Dask, Spark).
Expertise in Kubernetes, container orchestration, autoscaling, and cloud-native architectures.
Systems & Platform Engineering
Experience designing and delivering developer platforms that abstract away complexity while ensuring scale.
Background in APIs, microservices, observability, and CI/CD best practices.
Cloud & Compute Optimization
Experience running large-scale AI/ML workloads on cloud infrastructure (AWS/GCP/Azure).
Expertise in GPU scheduling, heterogeneous clusters, and cost-optimization strategies.
Programming & Engineering Excellence
Proficiency in Python and one or more system-level languages (e.g., Go, Rust, C++).
Strong engineering fundamentals in concurrency, networking, storage, and system performance.
Strategic Visionary & Leadership
Strategic AI Infrastructure Leadership: Develops roadmaps that align AI infrastructure with core business priorities.
Platform Empowerment: Passionate about building platforms that accelerate impact for engineers, researchers, and product teams.
Influence & Mentorship: Influences technical direction across diverse teams and has a strong track record of mentoring engineers.
Benefits
We care about your well-being.
Here are some global benefits we offer:
We have your back with Term Life Insurance and comprehensive Medical Insurance.
With GrabFlex, create a benefits package that suits your needs and aspirations.
Celebrate moments that matter in life with loved ones through Parental and Birthday leave, and give back to your communities through Love-all-Serve-all (LASA) volunteering leave.
We have a confidential Grabber Assistance Programme to guide and uplift you and your loved ones through life's challenges.
Balancing personal commitments and life's demands is made easier with our FlexWork arrangements, such as differentiated hours.
What We Stand For At Grab
We are committed to building an inclusive and equitable workplace that enables diverse Grabbers to grow and perform at their best.
As an equal opportunity employer, we consider all candidates fairly and equally regardless of nationality, ethnicity, religion, age, gender identity, sexual orientation, family commitments, physical and mental impairments or disabilities, and other attributes that make them unique.
Location: Queenstown, Central Singapore Community Development Council, Singapore