Job Duties
- Manage Multi-Cloud Operations – Operate, maintain, and troubleshoot cloud-native services across AWS, Azure, and GCP, ensuring uptime, performance, and scalability in production environments.
- Lead Infrastructure as Code (IaC) Practices – Maintain and enhance IaC pipelines using Terraform, Ansible, or ARM templates, resolving drift and deployment issues while promoting automation and GitOps practices.
- Oversee OS Lifecycle & Patch Management – Lead Windows and Linux patching operations using AWS Systems Manager Patch Manager, Azure Update Management, WSUS, SCCM, and YUM/DNF, ensuring compliance and audit readiness.
- Support Application Deployment & Troubleshooting – Deploy, monitor, and troubleshoot applications on Windows and Linux servers; optimize OS-level performance and collaborate with development teams on infrastructure-related issues.
- Enforce Security & Compliance – Implement CIS hardening, remediate vulnerabilities using tools such as Trend Micro Vision One, Qualys, and Tenable, and ensure adherence to government security and audit standards.
- Drive Containerization & DevSecOps Integration – Support containerized environments (Docker, Kubernetes, ECS, EKS, AKS, GKE) and CI/CD pipelines, aligning with SHIP-HATS and other government DevSecOps frameworks.
- Maintain ITIL & Service Management Processes – Manage incidents, problems, and changes through ITSM tools (ServiceNow, Jira), coordinate CAB reviews, and ensure SLAs/OLAs are consistently met.
- Implement Monitoring & Observability Tools – Integrate monitoring and log analysis solutions using CloudWatch, Azure Monitor, and GCP Cloud Logging to enhance infrastructure visibility and reliability.
- Lead Documentation & Knowledge Management – Develop and maintain detailed runbooks, SOPs, architecture diagrams, and CMDB entries to ensure operational consistency and audit readiness.
- Provide Technical Leadership & Mentorship – Guide Level 2 and junior engineers through technical escalations, training sessions, and best practice adoption, fostering a culture of operational excellence.
Job Requirements
- Education: Bachelor's degree in Computer Science, Information Systems, or a related field.
- Experience: Minimum 5 years in cloud engineering, with at least 3 years in AWS/Azure/GCP environments and 2 years in regulated or public-sector settings.
- Cloud Expertise: Proven experience managing production workloads across AWS, Azure, and GCP, including core services such as EC2, EKS, AKS, and GKE.
- IaC & Automation: Hands-on proficiency in Terraform, Ansible, or ARM templates; scripting skills using PowerShell, Bash, or Python.
- OS Management: Strong understanding of Windows Server administration and patching, with working knowledge of Linux (RHEL).
- Security Knowledge: Experience with CIS Benchmarks, IAM best practices, vulnerability remediation, and SSL certificate management.
- DevSecOps & Containers: Familiarity with CI/CD pipelines, container orchestration tools, and the Singapore Government's SHIP-HATS or IM8 frameworks.
- ITIL & ITSM: Strong understanding of ITIL processes and experience using ITSM platforms such as ServiceNow or Jira.
- Leadership Skills: Proven ability to mentor, lead technical teams, and handle escalations in complex multi-cloud environments.
- Certifications (Preferred): AWS Solutions Architect / SysOps Administrator, Azure Administrator / Architect Expert, RHCE or LPIC, and ITIL v4 Foundation.
To apply, please email your updated resume to
We regret that only shortlisted candidates will be notified.
CEI: R
EA License: 14C7275