Key Responsibilities:
Cloud Operations & Escalation Support :
- Lead the health, availability, security, and performance of multi-account AWS cloud infrastructure.
- Act as the final escalation point for critical incidents, conducting root cause analysis and long-term fixes.
- Drive continuous improvement in monitoring, alerting, patching, backup, and configuration management processes.
- Ensure proper IAM, encryption, access controls, and security posture across all workloads.
- Collaborate with SOC, security, and audit teams on vulnerability remediation and compliance enforcement.
- Optimize cloud usage and cost through tagging, budgeting, and automated policies.
- Conduct internal audits, support DR drills, and maintain operational documentation.
Cloud Deployment & Architecture Support (Day-1 – Secondary):
- Lead or support deployment of infrastructure using Terraform, CloudFormation, or Azure Bicep/ARM templates.
- Design and implement cloud solutions using CI/CD pipelines (GitLab CI, AWS CodePipeline, Azure DevOps).
- Architect secure and scalable workloads using best practices aligned with AWS Well-Architected Framework.
- Participate in solutioning for migrations, containerization (ECS, EKS), and modernization projects.
- Define governance, tagging, security controls, and logging strategy across AWS and Azure platforms.
- Ensure all designs align with GCC policies, IM8 guidelines, and Government compliance requirements.
Experience:
- 6–10 years of IT experience, with at least 4+ years managing production-grade AWS & Azure workloads.
- Prior experience supporting Singapore Government projects or regulated environments (GCC).
- Experience leading cloud deployments, migrations, DR planning, and hybrid infrastructure.
- Strong incident management and change control skills in ITIL-governed environments.