1.Manage daily operations of the company's AWS cloud platform, including services such as EC2, RDS, S3, VPC, ECS, and CloudWatch.
2.Design, implement, and maintain highly available, scalable cloud architecture solutions.
3.Manage and optimize automation tools for operations, such as Terraform, CloudFormation, and Shell/Python scripts.
4.Monitor system performance, promptly respond to and resolve incidents and performance issues in the cloud environment.
5.Support development teams in continuous delivery (CI/CD), including build, deployment, and release processes.
6.Regularly assess and optimize costs, security policies, and resource utilization efficiency.
7.Assist in compliance tasks such as security audits, data backup, and disaster recovery.
1.
Bachelor's degree or higher in Computer Science, Networking, Security, or a related field.
2.
Fluent in Chinese and English, both of which can be used as working languages.
3.Minimum 5 years of public cloud operations experience, with at least 3 years focused on AWS.
4.Proficient in Linux OS and common commands, with strong troubleshooting skills.
5.Hands-on experience with at least one infrastructure automation tool (e.g., Shell, Python).
6.Familiarity with monitoring tools such as CloudWatch, Prometheus, or Grafana.
7.Excellent documentation, communication, and teamwork abilities.
8.AWS Certification (e.g., Solutions Architect Associate, SysOps Administrator) is a plus.
9.AI Ops experience is a plus.
1.Understanding of DevOps practices, with experience in GitLab CI/Jenkins.
2.Hands-on experience in cross-region AWS deployment and failover implementation.
3.Knowledge of IAM policy design and CloudTrail log analysis.
4.Experience handling emergency incidents (e.g., high load, data loss, DDoS attacks).