Join to apply for the Hunyuan LLM Site Reliability Engineer role at Tencent .
**Business Unit**
Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers.
TEG provides users with a full range of customer services.
As the operator of the largest networking, devices, and data center in Asia, TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms, and supporting business innovation.
**What The Role Entails**
- Responsible for the operation and maintenance of overseas model services at Hunyuan, ensuring stable, reliable, and efficient service operations;
- Responsible for capacity management and planning, resource cost optimization, ensuring reasonable online service capacity and improving resource efficiency;
- Responsible for continuous integration and delivery, efficient and automated operational optimization, enhancing service stability and R&D efficiency;
- Participate in the design of online systems and various service architectures, providing professional solutions for stability and architecture improvement;
- Analyze and deeply explore the shortcomings of existing systems, data-driven to find weak points, and promote system optimization and improvement;
- Pay attention to industry front-end technology trends, explore technologies and directions for automation and intelligence in the operation and maintenance of complex business systems.
**Who We Look For**
- Bachelor's degree or above, with 2+ years experience in internet operations and maintenance;
- Familiar with Linux OS, solid system management and network knowledge;
- Experience with deploying, configuring, and tuning Nginx, Redis, MySQL;
- Proficient in monitoring systems like Zabbix, Prometheus, Grafana, and real-time system status monitoring;
- Proficient in at least one programming language (Python, Go, Shell, etc.), with experience developing automated operational tools;
- Familiar with overseas public cloud operations (AWS, Azure, etc.), containerization, microservices architecture;
- Strong responsibility, good communication, learning ability, and team spirit;
- Proficient in English and Chinese (listening, speaking, reading, writing), capable of updating workflows and technical documents.
**Equal Employment Opportunity**
We believe diverse voices fuel innovation.
Tencent fosters an environment where every employee feels supported and inspired to achieve goals.
Seniority level
Employment type
Job function
- Information Technology and Engineering
Industries
#J-18808-Ljbffr