Job Description
            
                About Data Cycling CenterAI is the new electricity and data is the energy for AI.
Our core belief is that unstructured data contains the untapped wisdom of humanity — and by building the best AI-ready data infrastructure, we enable more meaningful, responsible, and creative uses of AI.
Data Cycling Center (DCC) is a Data Science team that is responsible for building the mid-platform data layer that powers AI development across all business lines of TikTok, including e-commerce, advertising, livestreaming, and emerging technologies.
Our initiatives include:
- Scaled human-in-the-loop (HITL) data processing (including capability, methodology, E2E process, platform), delivering the most cost-effective results at benchmark quality levels that outperform leading tech peers.
- Built a highly automated end-to-end data pipeline for AI-generated content (AIGC) that supports strong model performance with minimal resource input.
- Develop a comprehensive content understanding and insight generation system, setting new benchmarks for marketing intelligence in the online ads ecosystem.
DCC aims to be the number 1 data service provider in the AI era.
Our mission is to build the most effective (lossless and affordable) content understanding capabilities to fully satisfy AI needs of TikTok and to help the industry push the limit.
About the role As a Lead Data Engineer, you will be responsible for building up both the team and the data pipeline.
You will be working on cutting-edge challenges in the big data and AI industry which requires strong passion and capability of innovation.
You will collaborate closely with cross-functional teams to understand business requirements and translate them into technical solutions.
Additionally, you will provide technical guidance, mentorship, and support to junior team members.
Responsibilities:
- Architect efficient, scalable and reliable data pipelines and infrastructure for ingesting, processing, and transforming large volumes of data
- Define the technical strategy and roadmap for data engineering projects in alignment with business objectives, actively evaluate and bring in industry best practices and state-of-the-art technical approaches, and timely update the strategy according to the rapid change of the industry
- Build and lead a high-performing data engineering team with a clear strategy, providing business, technical, and personal development coaching
- Own and drive data engineering projects by leveraging both internal and cross-functional resources, setting meaningful and challenging targets, and achieving them with innovative approaches
- Foster a collaborative and inclusive team culture within and across teams, collaborate with data scientists Why Join Us
- Shape the Future of AI
Work at the core of TikTok’s AI development, where your data powers cutting-edge applications across e-commerce, ads, livestream, and more.
- Massive Scale, Real-World Impact
Process petabytes of unstructured data to influence real products used by over a billion users worldwide.
- Global Team, Local Leadership
Join a diverse, high-performing team spread across Singapore, New York, San Jose, Shanghai, and more
- Build What Others Benchmark Against
Our innovations in HITL systems, AIGC pipelines, and content understanding set the industry standard - not just keep up with it.
- From Data to Strategy
We don’t just process data — we turn it into actionable insight that shapes product and business strategy across all TikTok verticals.
- Dynamic, Fast-Paced, Fun
Experience the energy of a high-impact team that moves fast, breaks new ground, and never stops learning.
 Minimum Qualifications
- Bachelor's or Master's degree in Computer Science, Engineering, or related field.
- At least 5 years of experience in data engineering, with proven expertise in building scalable data pipelines and infrastructure.
- Experience with big data technologies such as Apache Spark, Hadoop, and Kafka along with strong proficiency in programming languages such as Python, SQL, and Java.
- Strong understanding of database systems, data warehousing, and distributed computing concepts.
- Proven experience in team management, with a strong track record of leading teams and collaboration with stakeholders at a regional or global level; proven track record of leading teams and delivering complex data engineering projects.
- Excellent communication and expression skills, logical thinking, teamwork and execution abilities.
Independent thinking and capable of exploring and prioritising the core business needs.
- Ability to thrive in a fast-paced, dynamic environment and manage multiple priorities effectively.