The Data Cycling Center (DCC) is a Data Science team that develops AI-driven content understanding capabilities, identifies business opportunities, and builds products to capture those opportunities.
Our mission is to simplify the acquisition and utilization of unstructured data.
About the Role
We are looking for experienced data scientists to join our team and apply advanced analytics and machine learning techniques to optimize intelligent labeling workflows and data products within TikTok's ecosystem.
Key Responsibilities
- Collaborate with cross-functional stakeholders to gather and refine requirements for data labeling projects and identify opportunities for optimization through data-driven solutions.
- Design and manage data labeling and policy testing workflows end to end, across their full lifecycle.
- Establish and maintain a centralized knowledge base for Retrieval-Augmented Generation (RAG) systems.
- Operationalize intelligent labeling pipelines using prompt engineering, agent-based workflows, and labeling models.
- Translate complex policy documents into machine- and human-readable formats.
- Apply multi-modal LLM techniques to extract latent signals from content.
- Lead applied ML and data science research and experimentation to solve business-critical use cases.
- Own the model lifecycle from data sourcing and preprocessing to training, deployment, and post-launch maintenance.
Qualifications
Minimum Qualifications:
- Advanced degree (Master's or Ph.D.) in Statistics, Computer Science, Applied Mathematics, Data Science, or a related quantitative field.
- Strong theoretical foundation in computer science, machine learning, and statistics.
- In-depth experience in unsupervised learning, clustering algorithms, and pattern recognition, including extracting insights from unstructured data such as video.
- Experience in data project management and a solid foundation in mathematics and algorithms.
- Expertise in SQL, Hive, Presto, or Spark, and experience with large-scale datasets.
- Excellent communication and collaboration skills.
Preferred Qualifications:
- At least 3 years of experience in software development or model/data pipeline development.
- Deep understanding of data pipeline architecture, model development lifecycle, testing, and deployment.
- Practical industry experience in applying prompt engineering and emerging AI techniques.
- Demonstrated strong intellectual curiosity, excellent problem-solving skills, and advanced analytical abilities.
About TikTok
TikTok is the leading destination for short-form mobile video.
Our mission is to inspire creativity and bring joy.
Why Join Us
Inspiring creativity is at the core of TikTok's mission.
We strive to do great things with great people.
Diversity & Inclusion
TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives.
Trust & Safety
TikTok recognizes that keeping our platform safe for the TikTok communities is no ordinary job.