Overview
Data Scientist (Reinforcement Learning / LLM Agent / Vision Language Model, any one of these) at Binance.
Binance is a leading global blockchain ecosystem focused on security, transparency, and scalable AI-enabled products.
Responsibilities
- Research and develop state-of-the-art RL algorithms, focusing on large-model optimization and alignment techniques.
- Design and implement RL training pipelines, including environment simulation, data generation, and reward function design.
- Apply RL methods to enhance LLM/VLM/Agentic AI capabilities in reasoning, planning, and autonomous decision-making.
- Collaborate with engineers and researchers to integrate RL solutions into enterprise AI platforms.
- Monitor model performance in production and continuously improve it through iterative training and fine-tuning.
Requirements
- Master’s Degree in Computer Science, Applied Mathematics, Machine Learning, or related fields.
- 5+ years of hands-on experience in RL or LLM/VLM/Agentic AI optimization.
- Strong coding skills in Python, with experience in ML frameworks and RL libraries.
- Experience with large-scale distributed training and optimization.
- Self-driven, with an ownership mindset and strong problem-solving skills.
- Excellent communication skills for cross-functional collaboration.
Why Binance
- Shape the future with the world’s leading blockchain ecosystem
- Collaborate with world-class talent in a user-centric global organization with a flat structure
- Tackle unique, fast-paced projects with autonomy in an innovative environment
- Thrive in a results-driven workplace with opportunities for career growth and continuous learning
- Competitive salary and company benefits
- Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)
Binance is committed to being an equal opportunity employer.
We believe that having a diverse workforce is fundamental to our success.
By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice.