Overview
Data Scientist (Reinforcement Learning/LLM Agent / Vision Language Model - either 1) at Binance.
Join to apply for this role.
Binance is a leading global blockchain ecosystem focused on security, transparency, and scalable AI-enabled products.
Responsibilities
Research and develop state-of-the-art RL algorithms, focusing on Large Model Optimization and alignment techniques.
Design and implement RL training pipelines, including environment simulation, data generation, and reward function design.
Apply RL methods to enhance LLM/VLM/Agentic AI capabilities in reasoning, planning, and autonomous decision-making.
Collaborate with Engineers and researchers to integrate RL solutions into enterprise AI platforms.
Monitor model performance in production and continuously improve through iterative training and fine-tuning.
Requirements
Master’s Degree in Computer Science, Applied Mathematics, Machine Learning, or related fields.
5+ years of hands-on experience in RL or LLM/VLM/Agentic AI optimization.
Strong coding skills in Python, with experience in ML frameworks and RL libraries.
Experience with large-scale distributed training and optimization.
Self-driven, ownership mindset, and strong problem-solving skills.
Excellent communication skills for cross-functional collaboration.
Why Binance
Shape the future with the world’s leading blockchain ecosystem
Collaborate with world-class talent in a user-centric global organization with a flat structure
Tackle unique, fast-paced projects with autonomy in an innovative environment
Thrive in a results-driven workplace with opportunities for career growth and continuous learning
Competitive salary and company benefits
Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)
Binance is committed to being an equal opportunity employer.
We believe that having a diverse workforce is fundamental to our success.
By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice.
#J-18808-Ljbffr