Machine Learning Engineer | $25–$30/hr Remote | Mercor
At Crossing Hurdles, we work as a referral partner: we refer candidates to Mercor, which collaborates with the world’s leading AI research labs to build and train cutting-edge AI models.
Organization: Mercor
Position: Technical Reviewer — RL Environment Terminal Benchmarking (Agentic AI)
Referral Partner: Crossing Hurdles
Type: Hourly Contract
Compensation: $25–$30/hour
Location: India (Remote)
Commitment: 10–40 hours/week
Role Responsibilities (Training support will be provided)
- Review and validate reinforcement learning (RL) environment design, terminal conditions, and benchmarking protocols.
- Assess evaluation metrics and ensure fairness, reproducibility, and consistency across RL experiments.
- Provide detailed technical feedback on environment codebases, documentation, and evaluation workflows.
- Collaborate with AI researchers to refine environment architecture, performance measures, and reproducibility standards.
- Verify experimental results across runs, seeds, and hardware configurations to ensure robust benchmarking practices.
- Recommend improvements for environment design, metric definitions, and implementation rigor.
Requirements
- Strong background in Reinforcement Learning, Computer Science, or Applied AI research.
- Experience working with RL environments and benchmarking methodologies.
- Skilled in Python programming; familiarity with frameworks such as PyTorch or TensorFlow preferred.
- Excellent understanding of evaluation metrics, reproducibility protocols, and experimental analysis.
- Strong analytical thinking, technical communication, and attention to detail.
- Interest in agentic AI systems and the development of reliable evaluation pipelines.
Application Process (Takes 20 min)
- Upload resume
- AI interview based on your resume (15 min)
- Submit form