Machine Learning Engineer | $30/hr Remote | Mercor

At Crossing Hurdles, we work as a referral partner. We refer candidates to Mercor that collaborates with the world’s leading AI research labs to build and train cutting-edge AI models.

Organization: Mercor

Position: Technical Reviewer — RL Environment Terminal Benchmarking (Agentic AI)

Referral Partner: Crossing Hurdles

Type: Hourly Contract

Compensation: $25–$30/hour

Location: India (Remote)

Commitment: 10–40 hours/week

Role Responsibilities (Training support will be provided)

Review and validate reinforcement learning (RL) environment design, terminal conditions, and benchmarking protocols.
Assess evaluation metrics and ensure fairness, reproducibility, and consistency across RL experiments.
Provide detailed technical feedback on environment codebases, documentation, and evaluation workflows.
Collaborate with AI researchers to refine environment architecture, performance measures, and reproducibility standards.
Verify experimental results across runs, seeds, and hardware configurations to ensure robust benchmarking practices.
Recommend improvements for environment design, metric definitions, and implementation rigor.

Requirements

Strong background in Reinforcement Learning, Computer Science, or Applied AI research.
Experience working with RL environments and benchmarking methodologies.
Skilled in Python programming; familiarity with frameworks such as PyTorch or TensorFlow preferred.
Excellent understanding of evaluation metrics, reproducibility protocols, and experimental analysis.
Strong analytical thinking, technical communication, and attention to detail.
Interest in agentic AI systems and the development of reliable evaluation pipelines.

Application Process (Takes 20 min)

Upload resume
AI interview based on your resume (15 min)
Submit form

Apply for job