Technical Lead – Healthcare AI Human Data

MAKZ Logo
  • Healthcare
  • PartTime
  • Applications have closed

Part-time / Contract | Remote | Potential Founding Role

We are building a specialist company focused on producing high-signal healthcare human data to improve reasoning, safety, and evaluation in frontier AI models.

Rather than operating across many domains, we focus exclusively on healthcare, working with experienced clinicians to generate structured datasets that expose model weaknesses and improve performance in complex medical scenarios.

++The role:++

You will lead the design of the technical framework that turns clinical expertise into high-quality AI training and evaluation data.

Your responsibilities will include:

  • Designing healthcare tasks and prompts that test clinical reasoning and model capabilities
  • Developing evaluation rubrics and scoring frameworks for clinician-generated outputs
  • Working closely with clinicians to translate real-world medical reasoning into structured datasets
  • Identifying model failure modes and designing tasks that surface them
  • Testing datasets across frontier models to assess signal quality
  • Establishing the foundations of a scalable healthcare human data pipeline

The initial focus will be producing a pilot dataset that demonstrates the quality and usefulness of our approach to frontier AI labs.

++Ideal background:++

We are looking for someone who understands how human expertise improves AI systems.

Strong candidates may have experience in areas such as:

  • RLHF or post-training datasets
  • LLM evaluation or benchmarking
  • Model red-teaming or safety testing
  • Human-in-the-loop AI systems
  • Prompt design and rubric creation
  • Designing datasets that expose model reasoning failures

Experience working with AI labs, AI research groups, or large-scale human data projects is particularly valuable.

Healthcare experience is helpful but not required, as you will be collaborating with clinicians.