Technical Lead – Healthcare AI Human Data

Part-time / Contract | Remote | Potential Founding Role

We are building a specialist company that produces high-signal human data for healthcare, used to improve reasoning, safety, and evaluation in frontier AI models.

Rather than operating across many domains, we focus exclusively on healthcare, working with experienced clinicians to generate structured datasets that expose model weaknesses and improve performance in complex medical scenarios.

++The role:++

You will lead the design of the technical framework that turns clinical expertise into high-quality AI training and evaluation data.

Your responsibilities will include:

- Designing healthcare tasks and prompts that test clinical reasoning and model capabilities
- Developing evaluation rubrics and scoring frameworks for clinician-generated outputs
- Working closely with clinicians to translate real-world medical reasoning into structured datasets
- Identifying model failure modes and designing tasks that surface them
- Testing datasets across frontier models to assess signal quality
- Establishing the foundations of a scalable healthcare human data pipeline

The initial focus will be producing a pilot dataset that demonstrates to frontier AI labs the quality and usefulness of our approach.

++Ideal background:++

We are looking for someone who understands how human expertise improves AI systems.

Strong candidates may have experience in areas such as:

- RLHF or post-training datasets
- LLM evaluation or benchmarking
- Model red-teaming or safety testing
- Human-in-the-loop AI systems
- Prompt design and rubric creation
- Designing datasets that expose model reasoning failures

Experience working with AI labs, AI research groups, or large-scale human data projects is particularly valuable.

Healthcare experience is helpful but not required, as you will be collaborating with clinicians.
