LLM Engineer

Apolis Logo
  • Production
  • FullTime

Role: LLM Engineer

Location: San Jose, CA (2 Days onsite)

Duration: 12 Months

Only W2

Model Development & Optimization: Design, train, fine-tune, and evaluate large language models (LLMs) for performance, efficiency, and alignment with product or research goals.

Systems Integration & Deployment: Implement scalable inference pipelines, optimize serving infrastructure (e.g., quantization, caching, distillation), and integrate models into applications or APIs.

Research & Cross-Functional Collaboration: Lead experimentation with new architectures, prompt-engineering techniques, or retrieval systems, and collaborate with product, data, and ML operations teams to translate research into production features. Model Development & Optimization: Design, train, fine-tune, and evaluate large language models (LLMs) for performance, efficiency, and alignment with product or research goals.

Systems Integration & Deployment: Implement scalable inference pipelines, optimize serving infrastructure (e.g., quantization, caching, distillation), and integrate models into applications or APIs.

Research & Cross-Functional Collaboration: Lead experimentation with new architectures, prompt-engineering techniques, or retrieval systems, and collaborate with product, data, and ML operations teams to translate research into production features.