LLM Engineer
Role: LLM Engineer
Location: San Jose, CA (2 Days onsite)
Duration: 12 Months
Only W2
Model Development & Optimization: Design, train, fine-tune, and evaluate large language models (LLMs) for performance, efficiency, and alignment with product or research goals.
Systems Integration & Deployment: Implement scalable inference pipelines, optimize serving infrastructure (e.g., quantization, caching, distillation), and integrate models into applications or APIs.
Research & Cross-Functional Collaboration: Lead experimentation with new architectures, prompt-engineering techniques, or retrieval systems, and collaborate with product, data, and ML operations teams to translate research into production features. Model Development & Optimization: Design, train, fine-tune, and evaluate large language models (LLMs) for performance, efficiency, and alignment with product or research goals.
Systems Integration & Deployment: Implement scalable inference pipelines, optimize serving infrastructure (e.g., quantization, caching, distillation), and integrate models into applications or APIs.
Research & Cross-Functional Collaboration: Lead experimentation with new architectures, prompt-engineering techniques, or retrieval systems, and collaborate with product, data, and ML operations teams to translate research into production features.