[Remote] Data Engineer
Note: The job is a remote job and is open to candidates in USA. SemiAnalysis is an independent research and analysis firm specializing in the Semiconductor and AI industries. They are seeking a motivated Data Engineer to architect and maintain the data pipelines and infrastructure that support their industry models and consulting work.
Responsibilities
- Design, develop, and maintain robust and scalable ETL pipelines in Python to power our industry models and analytics products.
- Work with lead analysts to ensure data accuracy, completeness, and utility value across multiple sources and formats.
- Build scalable and reusable data workflows in cloud environments (GCP, AWS, or Azure).
- Implement and maintain data quality monitoring.
- Able to effectively use a SQL database via automated cron jobs.
- Maintain and extend dashboards and APIs that deliver data to both internal analysts and external clients.
- Support the integration of new datasets, tools, and infrastructure components to enhance our analytics capabilities.
Skills
- 1–3 years of experience in a Data Engineering, Data Science or reasonably equivalent role
- Capable in Python, SQL, and Excel
- Strong ETL development experience
- Hands-on experience with at least one cloud platform (GCP, AWS, or Azure)
- Highly autonomous—able to take a problem from definition to deployment with minimal oversight
- The right combination of opinionated and low-drama
- Experience with Flask
- Redis
- Dash
- Airflow
- GitHub Actions
- Kubernetes
- Familiarity with automated regression, smoke, or unit testing methodologies
- Experience working with messy, real-world data
Company Overview
- SemiAnalysis offers AI and semiconductor research, consulting, and hosts tech events like Nvidia Blackwell GPU Hackathon. It was founded in 2019, and is headquartered in San Francisco, California, USA, with a workforce of 11-50 employees. Its website is https://semianalysis.com.