Senior Data Engineer
Position: Senior Data Engineer
Location: Chantilly, VA (Starting 2-3 week hybrid, then converted to remote)
Duration: Long-term Contract
Job Responsibilities
- Design and implement data pipelines and ETL processes to support the cyber investigative capabilities.
- Develop and maintain data analytics solutions for desktop and web-based visual analytic applications.
- Establish applications that produce manageable, actionable intelligence from streams of structured, semi-structured, and data.
- Design strategies for enterprise database systems and set standards for operations, programming, and security.
- Construct and optimize large relational databases across multi-enclave environments (Unclassified, Secret, and Top Secret).
- Tune performance of large-scale data workflows, ensuring cost efficiency, low latency, and high availability.
- Design and manage Elasticsearch/Apache Solr clusters for fast search, indexing, and retrieval of large-scale datasets.
- Integrate new systems with existing warehouse structures and refine system performance and functionality.
- Implement CI/CD pipelines for data systems, automate monitoring/alerting, and enforce infrastructure-as-code practices.
- Provide technical leadership and mentorship to other team members.
- Participate in Program Increments (PIs) and Agile Release Train (ART) activities.
Required Qualifications
-
Bachelor’s degree in Computer Science, Data Science, Engineering, or related field with 8 years of experience in data engineering or related field. Associate degree with 11 years of experience in data engineering or related field; OR High School/Diploma with 14 years of experience in data engineering or related field.
-
Experience with SAFe Agile framework.
-
Strong understanding of forensic and investigative data requirements.
-
Demonstrated experience designing and implementing data solutions in secure government environments.
-
May require occasional travel for Program Increment planning sessions.
-
May require flexible scheduling to support critical operations.
-
Proficiency with:
-
Python for data processing, automation and ETL workflow orchestration.
-
SQL (MySQL, PostgreSQL, Microsoft SQL) and query optimization.
-
Elasticsearch and Apache Solr (design, scaling, query optimization, cluster management).
-
Data pipeline technologies (Apache Kafka, Apache Nifi, Cribl).
-
Log analytics and observability platforms, particularly Splunk.
-
Containerization and orchestration technologies (Docker, Kubernetes).
-
Cloud platforms (AWS GovCloud, SC2S, C2S).
-
Experience with:
-
Infrastructure as Code (Terraform).
-
GraphQL: schema design, API development, query optimization, and integrations.
-
DevSecOps practices and tools.
-
RabbitMQ and Redis.
-
Angular, React, or other modern frontend frameworks.
-
SAFe Agile methodologies.