Data Engineer, Web Scraping

Sundayy Logo
  • IT
  • FullTime

About The Company 10a Labs is a leading provider of safety and threat‑intelligence solutions trusted by frontier AI labs, AI unicorns, Fortune 10 companies, and prominent global technology platforms. Our core expertise lies in adversarial red teaming, model evaluations, and intelligence collection, enabling engineering, safety, and security teams to stay ahead of rapidly evolving threats. We are committed to helping organizations deploy AI systems securely and responsibly by providing cutting-edge insights and robust security measures. Our innovative approach combines advanced technology with strategic intelligence to support our clients in maintaining their competitive edge and ensuring the safety of their AI deployments.

About The Role We are seeking a skilled Data Engineer to join our dynamic team. In this role, you will be responsible for designing, implementing, and optimizing end‑to‑end data pipelines that facilitate the collection, processing, and analysis of both structured and unstructured data. Your work will primarily involve leveraging cloud platforms, such as Google Cloud Platform, to develop scalable and efficient data solutions. You will conduct ad hoc web scraping and data collection activities to support research and intelligence initiatives, ensuring the data is prepared for subsequent analysis through cleaning, transformation, anonymization, and masking processes. Additionally, you will contribute to the development of internal and external APIs, adhering to best practices, and collaborate closely with machine learning engineers, data scientists, and software developers to deliver actionable insights, dashboards, APIs, and data dumps. Your efforts will be critical in driving key initiatives that enhance our intelligence capabilities and operational efficiency.

Qualifications

  • Degree in Computer Science, Engineering, Information Science, Data Science, or a related field (graduate degree preferred)
  • Minimum of 2+ years of professional experience in data engineering or a closely related area
  • Proficiency in Python and SQL programming languages
  • Hands-on experience with web scraping and crawling tools such as Beautiful Soup, Selenium, or Scrapy
  • Experience working with cloud platforms, especially Google Cloud Platform, including storage and database services like Cloud Storage, CloudSQL, and Cloud Spanner
  • Knowledge of workflow orchestration tools such as Cloud Composer, Airflow, Cloud Run, and Pub/Sub
  • Experience building and managing data pipelines, particularly for text data
  • Strong ability to communicate complex technical concepts clearly to non-technical stakeholders
  • Ability to thrive in fast-paced, high-impact environments such as startups, AI research labs, or security-focused teams

Responsibilities

  • Design, implement, and optimize end-to-end data pipelines for web scraping and data processing tasks
  • Conduct ad hoc web scraping and data collection activities to support research and intelligence initiatives
  • Prepare raw data for analysis through cleaning, transformation, anonymization, and masking techniques
  • Develop and maintain internal and external APIs following industry best practices
  • Collaborate with machine learning engineers, data scientists, and software developers to deliver actionable insights and functional tools
  • Create dashboards, data dumps, and APIs to facilitate data-driven decision-making
  • Drive critical initiatives related to data collection, processing, and security
  • Ensure data pipelines are scalable, reliable, and optimized for performance

Benefits

  • Competitive salary range of $105K–$125K, commensurate with experience and location
  • Performance-based annual bonus
  • Support for professional development including conferences, continuing education, and leadership training
  • Fully remote work environment based in the U.S.
  • Comprehensive health, dental, and vision insurance coverage
  • Generous paid time off and holiday schedule
  • Retirement plan with 401(k) options

Equal Opportunity

10a Labs is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We do not discriminate based on race, ethnicity, gender, sexual orientation, age, disability, religion, or any other protected characteristic. We believe that diverse teams foster innovation and drive success, and we welcome applicants from all backgrounds to apply.