REMOTE- Data Scientist (Healthcare) || W2

HYR Global Source Inc Logo
HYR Global Source Inc
  • IT
  • FullTime
  • Applications have closed

Job Title: Data Scientist (Healthcare)

Location: 100% Remote

Visa: USC, GC, H1B Transfer, EAD/W2

Interview Process: Recruiter screen (skills & domain fit)

Technical deep dive (SQL, modeling, case study)

Practical exercise (notebook or take-home)

Stakeholder panel (communication & healthcare use-case discussion

About the Role: We’re looking for a hands-on Data Scientist to solve real-world business problems in healthcare/health insurance using machine learning and modern cloud tooling. You’ll own the full lifecycle—from problem framing and data wrangling to modeling, deployment support, and stakeholder storytelling.

Top Skills: Python/R, SQL, Spark/Databricks, Azure ML or Vertex AI, MLflow, Git/GitHub, Jupyter/Databricks notebooks, Power BI/Tableau, Azure/GCP services.

What You’ll Do

  • Partner with business/clinical stakeholders to define measurable use cases (e.g., risk & cost forecasting, readmission/LOS prediction, member churn, fraud/waste/abuse).
  • Explore, cleanse, and engineer tabular data from claims, eligibility, EMR, utilization, and care-management sources.
  • Build and evaluate models using regression, classification, time series, clustering, trees/GBMs; pilot deep-learning where additive value.
  • Productionize with data & MLOps teams: feature pipelines (Spark/Databricks), model packaging, monitoring, drift/decay, and A/B testing.
  • Write clear analyses, dashboards, and exec-ready presentations; translate findings into actions and ROI.
  • Ensure privacy/compliance (HIPAA/PHI), data governance, and reproducible research (git, notebooks, experiment tracking).
  • Contribute to LLM initiatives: prompt engineering, RAG, basic fine-tuning, and experiments with agentic AI frameworks for workflow automation.

Required Qualifications

  • 6–8 years of hands-on data science delivering business impact (healthcare/health insurance experience is a big plus).
  • Strong tabular data handling/manipulation skills.
  • Solid SQL and proficiency in at least one statistical programming language (Python or R).
  • Working knowledge of MS Office (Excel/PowerPoint/Access) for quick analyses and stakeholder comms.
  • Experience with Big Data tools: HDFS, Hive, Spark, MapR-DB (or equivalent).
  • Practical experience with statistical & ML techniques: linear/logistic regression, time series, clustering, decision trees, tree ensembles (XGBoost/LightGBM).
  • Cloud & ML platforms: Databricks, Azure ML and/or Google Vertex AI (pipelines, model registry, deployment workflows).
  • Exposure to prompt engineering, LLM fine-tuning (LoRA/PEFT or platform-native), and agentic AI concepts.
  • Clear, concise communicator; able to convert ambiguity into structured analysis and decisions.
  • BS/MS in Computer Science, Statistics, Applied Math, Engineering, or related field (or equivalent experience).