Data Scientist (Remote - US)

Jobgether
United States
On-site
Full-time
Posted 19 days ago

Job Description

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Data Scientist in the United States.

This role offers the opportunity to lead the development of advanced machine learning and NLP systems that analyze complex clinical data to improve healthcare risk adjustment and ICD-10 coding accuracy. You will design, implement, and optimize predictive models, transformer-based architectures, and decision support systems, applying deep learning to large-scale clinical text datasets. The position emphasizes collaboration with engineering teams to deploy solutions into production while maintaining rigorous model evaluation and monitoring. You will also mentor junior team members, contribute to documentation and best practices, and leverage AI techniques to enhance data-driven decision-making. This is a high-impact role that allows you to directly influence healthcare analytics processes and operational efficiency.

Accountabilities

  • Lead development of NLP-based classification systems for ICD-10 code identification from clinical charts and encounters.
  • Design and implement deep learning models using PyTorch and transformer architectures for medical text analysis.
  • Build and optimize predictive models for healthcare risk adjustment and HCC coding.
  • Develop decision support systems for automated ICD-10 code suggestion and validation.
  • Create and maintain feature engineering pipelines for clinical text processing and model training.
  • Implement model evaluation metrics and performance optimization strategies.
  • Conduct system health checks and performance monitoring for deployed models.
  • Collaborate with engineering teams to integrate ML/NLP solutions into production workflows.
  • Mentor junior scientists and analysts, promoting best practices in ML, NLP, and AI.
  • Produce technical documentation and training materials to support knowledge sharing.

Requirements

  • Bachelor’s degree in Computer Science, Statistics, Mathematics, Data Science, or a related field; Master’s preferred.
  • 3+ years of experience in machine learning and AI, with a focus on NLP, classification, and predictive systems.
  • Strong hands-on expertise in PyTorch and transformer architectures (BERT, RoBERTa, etc.) for text classification.
  • Advanced Python programming skills and experience with NLP frameworks and libraries.
  • Preferred: experience with clinical text processing, healthcare claims data, ICD-10 coding, and risk adjustment/HCC methodologies.
  • Proficiency in feature engineering, model evaluation, deployment, and monitoring in production environments.
  • Understanding of data quality frameworks and error detection methods for clinical datasets.
  • Strong analytical, problem-solving, and communication skills, including the ability to explain technical concepts clearly.
  • Experience with SQL and collaborative development tools, including version control systems.
