Job Description
At Zyte, we make the world’s web data accessible to everyone. Our technology powers data extraction at scale, helping businesses and researchers unlock the full potential of the web.
We’re a remote-first, multicultural team of engineers, data scientists, and innovators who believe in curiosity, collaboration, and continuous learning. If you’re passionate about building reliable AI systems and improving the quality of web data, we’d love to hear from you.
About the Role
As a Machine Learning Engineer (Web Data Quality), you’ll design and implement intelligent systems that automatically detect, measure, and improve the quality of large-scale web datasets. You’ll work at the intersection of data science, AI, and distributed systems, collaborating closely with product, engineering, and data teams to make data accuracy measurable, scalable, and actionable.
Requirements
What You’ll Do
- Develop and deploy ML models for anomaly detection, schema drift, and content validation
- Build and improve data quality pipelines leveraging modern data and MLOps tools
- Design and optimize embeddings and GenAI models to enhance data consistency
- Collaborate with engineers to integrate AI systems into production workflows
- Conduct experiments, evaluate performance, and iterate for continuous improvement
- Stay up to date on AI/ML and GenAI research to guide innovation within Zyte
Required
- 3+ years of experience in Machine Learning / Data Science / AI Engineering
- Strong Python skills and experience with ML frameworks (PyTorch, TensorFlow, scikit-learn)
- Experience with data validation, anomaly detection, or data quality systems
- Familiarity with data pipelines (Airflow, Spark, or similar)
- Understanding of model evaluation, metrics, and deployment best practices
- Excellent problem-solving, communication, and collaboration skills
Preferred
- Experience with LangChain, LlamaIndex, or GenAI model orchestration
- Familiarity with data labeling tools and active learning approaches
- Contributions to open-source or public ML projects
- Experience working in a remote, cross-functional team environment
Similar Jobs
Head of Product (SaaS) - Remote
Zyte
Core & ML Ops Team Lead - Remote
Zyte
Platform Team Lead - Remote
Zyte
Senior Product Designer - Remote
Zyte
Monitoring Engineer - Remote
Zyte
Senior Java Engineer - Remote
Zyte
Sr. Growth Marketing Manager - Remote
Zyte
AI/ML Engineer - Web Data Quality - Remote
Zyte
Staff Engineer - Remote
Zyte
Training and Development Specialists - Contract (Remote)
Fixpoint
Transportation, Storage, and Distribution Managers - Contract (Remote)
Fixpoint
Telephone Operators - Contract (Remote)
Fixpoint
Receptionists and Information Clerks - Contract (Remote)
Fixpoint
Production, Planning, and Expediting Clerks - Contract (Remote)
Fixpoint
Occupational Health and Safety Technicians - Contract (Remote)
Fixpoint
Disclaimer: Real Jobs From Anywhere is an independent platform dedicated to providing information about job openings. We are not affiliated with, nor do we represent, any company, agency, or agent mentioned in the job listings. Please refer to our Terms of Services for further details.
