Data Engineer

Weekday AI
India
On-site
Full-time
Posted 29 days ago

Job Description

This role is for one of Weekday's clients

Min Experience: 3 years

Location: Remote (India)

Job Type: Full-time

We are looking for a highly skilled and motivated Data Engineer to join our Development POD focused on integration projects. In this role, you will design, build, and maintain scalable data pipelines to ingest, clean, transform, and integrate diverse public datasets into a knowledge graph. The ideal candidate will have strong expertise in GCP, BigQuery, and Python, along with a solid grasp of data modeling, data quality, and scalability principles.

Requirements

Key Responsibilities

  • Design and develop robust, scalable data pipelines for ingestion, transformation, and integration of structured and unstructured data.
  • Perform comprehensive data wrangling, cleaning, and transformation across multiple sources and formats (APIs, CSV, XLS, JSON, etc.).
  • Work with BigQuery, Dataflow (Apache Beam), and Google Cloud Storage (GCS) to manage large-scale data workflows.
  • Implement and maintain data validation and quality assurance processes.
  • Contribute to data modeling and schema design, especially for knowledge graph development (Schema.org, RDF, SPARQL, JSON-LD).
  • Collaborate within Agile teams to deliver scalable, reliable, and efficient data solutions.
  • Apply CI/CD practices (e.g., Cloud Build) to ensure seamless development and deployment workflows.

Core Competencies

Must Have:

  • Proficiency in SQL and Python.
  • Experience with BigQuery, GCS, and GCP Dataflow / Apache Beam.
  • Proven ability to handle complex data transformations across diverse data formats.

Secondary Skills:

  • Knowledge of SPARQL, Schema.org, Apigee, and Cloud Data Fusion.
  • Experience in data modeling, knowledge graph design, and Agile development.
  • Familiarity with CI/CD tools such as Cloud Build.

Preferred Qualifications

  • Exposure to LLM-based tools or techniques for data automation (e.g., auto-schematization).
  • Experience with large-scale public dataset integration projects.
  • Understanding of multilingual data integration workflows.

Skills

GCP | Google BigQuery | GCS | Python | SQL | Dataflow | Apache Beam | Data Modeling | Knowledge Graphs | Cloud Data Fusion | SPARQL | CI/CD
