JOB DESCRIPTION – ML Data Engineer
Role
This role supports machine learning data pipelines and backend data
workflows, with a focus on data engineering for ML. The engineer will work
closely with the Data Science and ML Engineering teams to build, process,
and serve the data required for ML training and inference.
Key Responsibilities
• Build and maintain scalable data pipelines for ML workflows
• Implement ETL and large-scale data processing using Spark (Scala or
Python)
• Support feature data preparation, transformation, and validation
• Assist with ML pipeline operationalization (batch/stream ingestion, data
readiness)
• Develop backend components for data and ML services
• Partner with ML engineers to understand data dependencies and
performance needs
Required Skills
• Strong hands-on experience with either Scala/Spark or Python/Spark
• Solid programming experience in Python and/or Scala
• Good understanding of data engineering concepts and large-scale data
processing
• Exposure to machine learning workflows and data requirements for ML
models
• Experience with backend development and data-driven services
Nice to Have
• Prior experience working closely with Data Science or ML teams
• Familiarity with ML concepts, feature engineering, or model pipelines
• Experience with distributed systems and large datasets