We're hiring a Senior Data Engineer to work on our Data Lake Team. Here is what we doing day to day:
What you'll do
- Maintain data pipeline job framework
- Develop Data Quality framework ( internal set of tools for internal and external data sources validation )
- Maintain and develop public facing data ingestion service with 17 000+ RPS.
- Maintain and develop core data pipelines in batch and streaming manners.
- Be a last line of support for our internal platform users.
- Take a part in an on-call rotation for data platform incidents (shared across the team).
What you need
- Fluent English
- 4+ years building production services and data pipelines (batch and/or streaming)
- Strong experience with Python or the readiness to ramp up quickly.
- Hands-on experience with at least one MPP system (Spark, Trino, Redshift etc.)
- Hands-on experience operating services in a cloud environment (AWS preferred)
Why this matters
Your primary focus will be on building and operating various data platform components (data quality, data pipelines, infrastructure, monitoring), with opportunities to contribute to API services and LLM-powered analytics tools. You’ll work closely with data scientists, ML engineers, and analytics teams to understand their needs, gather feedback, and improve platform reliability and usability.