Data Platform Engineer
Apply at source. Cursor handles the application directly; Houtini doesn't take a fee from candidates or companies. We curate which companies appear; the listings come from yubhub.
What the team is looking for.
Our mission is to automate coding. The first step in our journey is to build the best tool for professional programmers, using a combination of inventive research, design, and engineering.
As one of Cursor's first Data Platform Engineers, you'll own the systems that make company-wide data work reliable, secure, and easy to build on. You'll work hands-on across our data lakehouse architecture to support a fast-growing data team and uniquely data-savvy business stakeholders. You'll partner with Data, Product, GTM, and AI research teams to turn messy, repeated data needs into durable infrastructure.
Cursor is already operating at enormous scale, but our data platform is still early. This role is for someone who wants to own the low-level foundations: optimizing TB-scale ingestion, improving resource usage and alerting, codifying access control with infra-as-code, and making pragmatic build-vs-buy decisions across the modern data stack.
Your responsibilities will include:
- Own, operate, and improve Cursor's Databricks and lakehouse infrastructure as the data team size and data volume scales.
- Build and optimize ingestion systems for first-party product data and 3rd-party business systems.
- Ensure observability, alerting, and operational standards across all data infrastructure layers.
- Evaluate and roll out data tooling where it solves real stakeholder needs, including BI platforms, catalogs, ingestion tools, and reverse ETL systems.
- Partner with technical and non-technical partners to understand recurring data problems and turn them into scalable platform solutions.
We're looking for someone with 4+ years of full-time data platform engineering experience, who has built up modern data stacks at a low level, not just written jobs on top of it. Experience with Dagster is a strong plus.
- data platform engineering
- Dagster
- modern data stacks
- TB-scale ingestion
- resource usage and alerting
- access control with infra-as-code
- build-vs-buy decisions
Other roles you might consider.
Filtered through the same AI-companies allowlist.
Member of Technical Staff (Software Engineer, API Platform)
Perplexity
Member of Technical Staff (AI Software Engineer, Agents)
Perplexity
Field Reporting Insights Manager
Anthropic
Research Engineer, Code RL (Reinforcement Learning)
Anthropic
Product Manager, Safeguards Rare Harms
Anthropic
Product Manager, GTM Experiences
Anthropic
New to AI work? Start with these.
Six pieces of orientation. Most AI-company job specs assume you've done this kind of hands-on work already. If you haven't, an afternoon with one of these is the cheapest way to close the gap.
Claude Desktop, from zero.
The agentic-AI assistant most of the people you'd be working alongside use every day. Install, configure, first useful prompts.
What MCPs areThe best MCPs for Claude Desktop.
MCP servers extend an AI assistant with tools and data. The catalogue most teams use. Useful technical context for any AI-engineering role.
Code with AIClaude Code, the complete beginners' guide.
The CLI for AI-paired development. Required reading if you're applying for any engineering role that mentions agents, or any role full stop.
Run a local modelHow to set up LM Studio.
Running a model on your own machine teaches you more about how AI products work in three hours than a year of using ChatGPT will.
The hardware realityBeginner's guide to AI hardware.
What the infrastructure under the model actually looks like. Useful context for infrastructure, applied-AI and hardware roles.
Browse the stackMCP catalogue.
Eleven MCP servers Houtini maintains or recommends. Each detail page describes a real piece of working AI infrastructure.