AI Deployment Engineer, Startups
Apply at source. OpenAI handles the application directly; Houtini doesn't take a fee from candidates or companies. We curate which companies appear; the listings come from yubhub.
What the team is looking for.
About the team
The AI Deployment Engineering team works closely with frontier startups. We are trusted advisors to, and thought partners with, startups to ensure that OpenAI’s technology is deployed safely and effectively, whilst also partnering with engineering, research, and product to turn those insights into evaluation systems, product improvements, and better model behavior.
This team sits at the intersection of customer reality and model quality. We combine hands-on technical depth with strong product judgment, helping translate complex, high-value use cases into clear signals that can improve both the customer experience and the underlying systems.
About the role
We are seeking a technically proficient, product-minded engineer to help push the frontier of advanced AI with our strategic startup customers. You'll work with some of the most exciting AI startups in the world, helping them optimize their own systems and turning those learnings into durable improvements across OpenAI’s research and products. You will partner deeply on complex workflows, identify the gaps that matter, and help transform those gaps into reproducible evaluations, technical insights - helping shape OpenAI's research and product direction.
This role is well suited to engineers who are equally comfortable debugging a workflow, iterating on prompts or agents, designing evaluations, and collaborating across research and product. You should be excited by ambiguous, high-impact problems and motivated by the opportunity to shape how advanced AI systems improve in practice.
This role is based in Stockholm.
Responsibilities
- Work directly with strategic startup customers to understand critical workflows, uncover failure modes, and identify high-impact opportunities for improvement.
- Prototype and iterate on prompts, agents, and workflow designs to better understand system behavior and unlock customer value.
- Synthesize and deliver valuable feedback to the Product and Research teams, turning real usage patterns into clear, reproducible evals, benchmarks, and technical artifacts that improve model and product quality and ensure customer-grounded learnings influence roadmap and model development.
- Build repeatable tools, patterns, and evaluation approaches that raise the quality bar across multiple use cases.
- Operate with strong judgment in ambiguous environments, balancing immediate technical problem-solving with longer-term system improvement.
- Build relationships within the startup ecosystem, serving as a technical partner to both individual customers and the broader community.
Requirements
- Have strong software engineering & AI fundamentals. For example, experience as a startup CTO, software engineer, ML engineer, Data Scientist or equivalent. Experience shipping production systems end-to-end is a strong plus.
- Have experience as a technical founder, or engineer at an early stage startup
- Have familiarity with, or interest in, model training pipelines and reinforcement learning.
- Have experience building AI applications, agents, or evaluation systems, and can reason clearly about model behavior in complex workflows.
- Are comfortable working directly with highly technical users and translating their challenges into concrete technical signals.
- Can move fluidly between prototyping, debugging, evaluation design, and cross-functional collaboration.
- Communicate clearly across technical and non-technical audiences.
- Bring high agency, strong product sense, and a bias toward building durable improvements rather than one-off fixes.
- software engineering
- AI fundamentals
- model training pipelines
- reinforcement learning
- AI applications
- agents
- evaluation systems
- experience as a startup CTO
- software engineer
- ML engineer
- Data Scientist
- technical founder
- engineer at an early stage startup
Other roles you might consider.
Filtered through the same AI-companies allowlist.
Senior Site Reliability Enigneer
Synthesia
Platform Security Engineering - OpenBMC
Anthropic
Systems Engineer, HPC (APAC)
Mistral AI
Maintenance Planner
xAI
Staff Software Engineer, Developer Productivity (Dev Environments) - Claude Code
Anthropic
Staff Software Engineer, Developer Productivity (CI/CD) - Claude Code
Anthropic
New to AI work? Start with these.
Six pieces of orientation. Most AI-company job specs assume you've done this kind of hands-on work already. If you haven't, an afternoon with one of these is the cheapest way to close the gap.
Claude Desktop, from zero.
The agentic-AI assistant most of the people you'd be working alongside use every day. Install, configure, first useful prompts.
What MCPs areThe best MCPs for Claude Desktop.
MCP servers extend an AI assistant with tools and data. The catalogue most teams use. Useful technical context for any AI-engineering role.
Code with AIClaude Code, the complete beginners' guide.
The CLI for AI-paired development. Required reading if you're applying for any engineering role that mentions agents, or any role full stop.
Run a local modelHow to set up LM Studio.
Running a model on your own machine teaches you more about how AI products work in three hours than a year of using ChatGPT will.
The hardware realityBeginner's guide to AI hardware.
What the infrastructure under the model actually looks like. Useful context for infrastructure, applied-AI and hardware roles.
Browse the stackMCP catalogue.
Eleven MCP servers Houtini maintains or recommends. Each detail page describes a real piece of working AI infrastructure.