About The Role
As a Staff Product Engineer on Replit’s Product Platform team, you’ll build the shared product systems and primitives that power Replit’s core experiences — enabling product teams to ship faster and helping users (and agents) build better software.
What you’ll do
-
Lead major cross-team platform initiatives, taking foundational systems from 0 → 1 and scaling them to support millions of users
-
Build shared, extensible Agent primitives that Replit Agent can reuse safely and consistently (Meta Programming)
-
Identify the highest-leverage technical bottlenecks (performance, reliability, correctness, abuse, observability), then design and ship solutions for our scale
-
Raise the bar for engineering excellence through architecture reviews, code quality, reliability standards, and mentorship
-
Partner across teams to improve platform adoption, ergonomics, and velocity — turning platform work into measurable outcomes
Core areas you’ll work on
-
Agents and Replit users depend on us to build applications (e.g. Connectors framework, Content/configuration primitives (CMS + product surfaces), Data/analytics/events + experimentation primitives)
-
Replit Agent as a principal in third party systems. Agent can be fully used within ChatGPT and publishes straight to the iOS app store. We’ll be doing loads of that.
-
Platform product teams rely on us to ship consistently (e.g. Identity & Access platform (SSO/SCIM), Localization/i18n platform, Notifications & communications platform)
-
Core web platform infrastructure (e.g. performance & page load optimization, observability and debugging workflows, caching strategy and reliability)
Required skills and experience
-
7+ years of professional software engineering experience
-
Understanding of the full agentic software development stack, helping coding agents build, test and review correct code.
-
Strong track record leading complex projects with cross-functional stakeholders
-
Experience building and operating platform systems that other teams depend on
-
Experience operating and scaling systems in production (reliability, performance, incidents, on-call readiness)
-
Strong product judgment: you can balance UX, speed, correctness, and long-term maintainability
-
Comfort working in modern web stacks such as TypeScript, React, Node.js, Postgres
Bonus points
-
Experience working in environments with a high engineering bar (or a fast-growing startup where you shipped fast without burning out quality)
-
Experience with platform and distributed systems patterns (queues, workflows, caching, rate limiting, async processing)
-
Familiarity with systems like Redis, Postgres, Workflow engines (e.g. Temporal), Auth and enterprise identity (SSO, SCIM), Abuse protection and edge systems (Cloudflare), Cloud platforms (GCP), Observability (Datadog, Sentry), Localization, Experimentation and event pipelines (Statsig, Segment, analytics/event tracking)
Example Projects You’ll Work On
-
Connectors platform for agents — ship a secure connector framework (OAuth/permissions/data access) so agents can integrate with Slack/Notion/GitHub/etc.
-
Agent-facing external surfaces — own high-quality embedded experiences (desktop/extension/embeds) that let agents act in-context across tools
-
Safety + abuse controls for agent actions — design permissioning, rate limits, and policy enforcement so agents can operate safely at scale
-
Real-time notifications platform — design in-app/email surfaces + build reliable delivery/fanout, preferences, and observability
-
Core web platform performance + caching — improve latency and reliability via caching strategy (Redis), profiling, and safe fallbacks
-
Events + experimentation primitives — standardize tracking/metrics + feature flags/rollouts so teams can ship safely and measure impact
XML job scraping automation by YubHub