We're seeking experienced software engineers to create robust data pipelines, comprehensive evaluations for benchmarking LLMs, and automation frameworks to increase the productivity of researchers and engineers.
Typical problems you will deal with include designing efficient and robust environments for AI agents, improving evaluations and observability, onboarding new evaluation datasets, standardizing preprocessing pipelines, and creating data augmentation pipelines.
Responsibilities include creating and maintaining frameworks for agent, data, and model evaluation tasks; building environments for AI agents and tools for automating common workflows; improving alerts, metrics, and error handling on large-scale RL jobs; refactoring existing frameworks for better modularity; and designing operating procedures and coding standards.
Basic qualifications include experience building and maintaining frameworks; building high-performance sandboxes, virtual machines, and simulations; building full-stack apps for workflow automation and data visualization; iterating rapidly through research-to-production cycles; and working with test automation and CI/CD.
The base salary range is $180,000 – $440,000 USD. Our total rewards package also includes equity; comprehensive medical, vision, and dental coverage; access to a 401(k) retirement plan; short- and long-term disability insurance; life insurance; and various other discounts and perks.