We are looking for a Senior Product Manager to drive Copilot model capabilities such as tool use, ensuring that the language models that power Microsoft Copilot deliver high-quality responses to our users whilst being grounded, reliable, and cost-efficient.
As a Senior Product Manager, you will work at the nexus of product and research, driving execution in partnership with engineers, language engineers, data scientists and researchers. You will develop and execute an LLM platform strategy for Copilot that extends language models' capabilities. You will prototype approaches by steering language models to drive response quality across a wide range of scenarios. You will identify and prioritise platform, orchestration and language model issues that impact quality, factuality and safety, and work with engineers and researchers to find a path to resolution.
You will define and build measurable evaluations with relevant datasets to demonstrate quality improvements. You will define, deploy and manage experiments in production that impact language models' tool use, driving measurable improvements in relevance for, and engagement with, Copilot users. You will partner with product teams to scale tool building and work with inference, agents and orchestration teams to resolve dependencies. You will be accountable for the status of key projects, proactively identifying risks and proposing solutions to ensure timely delivery.
Responsibilities include:
- Developing and executing an LLM platform strategy for Copilot that extends language models' capabilities
- Prototyping approaches by steering language models to drive response quality across a wide range of scenarios
- Identifying and prioritising platform, orchestration and language model issues that impact quality, factuality and safety, and working with engineers and researchers to find a path to resolution
- Defining and building measurable evaluations with relevant datasets to demonstrate quality improvements
- Defining, deploying and managing experiments in production that impact language models' tool use, driving measurable improvements in relevance for, and engagement with, Copilot users
- Partnering with product teams to scale tool building and working with inference, agents and orchestration teams to resolve dependencies
- Being accountable for the status of key projects, proactively identifying risks and proposing solutions to ensure timely delivery