We are seeking Data Research Engineers to join our Multimodal team, where we are building the next generation of foundation models across vision, language, audio, and beyond. If you are passionate about designing and curating high-quality datasets to power frontier AI models, this role is for you. In this role, you’ll work at the intersection of data and innovation—collaborating with scientists, engineers, and annotators to curate, analyze, and evaluate diverse multimodal data sources critical to model development. You will lead efforts to:
Develop novel data collection strategies
Improve dataset quality and integrity
Understand data-driven model behaviors
Align datasets with ethical and societal values
This is a cross-disciplinary, high-impact role ideal for engineers who want to push the boundaries of what AI can learn from data, especially in multimodal contexts.
XML job scraping automation by YubHub