We are seeking a talented and experienced software engineer to join our Research Platform team. As a DevOps/Platform Engineer, your responsibilities will include designing and implementing complex systems, building flexible yet solid and accessible development environment for researchers, designing, implementing and advocating for solutions addressing large amounts of data and maintainable data pipelines, optimizing a variety of builds, building strong relationships with researchers, communicating and producing documentation or any content that will help them to make the most out of the tools and systems you'll build, and being part of the team that "platformizes" research and constantly improve the daily experience for researchers while avoiding future roadblocks.
What you'll do
As a DevOps/Platform Engineer, your responsibilities will include (but may not be limited to):
-
Designing and implementing complex systems (e.g. scale our research CI with a strong focus toward reliability, reproducibility and speed)
-
Building flexible yet solid and accessible development environment for researchers, so they can focus on core mission.
-
Designing, implementing and advocating for solutions addressing large amounts of data and maintainable data pipelines.
-
Optimizing a variety of builds: container images, large libraries compilation times, python environments…
-
Building strong relationships with researchers, understanding their workflow and enabling them to achieve more by leveraging your expertise.
-
Communicating and producing documentation or any content that will help them to make the most out of the tools and systems you'll build.
-
Being part of the team that "platformizes" research and constantly improve the daily experience for researchers while avoiding future roadblocks.
What you need
-
5+ years of successful experience in a similar DX / DevOps / SRE role.
-
Proficiency in software development (Python, Go…) and programming best practices.
-
Exposure to site reliability engineering: root cause analysis, in-production troubleshooting, on-call rotations…
-
Exposure to infrastructure management: CI/CD, containerization, orchestration, infra-as-code, monitoring, logging, alerting, observability…)