Backend Software Engineer (Evals) San Francisco
Classified Tasks (16)
Augment (13)
AI assists, human decides
Prototype rapidly while prioritizing long-term quality and reliability when crafting products
technical
Create reusable solutions and patterns that can be applied across diverse domains within OpenAI
technical
Leverage OpenAI technologies (public and pre-released) to implement support automation solutions
technical
Design and build an evals infrastructure that measures the quality of OpenAI’s support automation
technical
Design eval pipelines that are reliable, reproducible, and extendable
technical
Build the infrastructure for continuous eval monitoring frameworks, including regression and drift monitoring
technical
Construct robust golden datasets for use in evals and monitoring
technical
Build feedback loops that strengthen and improve support automation systems
technical
Design, build, and maintain backend services and APIs to support intelligent automation and knowledge systems
technical
Integrate and structure data across internal platforms, transforming it into formats optimized for downstream systems and AI workflows
technical
Own the full development lifecycle of new backend systems and internal platform capabilities
technical
Build backend systems with scale and maintainability in mind while rapidly iterating on new ideas
technical
Build robust systems and backend services that enable creation, access, and application of knowledge across OpenAI
technical
Human-Only (3)
Requires human judgment
Develop an ecosystem of automation products that empower colleagues and drive impact
leadership
Collaborate closely with data science, research, and engineering teams to integrate OpenAI models into high-leverage workflows
communication
Work closely with Data Science and Research partners to design and build evals at scale
communication