Nuvepro - Task Intelligence for the Enterprise
OpenAI

Cpu Storage Tech Lead San Francisco

Comp$342K – $555K

Classified Tasks (17)

Automate 0%Augment 53%Human-Only 47%

Augment (9)

AI assists, human decides

Evaluate vendor roadmaps and CPU platforms across performance, efficiency, memory bandwidth, PCIe topology, cost, and roadmap alignment.

analytical

Ensure compute and storage systems are optimized for training, inference, and supporting services.

technical

Perform component-level analysis while driving long-range infrastructure technical decisions.

analytical

Define storage architectures for AI environments, including boot media, local NVMe, shared storage, caching tiers, metadata services, and high-performance data pipelines.

technical

Partner with performance modeling teams to quantify tradeoffs and identify compute, memory, I/O, and storage bottlenecks.

analytical

Partner with networking and cluster architecture teams to optimize end-to-end node design and data movement.

technical

Support supply chain and sourcing teams by conducting technical vendor assessments and defining second-source strategies.

operational

Drive reliability, serviceability, and fleet lifecycle planning for compute and storage platforms.

operational

Translate future AI workload requirements into infrastructure platform specifications.

technical

Human-Only (8)

Requires human judgment

Define and drive the server compute and storage architecture strategy for Stargate infrastructure.

leadership

Own technical direction across CPU platforms, memory configurations, local and disaggregated storage systems, and their integration into large-scale AI clusters.

technical

Lead platform tradeoff decisions across compute, memory, I/O, and storage dimensions.

leadership

Collaborate with hardware engineering, performance modeling, networking, supply chain, and deployment teams and external partners (AMD, Intel, OEMs, ODMs, storage vendors) to integrate platforms.

communication

Drive server platform decisions involving CPU, memory, NIC, GPU, and storage subsystem integration.

technical

Work with silicon and hardware vendors to influence roadmaps, submit feature requests, define qualification plans, and manage technical escalations.

communication

Lead bring-up and validation efforts for new CPU and storage platforms in lab and production environments.

operational

Lead technical reviews and provide technical leadership across cross-functional stakeholders and executive reviews.

leadership

Job description

--- BEGIN UNTRUSTED EXTERNAL CONTENT (source: https://openai.com/careers/cpu-storage-tech-lead-san-francisco/) --- Skip to main contentResearchProductsBusinessDevelopersCompanyFoundation(opens in a new window)Log inTry ChatGPT(opens in a new window)ResearchProductsBusinessDevelopersCompanyFoundation(opens in a new window)CPU Storage Tech Lead | OpenAICareersCPU Storage Tech LeadHardware - San Francisco and SeattleApply now(opens in a new window)About the TeamThe Stargate team is responsible for building the physical infrastructure that powers large-scale AI systems. We design and deliver next-generation data centers optimized for dense compute clusters, advanced networking, and rapidly evolving hardware platforms.This work sits at the intersection of hardware engineering, systems architecture, and infrastructure execution—translating cutting-edge compute roadmaps into scalable, production-ready environments.Our teams partner across silicon vendors, server and storage OEMs, networking teams, and data center engineering organizations to bring new capacity online quickly, reliably, and at global scale.About the RoleWe are seeking a CPU & Storage Technical Lead to define and drive the server compute and storage architecture strategy for Stargate infrastructure.In this role, you will own technical direction across CPU platforms, memory configurations, local and disaggregated storage systems, and their integration into large-scale AI clusters. You will evaluate vendor roadmaps, lead platform tradeoff decisions, and ensure compute and storage systems are optimized for training, inference, and supporting services.You will work cross-functionally with hardware engineering, performance modeling, networking, supply chain, and deployment teams, as well as external partners such as AMD, Intel, OEMs, ODMs, and storage vendors.This is a highly strategic role for someone who can operate deeply at the component level while also driving long-range infrastructure decisions.Key ResponsibilitiesOwn CPU and storage technical strategy for Stargate compute infrastructure across current and future generations.Evaluate CPU platforms across performance, efficiency, memory bandwidth, PCIe topology, cost, and roadmap alignment.Define storage architectures for AI environments, including boot media, local NVMe, shared storage, caching tiers, metadata services, and high-performance data pipelines.Drive server platform decisions involving CPU, memory, NIC, GPU, and storage subsystem integration.Partner with performance modeling teams to quantify tradeoffs across compute, memory, I/O, and storage bottlenecks.Work with silicon and hardware vendors on roadmap influence, feature requests, qualification plans, and technical escalations.Lead bring-up and validation efforts for new CPU and storage platforms in lab and production environments.Partner with networking and cluster architecture teams to optimize end-to-end node design and data movement.Support supply chain and sourcing teams with technical vendor assessments and second-source strategies.Drive reliability, serviceability, and fleet lifecycle planning for compute and storage platforms.Translate future AI workload requirements into infrastructure platform specifications.Provide technical leadership across cross-functional stakeholders and executive reviews.QualificationsBachelor’s degree in Computer Engineering, Electrical Engineering, Computer Science, or related technical field; advanced degree preferred.10+ years of experience in server hardware, systems architecture, data center infrastructure, or hyperscale compute platforms.Deep expertise in modern CPU architectures (x86, ARM, accelerator host systems) and server platform design.Strong understanding of memory systems, PCIe/CXL fabrics, NUMA behavior, and platform-level performance constraints.Experience with storage systems including NVMe, SSD qualification, RAID, distributed storage, object/file systems, or high-performance data pipelines.Experience
Source: OpenAI careers · scraped 2026-05-22
Apply at OpenAI