Back to Careers
Hardware Performance Engineer
San Francisco, CA (In-Person)
General Diffusion is a foundational AI research lab establishing the scientific discipline of Compute Intelligence. We build frontier models that learn the physics of heterogeneous hardware, decoupling intelligence from infrastructure.
<br/>About the role
As a Hardware Performance Engineer, you will build HP1 (Hardware Profiler Agent), the sensory system of our OS. You will write the low-level probes that interrogate silicon for its true capabilities, bypassing marketing spec sheets to find the raw limits of memory bandwidth, FLOPs, and thermal headroom.
<br/>What you might work on
- Writing micro-benchmarks to reverse-engineer the undocumented behavior of new accelerators.
- Building the HP1 telemetry pipeline to stream real-time metrics from thousands of devices.
- Analyzing performance regressions across different driver versions and firmware updates.
- Collaborating with vendors (AMD, Intel, Cerebras) to debug silicon errata.
- Optimizing the "cold start" time of our profiling suite to run in milliseconds.
What we’re looking for
- Low-level systems programming experience (C, C++, Assembly).
- Experience with performance analysis tools (Nsight Systems, rocprof, VTune).
- Understanding of PCIe, NVLink, and interconnect topologies.
- Fearlessness in the face of undocumented hardware and closed-source drivers.
- A hacker mindset—you enjoy breaking things to see how they work.
Our culture
- Compute Intelligence. We are establishing a new scientific discipline.
- Silicon Neutrality. We build foundational models that run on any chip.
- Deep Work. We value long periods of uninterrupted focus over endless meetings.
