General Diffusion

Hardware Performance Engineer

San Francisco, CA (In-Person)

General Diffusion is a foundational AI research lab establishing the scientific discipline of Compute Intelligence. We build frontier models that learn the physics of heterogeneous hardware, decoupling intelligence from infrastructure.


About the role

As a Hardware Performance Engineer, you will build HP1 (Hardware Profiler Agent), the sensory system of our OS. You will write the low-level probes that interrogate silicon for its true capabilities, bypassing marketing spec sheets to find the raw limits of memory bandwidth, FLOPS, and thermal headroom.


What you might work on

  • Writing micro-benchmarks to reverse-engineer the undocumented behavior of new accelerators.
  • Building the HP1 telemetry pipeline to stream real-time metrics from thousands of devices.
  • Analyzing performance regressions across different driver versions and firmware updates.
  • Collaborating with vendors (AMD, Intel, Cerebras) to debug silicon errata.
  • Reducing the "cold start" time of our profiling suite to milliseconds.

What we’re looking for

  • Low-level systems programming experience (C, C++, Assembly).
  • Experience with performance analysis tools (Nsight Systems, rocprof, VTune).
  • Understanding of PCIe, NVLink, and interconnect topologies.
  • Fearlessness in the face of undocumented hardware and closed-source drivers.
  • A hacker mindset: you enjoy breaking things to see how they work.

Our culture

  • Compute Intelligence. We are establishing a new scientific discipline.
  • Silicon Neutrality. We build foundational models that run on any chip.
  • Deep Work. We value long periods of uninterrupted focus over endless meetings.

