← all roles

Performance & Kernel Engineering Jobs

Wringing performance out of hardware — kernels, compilers, Triton and low-level optimization for ML workloads. 78 open now, refreshed daily.

open roles
78
companies
23
list salary
29 · $120K–$485K
visa mention
18
remote
4

Observed across current open postings, refreshed daily — not a survey. Salary band is drawn only from roles that publish a range. Salary breakdown →

Performance and kernel-engineering roles are the optimization core of this niche — compilers, custom kernels (often Triton), graph-level transforms, and the low-level work of making a given workload run measurably faster on given hardware. It is consistently the largest specialty on this board, which tracks the moment: every accelerator company and serving stack is now competing on efficiency. These roles favor people who profile before they optimize, who are comfortable at the IR/compiler layer, and who treat a benchmark regression as a bug to be bisected.

Hiring most for this specialty: Cerebras Systems 13 · Tenstorrent 10 · Graphcore 9 · Anthropic 8 · OpenAI 8 · d-Matrix 5 · see all who's hiring →

filter
view
78 roles · refreshed 2026-06-01 11:35 UTC