Who's hiring GPU, inference & ML-systems engineers right now
38 AI labs and infrastructure companies have 178 open GPU, CUDA, ML-systems, inference and performance engineering roles between them. Ranked by openings, refreshed daily from each company's own public job feed.
- open roles
- 178
- companies
- 38
- list salary
- 78 · $109K–$850K
- visa mention
- 42
- remote
- 17
Observed across current open postings, refreshed daily — not a survey. Salary band is drawn only from roles that publish a range. Salary breakdown →
The useful question for someone in this field usually isn't “what jobs exist” — a search box answers that — but which companies are actually staffing up, and for what. That is what this page tracks. The shape of the demand is fairly consistent: dedicated AI-silicon companies (the Graphcores and Tenstorrents of the list) skew heavily toward performance, compiler and kernel work, because their whole product is making specific hardware fast. GPU-cloud and infrastructure providers (CoreWeave, Nebius and similar) hire for the systems and reliability side — operating accelerator fleets, not designing them. Frontier labs concentrate in inference and ML-systems, where the bottleneck is serving large models efficiently and training them across thousands of GPUs.
Two numbers worth reading off the table: how many of a company's roles disclose a salary band (a rough proxy for hiring maturity and US-comp transparency norms) and how many mention visa sponsorship or relocation — the single most decision-relevant fact for an engineer hiring across borders. Counts move as companies open and close postings; this is a snapshot of the current open set, not a survey.
| Company | Open | GPU | ML-sys | Inf | Perf | Rem | Visa | $ |
|---|---|---|---|---|---|---|---|---|
| Cerebras Systems | 22 | · | 2 | 9 | 13 | 3 | · | 1 |
| Anthropic | 19 | 2 | 1 | 11 | 8 | · | 19 | 15 |
| Tenstorrent | 13 | · | · | 1 | 10 | · | · | 8 |
| CoreWeave | 11 | 2 | · | 4 | 2 | · | 1 | 9 |
| Databricks | 11 | · | · | 7 | 4 | 1 | · | 11 |
| Graphcore | 11 | · | · | · | 9 | · | 4 | · |
| Nebius | 11 | 5 | 1 | 1 | 1 | 9 | · | 5 |
| OpenAI | 10 | · | 1 | 2 | 8 | · | 5 | 1 |
| Together AI | 7 | 2 | · | 4 | 1 | · | · | 6 |
| d-Matrix | 5 | · | · | · | 5 | · | · | · |
| Inworld AI | 5 | · | · | 5 | · | · | 5 | 1 |
| SambaNova | 4 | · | 1 | 1 | 4 | · | · | 1 |
| Baseten | 3 | · | · | 3 | · | · | · | · |
| Cursor | 3 | · | · | 2 | 1 | · | · | · |
| Fal | 3 | · | · | · | 1 | 1 | 1 | 1 |
| FriendliAI | 3 | 1 | · | 2 | 1 | · | · | · |
| MatX | 3 | · | · | · | 2 | · | · | 3 |
| Scale AI | 3 | · | 2 | 1 | · | · | · | 3 |
| xAI | 3 | · | 1 | 1 | · | · | · | 2 |
| Applied Intuition | 2 | · | 1 | · | 1 | · | · | 2 |
| Cohere | 2 | · | · | 1 | 1 | · | · | · |
| EnCharge AI | 2 | · | · | 1 | 1 | 1 | · | 1 |
| Etched | 2 | · | · | · | 1 | · | 2 | · |
| Fireworks AI | 2 | · | · | · | 1 | · | · | 2 |
| Mistral AI | 2 | · | · | · | · | · | · | · |
| Parasail | 2 | · | · | · | 1 | · | · | · |
| Perplexity | 2 | · | · | 2 | · | · | · | · |
| Thinking Machines | 2 | · | · | 1 | 1 | · | 2 | 2 |
| Anyscale | 1 | · | · | 1 | · | · | 1 | · |
| Crusoe | 1 | 1 | · | · | · | · | · | 1 |
| FluidStack | 1 | 1 | · | · | · | · | · | 1 |
| Krea | 1 | · | · | · | · | · | · | · |
| Lightmatter | 1 | · | 1 | · | · | · | · | · |
| Lightning AI | 1 | 1 | · | · | · | 1 | 1 | 1 |
| Modal | 1 | · | · | · | 1 | · | · | · |
| Prime Intellect | 1 | 1 | · | · | · | · | · | · |
| RunPod | 1 | · | · | · | · | 1 | · | 1 |
| SF Compute | 1 | 1 | · | · | · | · | 1 | · |
Browse by cut: Salaries · Remote · Europe · Senior · Staff · GPU & CUDA Engineering · ML Systems & Infrastructure · Inference & Model Serving · Performance & Kernel Engineering