Fractile logo

Senior ML Runtime Engineer

Fractile·London·Posted 14 days ago

At Fractile, we’re taking a revolutionary approach to computing to run the world’s largest language models 100x faster than existing systems. Our fast-growing team is working at the cutting edge of the latest AI developments in both hardware and software. Want to get involved?


We are looking for Senior ML Runtime Engineers with experience of key ML software ecosystem components to work on inference server integrations and the runtime stack of our ground-breaking AI accelerators. You can be based in either our London office or Bristol, the choice is yours.

In this role, you will:

  • Integrate Fractile's innovative AI acceleration hardware with leading open source projects like PyTorch, vLLM, and SGLang
  • Develop our underlying high-performance Rust runtime
  • Work with hardware, lower-level software, and ML engineers in a highly collaborative hardware-software co-design methodology

It would be great if you have:

  • Proven experience of working with major ML software ecosystem projects
  • A good understanding of the latest ML workloads and inference deployment challenges
  • Excellent Python and Rust skills and solid experience of industry standard development tools and technologies
  • A creative and innovative mindset, and a willingness to take ownership and drive results in a fast-paced environment
  • Computer Science, Electronic Engineering, Maths, Physics, or related degree and 5+ years of industry experience

You may also have:

  • Experience of working with GPUs or other machine learning accelerators
  • Previous experience in a startup or small team environment
Fractile logo

About Fractile

Fractile is a London-based AI chip startup developing in-memory computing processors designed to run large language model inference up to 100x faster and 10x cheaper than current GPU systems. Founded by Oxford Robotics Institute PhD graduate Walter Goodwin, the company's novel chip architecture fuses computation with memory to eliminate the data-shuttling bottleneck that limits conventional hardware. Fractile emerged from stealth in July 2024 and has since announced a £100M commitment to expand UK operations, including a new hardware engineering facility in Bristol. The team includes senior hires from NVIDIA, ARM, and Imagination Technologies.

London
View Fractile profile →