
London or Bristol, 3 days in the office, 2 days WFH
At Fractile, we’re building what we believe will be the world’s fastest AI inference chip from the ground up. We’re balanced across hardware and software engineering, and HW/SW co-design is how we work. We move fast, and we help each other move fast. We care about each other, the software we ship, and the people who rely on it.
We build the control chain that powers on, monitors, updates, and protects devices and racks across bare metal, RTOS, and embedded Linux. It’s production-critical software that keeps racks stable, updates safe, and hardware secure. It’s a critical layer in turning a tokens per second benchmark into a tokens per month system, turning great silicon into reliable output at rack scale.
You’ll be there for the first racks coming to life and rollout days where update safety matters. Your work makes the difference between a bad failure and a clean recovery path. This is the work that makes the system something operators can trust.
If you want to build the control software that keeps rack-scale systems stable, safe, and secure for next-gen AI, come build it together.
Fractile is a London-based AI chip startup developing in-memory computing processors designed to run large language model inference up to 100x faster and 10x cheaper than current GPU systems. Founded by Oxford Robotics Institute PhD graduate Walter Goodwin, the company's novel chip architecture fuses computation with memory to eliminate the data-shuttling bottleneck that limits conventional hardware. Fractile emerged from stealth in July 2024 and has since announced a £100M commitment to expand UK operations, including a new hardware engineering facility in Bristol. The team includes senior hires from NVIDIA, ARM, and Imagination Technologies.