Measured. Verified. Revolutionary.
Flow Parallel Processing Unit (PPU)
Delivers significant performance gains in validated workloads, redefining what’s possible for general-purpose computing.
Request full results ↓
Performance beyond the limits of traditional CPUs
Flow demonstrated how its 256-core PPU accelerates real workloads across AI, HPC, and hyperscale use cases.
The results show a near-linear scaling of performance as tasks are distributed across PPU units, eliminating long-standing CPU bottlenecks caused by synchronization, cache coherence, and thread-management overhead.
Breaking through the CPU bottleneck
Modern workloads have outgrown the limits of sequential and multi-core CPU scaling.
Flow’s architecture introduces a new execution model that brings true parallelism inside the CPU, achieving high throughput without relying on external accelerators.
The result is a scalable, efficient foundation for next-generation computing, from AI and cloud infrastructure to embedded and edge systems.
Scalable
Near-linear throughput as PPU resources increase
Efficient
Lower energy per operation
Compatible
Integrates into Arm, x86, RISC-V, and Power architectures
Request access to our performance results + poster
Fill out the form below to request access to our full benchmark set, technical notes, and the official poster (presented at Hot Chips 2025).