Measured. Verified. Revolutionary.

Flow Parallel Processing Unit (PPU)

Delivers significant performance gains in validated workloads, redefining what’s possible for general-purpose computing.

Request full results ↓

Performance beyond the limits of traditional CPUs

Flow demonstrated how its 256-core PPU accelerates real workloads across AI, HPC, and hyperscale use cases.

The results show a near-linear scaling of performance as tasks are distributed across PPU units, eliminating long-standing CPU bottlenecks caused by synchronization, cache coherence, and thread-management overhead.

Breaking through the CPU bottleneck

Modern workloads have outgrown the limits of sequential and multi-core CPU scaling.

Flow’s architecture introduces a new execution model that brings true parallelism inside the CPU, achieving high throughput without relying on external accelerators.

The result is a scalable, efficient foundation for next-generation computing, from AI and cloud infrastructure to embedded and edge systems.

Scalable

Near-linear throughput as PPU resources increase

Efficient

Lower energy per operation

Compatible

Integrates into Arm, x86, RISC-V, and Power architectures

Request access to our performance results + poster

Fill out the form below to request access to our full benchmark set, technical notes, and the official poster (presented at Hot Chips 2025).

Contact usX