The GPU Is the New Storage Controller
For 40 years, the CPU decided what gets read from storage, when, and where it lands in memory. AI inference is breaking that model. The GPU now knows what data it needs next, and routing that request through the CPU is pure overhead. GPU-initiated I/O is the architectural response. But the protocol stack wasn't designed for it.