Nothing here yet
Long-context LLM inference hits a memory wall: each decode step requires scanning the entire KV cache, incurring $O(n)$ memory bandwidth that cannot be solved by faster arithmetic. PRISM proposes a thin-film lithium niobate photonic accelerator that performs the block-selection similarity search in $O(1)$ optical latency using a broadcast-and-weight architecture, eliminating the $O(n)$ scan entirely. The work claims $16\times$–$32\times$ traffic reduction at 64K–128K tokens and a four-order-of-magnitude energy advantage over GPU baselines by matching photonic hardware capabilities—passive query broadcast, quasi-static microring weights, and low-precision rank output—to the selection task.
Precision free-space optics demands sub-millimeter and sub-degree tolerances where traditional robotic pick-and-place fails. This work introduces a closed-loop robotics framework integrating hierarchical computer vision, Newton-based spatial optimization, and Bayesian angular optimization to autonomously construct, align, and maintain optical systems. The authors demonstrate this by building a tabletop laser cavity from randomly distributed components—achieving beam alignment, mode selection, and self-recovery without human intervention. The system bridges the gap between coarse robotic manipulation and the extreme precision required for functional optical experiments.