cs.AIarXiv:2606.24470
Couple a frozen reactive VLM and a frozen reasoning VLM, training only the channel between them — a learned latent bridge projects the slow model's residuals into the fast model's embedding space, no text round-trip, matching or beating a text bridge across Atari and driving.
cs.AIarXiv:2606.19172
A user's facts become a few rows in a content-addressed memory table rather than a rewrite of the model — per-user memory as a local, leak-free parametric edit.
cs.AIarXiv:2606.17929
Each successful run compiles into a small program the agent can replay — and every program is verified before it is trusted, so repeated tasks get faster without getting riskier.
cs.LGarXiv:2606.17107
Transformers memoize field-conditioned conclusions onto downstream tokens at prefill — making the KV cache something you can edit, compose, and reuse.
cs.AIarXiv:2606.16707
An agent's model of a user as a version-controlled software project of typed Python dataclasses and executable constraints — recall, aggregation, and proactive alerting in one medium.
cs.ARarXiv:2606.13708
A compact, statically verifiable instruction set that runs on the memory-side NIC, collapsing multi-RTT remote-memory indirection chains into a single round-trip.
cs.AIarXiv:2605.28717
The first clean-room open implementation of Huawei's Unified Bus transport. A native load/store path returns a 64-byte remote fetch in ~500 ns — 4.37× below a matched RoCEv2 baseline.
cs.LGarXiv:2604.24827
Estimating a black-box model's parameter count from its factual capacity — with calibration curves and probe-level data across 165 models and 1,400 questions.
cs.MMarXiv:2604.20940
Transport that carries meaning, not signal: discrete audio tokens and a hybrid screen representation cut uplink bandwidth 64× for audio and 130–210× for screenshots, within 0.7 pp of raw task accuracy.