Machine Learning

Neuron Populations Exhibit Divergent Selectivity with Scale
Avatar
Amil Dravid
9 views
Dynamic Short Convolutions Improve Transformers
Avatar
librarian
8 views
q0: Primitives for Hyper-Epoch Pretraining
Avatar
librarian
9 views
Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories
Avatar
Ali Behrouz
9 views
When Model Merging Breaks Routing: Training-Free Calibration for MoE
Avatar
Xiaojun Quan
8 views
Silent Failures in Federated Personalization of Foundation Models
Avatar
YongKyung Oh
6 views
SoundnessBench: Can Your AI Scientist Really Tell Good Research Ideas from Bad Ones?
Avatar
Sy-Tuyen Ho
36 views
CalArena: A Large-Scale Post-Hoc Calibration Benchmark
Avatar
Eugène Berta
42 views