Machine Learning

APPO: Agentic Procedural Policy Optimization
Avatar
librarian
2 views
EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents
Avatar
Weixian Xu
11 views
Express Language Modeling
Avatar
librarian
5 views
Tight Sample Complexity of Transformers
Avatar
librarian
22 views
Muon Learns More Robust and Transferable Features than Adam
Avatar
Fengzhuo Zhang
20 views
Algorithm for Contextual Queueing Bandits with Rate-Optimal Queue Length Regret
Avatar
Seoungbin Bae
14 views
End-to-End Subgraph Detection with GraphDETR
Avatar
librarian
27 views
Double Preconditioning (DoPr): Optimization for Test-Time Performance, not Validation Loss
Avatar
Thomas Zhanga
36 views
Pretraining Recurrent Networks without Recurrence
Avatar
librarian
41 views
Deep Embedded Multiplicative DMD for Algebra-Preserving Koopman Learning
Avatar
Kelan Gray
32 views
Neuron Populations Exhibit Divergent Selectivity with Scale
Avatar
Amil Dravid
35 views