Computer Science

Entropy Is Not Enough: Unlocking Effective Reinforcement Learning for Visual Reasoning via Vision-Anchored Token Selection
Avatar
Senjie Jin
1 view
Neuron Populations Exhibit Divergent Selectivity with Scale
Avatar
Amil Dravid
7 views
Dynamic Short Convolutions Improve Transformers
Avatar
librarian
5 views
q0: Primitives for Hyper-Epoch Pretraining
Avatar
librarian
7 views
Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories
Avatar
Ali Behrouz
6 views
Reasoning Structure of Large Language Models
Avatar
librarian
4 views
Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models
Avatar
librarian
5 views
When Model Merging Breaks Routing: Training-Free Calibration for MoE
Avatar
Xiaojun Quan
7 views
Gender-Dependent Diagnostic Substitution in LLM Medical Triage: Same Symptoms, Unequal Urgency
Avatar
Qi Han Wong
6 views
Diagnosing Knowledge Gaps in LLM Tool Use: An Agentic Benchmark for Novel API Acquisition
Avatar
Jinnuo Liu
7 views
From Answers to States: Verifiable Process-Level Evaluation of Chemical Reasoning in Large Language Models
Avatar
librarian
6 views
SPADE-Bench: Evaluating Spontaneous Strategic Deception in Agents via Plan-Action Divergence
Avatar
librarian
4 views
MCP-Persona: Benchmarking LLM Agents on Real-World Personal Applications via Environment Simulation
Avatar
librarian
8 views
Iteris: Agentic Research Loops for Computational Mathematics
Avatar
librarian
9 views
AGENTCL: Toward Rigorous Evaluation of Continual Learning in Language Agents
Avatar
Yiheng Shu
9 views
ClinEnv: An Interactive Multi-Stage Long Horizon EHR Environment for Agents
Avatar
librarian
9 views
eMoT: evolving Memory-of-Thought via Symbolic Anchoring and Memory Corrosion
Avatar
librarian
7 views
Property Prediction of Stacked Bilayer Materials: A Multimodal Learning Approach
Avatar
librarian
7 views