Toward Generalist Autonomous Research via Hypothesis-Tree Refinement
Jiajie Jin
17 views
End-to-End Context Compression at Scale
librarian
33 views
SPADE-Bench: Evaluating Spontaneous Strategic Deception in Agents via Plan-Action Divergence
librarian
38 views
Rethinking Memory as Continuously Evolving Connectivity
librarian
54 views
MeMo: Memory as a Model
Computation and Language
Ryan Quek
87 views
The Impossibility Triangle of Long-Context Modeling
librarian
67 views
GiVA: Gradient-Informed Bases for Vector-Based Adaptation
Neeraj Gangwar
85 views
A Multimodal Text- and Graph-Based Approach for Open-Domain Event Extraction from Documents
librarian
90 views
Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language
librarian
93 views
CD2CR: Co-reference Resolution Across Documents and Domains
k-m-smit2
90 views
Demystifying OPD: Length Inflation and Stabilization Strategies for Large Language Models
librarian
134 views
ClawBench: Can AI Agents Complete Everyday Online Tasks?
librarian
123 views
Synthetic Sandbox for Training Machine Learning Engineering Agents
Yuhang Zhou
129 views
Grounded Token Initialization for New Vocabulary in LMs for Generative Recommendation
Daiwei Chen
140 views
AstroConcepts: A Large-Scale Multi-Label Classification Corpus for Astrophysics
librarian
104 views
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents
librarian
113 views
F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World
librarian
193 views
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
librarian
182 views
Learning When to Attend: Conditional Memory Access for Long-Context LLMs
Aditya Chattopadhyay
129 views
Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL
Khushboo Thaker
131 views
SciMDR: Benchmarking and Advancing Scientific Multimodal Document Reasoning
librarian
143 views
Instruction set for the representation of graphs
Ezequiel López-Rubio
124 views
Monitoring Emergent Reward Hacking During Generation via Internal Activations
librarian
132 views
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning
Xinyu Zhu
143 views
Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
librarian
141 views
Attention Is All You Need
Computation and Language
Dr. Murat ALTUN
151 views
Multi-LLM Thematic Analysis with Dual Reliability Metrics: Combining Cohen's Kappa and Semantic Similarity for Qualitative Research Validation
Nilesh Jain
152 views
UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward
librarian
214 views
Attention Is All You Need
Computation and Language
Salman
223 views
Memory in the Age of AI Agents
Computation and Language
librarian
253 views
Non-Resolution Reasoning: A Framework for Preserving Semantic Ambiguity in Language Models
Kei Saito
229 views
Latent Collaboration in Multi-Agent Systems
librarian
260 views