Computation and Language

WebSwarm: Recursive Multi-Agent Orchestration for Deep-and-Wide Web Search

WebSwarm: Recursive Multi-Agent Orchestration ...

Computation and Language

Xiaoshuai Song

18 views

Reinforcement Learning with Metacognitive Feedback Elicits Faithful Uncertainty Expression in LLMs

Reinforcement Learning with Metacognitive Feed...

Computation and Language

librarian

34 views

Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It

Why Multi-Step Tool-Use Reinforcement Learning...

Computation and Language

abcdezzy688

47 views

Staying In Character: Perspective-Bounded Memory For Book-Based Role-Playing Agents

Staying In Character: Perspective-Bounded Memo...

Computation and Language

Xushuo Tang

47 views

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

Toward Generalist Autonomous Research via Hypo...

Computation and Language

Jiajie Jin

71 views

End-to-End Context Compression at Scale

End-to-End Context Compression at Scale

Computation and Language

librarian

63 views

SPADE-Bench: Evaluating Spontaneous Strategic Deception in Agents via Plan-Action Divergence

SPADE-Bench: Evaluating Spontaneous Strategic ...

Computation and Language

librarian

80 views

Rethinking Memory as Continuously Evolving Connectivity

Rethinking Memory as Continuously Evolving Con...

Computation and Language

librarian

95 views

MeMo: Memory as a Model

MeMo: Memory as a Model

Computation and Language

Ryan Quek

129 views

The Impossibility Triangle of Long-Context Modeling

The Impossibility Triangle of Long-Context Mod...

Computation and Language

librarian

105 views

GiVA: Gradient-Informed Bases for Vector-Based Adaptation

GiVA: Gradient-Informed Bases for Vector-Based...

Computation and Language

Neeraj Gangwar

118 views

A Multimodal Text- and Graph-Based Approach for Open-Domain Event Extraction from Documents

A Multimodal Text- and Graph-Based Approach fo...

Computation and Language

librarian

137 views

Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language

Chat2Workflow: A Benchmark for Generating Exec...

Computation and Language

librarian

125 views

CD2CR: Co-reference Resolution Across Documents and Domains

CD2CR: Co-reference Resolution Across Document...

Computation and Language

k-m-smit2

128 views

Demystifying OPD: Length Inflation and Stabilization Strategies for Large Language Models

Demystifying OPD: Length Inflation and Stabili...

Computation and Language

librarian

183 views

ClawBench: Can AI Agents Complete Everyday Online Tasks?

ClawBench: Can AI Agents Complete Everyday Onl...

Computation and Language

librarian

166 views

Synthetic Sandbox for Training Machine Learning Engineering Agents

Synthetic Sandbox for Training Machine Learnin...

Computation and Language

Yuhang Zhou

175 views

Grounded Token Initialization for New Vocabulary in LMs for Generative Recommendation

Grounded Token Initialization for New Vocabula...

Computation and Language

Daiwei Chen

183 views

AstroConcepts: A Large-Scale Multi-Label Classification Corpus for Astrophysics

AstroConcepts: A Large-Scale Multi-Label Class...

Computation and Language

librarian

133 views

AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents

AgentSwing: Adaptive Parallel Context Manageme...

Computation and Language

librarian

158 views

F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World

F2LLM-v2: Inclusive, Performant, and Efficient...

Computation and Language

librarian

264 views

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Nemotron-Cascade 2: Post-Training LLMs with Ca...

Computation and Language

librarian

225 views

Learning When to Attend: Conditional Memory Access for Long-Context LLMs

Learning When to Attend: Conditional Memory Ac...

Computation and Language

Aditya Chattopadhyay

169 views

Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL

Knowledge Distillation with Structured Chain-o...

Computation and Language

Khushboo Thaker

161 views

SciMDR: Benchmarking and Advancing Scientific Multimodal Document Reasoning

SciMDR: Benchmarking and Advancing Scientific ...

Computation and Language

librarian

189 views

Instruction set for the representation of graphs

Instruction set for the representation of graphs

Computation and Language

Ezequiel López-Rubio

155 views

Monitoring Emergent Reward Hacking During Generation via Internal Activations

Monitoring Emergent Reward Hacking During Gene...

Computation and Language

librarian

164 views

CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning

CHIMERA: Compact Synthetic Data for Generaliza...

Computation and Language

Xinyu Zhu

172 views

Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling

Team of Thoughts: Efficient Test-time Scaling ...

Computation and Language

librarian

181 views

Attention Is All You Need

Attention Is All You Need

Computation and Language

Dr. Murat ALTUN

192 views

Multi-LLM Thematic Analysis with Dual Reliability Metrics: Combining Cohen's Kappa and Semantic Similarity for Qualitative Research Validation

Multi-LLM Thematic Analysis with Dual Reliabil...

Computation and Language

Nilesh Jain

184 views

UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward

UltraLogic: Enhancing LLM Reasoning through La...

Computation and Language

librarian

250 views

Web analytics