Computation and Language

SPADE-Bench: Evaluating Spontaneous Strategic Deception in Agents via Plan-Action Divergence
librarian
5 views
Rethinking Memory as Continuously Evolving Connectivity
librarian
33 views
MeMo: Memory as a Model

Computation and Language
Ryan Quek
68 views
The Impossibility Triangle of Long-Context Modeling
librarian
45 views
GiVA: Gradient-Informed Bases for Vector-Based Adaptation
Neeraj Gangwar
60 views
A Multimodal Text- and Graph-Based Approach for Open-Domain Event Extraction from Documents
librarian
71 views
Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language
librarian
73 views
CD2CR: Co-reference Resolution Across Documents and Domains
k-m-smit2
63 views
Demystifying OPD: Length Inflation and Stabilization Strategies for Large Language Models
librarian
116 views
ClawBench: Can AI Agents Complete Everyday Online Tasks?
librarian
97 views
Synthetic Sandbox for Training Machine Learning Engineering Agents
Yuhang Zhou
105 views
Grounded Token Initialization for New Vocabulary in LMs for Generative Recommendation
Daiwei Chen
113 views
AstroConcepts: A Large-Scale Multi-Label Classification Corpus for Astrophysics
librarian
86 views
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents
librarian
88 views
F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World
librarian
167 views
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
librarian
157 views
Learning When to Attend: Conditional Memory Access for Long-Context LLMs
Aditya Chattopadhyay
106 views
Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL
Khushboo Thaker
103 views
SciMDR: Benchmarking and Advancing Scientific Multimodal Document Reasoning
librarian
118 views
Instruction set for the representation of graphs
Ezequiel López-Rubio
101 views
Monitoring Emergent Reward Hacking During Generation via Internal Activations
librarian
104 views
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning
Xinyu Zhu
121 views
Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
librarian
119 views
Attention Is All You Need

Computation and Language
Dr. Murat ALTUN
131 views
Multi-LLM Thematic Analysis with Dual Reliability Metrics: Combining Cohen's Kappa and Semantic Similarity for Qualitative Research Validation
Nilesh Jain
120 views
UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward
librarian
192 views
Attention Is All You Need

Computation and Language
Salman
192 views
Memory in the Age of AI Agents

Computation and Language
librarian
234 views
Non-Resolution Reasoning: A Framework for Preserving Semantic Ambiguity in Language Models
Kei Saito
204 views
Latent Collaboration in Multi-Agent Systems
librarian
233 views
Generalist Foundation Models Are Not Clinical Enough for Hospital Operations
librarian
234 views
Instella: Fully Open Language Models with Stellar Performance
librarian
252 views