Computation and Language

MeMo: Memory as a Model

MeMo: Memory as a Model

Computation and Language
Avatar
Ryan Quek
14 views
The Impossibility Triangle of Long-Context Modeling
Avatar
librarian
28 views
GiVA: Gradient-Informed Bases for Vector-Based Adaptation
Avatar
Neeraj Gangwar
45 views
A Multimodal Text- and Graph-Based Approach for Open-Domain Event Extraction from Documents
Avatar
librarian
50 views
Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language
Avatar
librarian
60 views
CD2CR: Co-reference Resolution Across Documents and Domains
Avatar
k-m-smit2
48 views
Demystifying OPD: Length Inflation and Stabilization Strategies for Large Language Models
Avatar
librarian
95 views
ClawBench: Can AI Agents Complete Everyday Online Tasks?
Avatar
librarian
72 views
Synthetic Sandbox for Training Machine Learning Engineering Agents
Avatar
Yuhang Zhou
80 views
Grounded Token Initialization for New Vocabulary in LMs for Generative Recommendation
Avatar
Daiwei Chen
97 views
AstroConcepts: A Large-Scale Multi-Label Classification Corpus for Astrophysics
Avatar
librarian
70 views
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents
Avatar
librarian
74 views
F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World
Avatar
librarian
142 views
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
Avatar
librarian
138 views
Learning When to Attend: Conditional Memory Access for Long-Context LLMs
Avatar
Aditya Chattopadhyay
82 views
Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL
Avatar
Khushboo Thaker
88 views
SciMDR: Benchmarking and Advancing Scientific Multimodal Document Reasoning
Avatar
librarian
97 views
Instruction set for the representation of graphs
Avatar
Ezequiel López-Rubio
86 views
Monitoring Emergent Reward Hacking During Generation via Internal Activations
Avatar
librarian
87 views
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning
Avatar
Xinyu Zhu
102 views
Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
Avatar
librarian
103 views
Attention Is All You Need

Attention Is All You Need

Computation and Language
Avatar
Dr. Murat ALTUN
117 views
Multi-LLM Thematic Analysis with Dual Reliability Metrics: Combining Cohen's Kappa and Semantic Similarity for Qualitative Research Validation
Avatar
Nilesh Jain
107 views
UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward
Avatar
librarian
172 views
Attention Is All You Need

Attention Is All You Need

Computation and Language
Avatar
Salman
172 views
Memory in the Age of AI Agents

Memory in the Age of AI Agents

Computation and Language
Avatar
librarian
215 views
Non-Resolution Reasoning: A Framework for Preserving Semantic Ambiguity in Language Models
Avatar
Kei Saito
187 views
Latent Collaboration in Multi-Agent Systems
Avatar
librarian
216 views
Generalist Foundation Models Are Not Clinical Enough for Hospital Operations
Avatar
librarian
214 views
Instella: Fully Open Language Models with Stellar Performance
Avatar
librarian
235 views
Kimi Linear: An Expressive, Efficient Attention Architecture
Avatar
librarian
366 views
Tongyi DeepResearch Technical Report

Tongyi DeepResearch Technical Report

Computation and Language
Avatar
librarian
318 views