Computation and Language

ATLAS: Learning to Optimally Memorize the Context at Test Time
librarian
3 views
ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering
librarian
3 views
LoLA: Low-Rank Linear Attention With Sparse Caching
librarian
3 views
DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning
Jiahao Xu
2 views
Learning Composable Chains-of-Thought
librarian
3 views
"KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding
Alkis Koudounas
10 views
THiNK: Can Large Language Models Think-aloud?
Yongan Yu
9 views
Do Large Language Models Excel in Complex Logical Reasoning with Formal Language?
Jin Jiang
8 views
MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems
librarian
8 views
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
librarian
11 views
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space
librarian
8 views
A Federated Splitting Framework for LLMs: Security, Efficiency, and Adaptability
librarian
7 views
VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models
librarian
5 views
BIM-GPT: a Prompt-Based Virtual Assistant Framework for BIM Information Retrieval
Hervé Onguéné
15 views
Learning Dynamics in Continual Pre-Training for Large Language Models
librarian
14 views
ComPO: Preference Alignment via Comparison Oracles
librarian
14 views
Reasoning Models Don't Always Say What They Think
Yanda Chen
19 views
Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages
Hussein Kedir
26 views
Attention Is All You Need
Computation and Language
경택 오
82 views
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
yorba
58 views
Attention Is All You Need
Computation and Language
Ilya Baimetov
263 views
A Pipeline For Discourse Circuits From CCG
ScienceCast Board
212 views
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text
Yael Flax
215 views
Meta-path Augmented Response Generation
ScienceCast Board
201 views
CliNER 2.0: Accessible and Accurate Clinical Concept Extraction
Sasa Pure
183 views
A Hybrid Architecture for Multi-Party Conversational Systems
priaon-flag
191 views
Analyzing the Structure of Attention in a Transformer Language Model
levymoshe16
207 views
Direct Neural Machine Translation with Task-level Mixture of Experts models
Isidora Tourni
207 views
Transformers as Soft Reasoners over Language
ScienceCast Board
214 views
Attention Is All You Need
Computation and Language
ScienceCast Board
469 views
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Xing Han
233 views
Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs
burke-atilla
198 views