Computer Science

Beyond Safe Data: Pretraining-Stage Alignment with Regular Safety Reflection
Jinhan Li
1 view
ARIADNE: Agnostic Routing for Inference-time Adapter DyNamic sElection
Enrico Cassano
1 view
Explaining Attention with Program Synthesis
Amiri Hayes
0 views
User as Engram: Internalizing Per-User Memory as Local Parametric Edits
librarian
1 view
NeSyCat Torch: A Differentiable Tensor Implementation of Categorical Semantics for Neurosymbolic Learning
librarian
1 view
Rethinking Reward Supervision: Rubric-Conditioned Self-Distillation
librarian
2 views
EvolveNav: Proactive Preflection and Self-Evolving Memory for Zero-Shot Object Goal Navigation
librarian
4 views
The Stanford EDGAR Filings Dataset: Reconstructing U.S. Corporate and Financial Disclosures into Layout-Faithful and Token-Efficient Pretraining Data
Nick Bettencourt
4 views
Looped World Models

Machine Learning
librarian
6 views
Rethinking Dataset Distillation for Classification: Do Distilled Sets Outperform Coresets?
Trisha Mittal
4 views
PseudoBench: Measuring How Agentic Auto-Research Fuels Pseudoscience
librarian
3 views
Learning Cardiac Electrophysiology Digital Twins Through Agentic Discovery of Hybrid Structure
Ziqi Zhou
3 views
Fixed-Point Reasoners: Stable and Adaptive Deep Looped Transformers
librarian
4 views
RAID: Semantic Graph Diffusion for True Cold-Start and Cross-Lingual Forecasting
librarian
4 views
Scaling LLM Reasoning from Minimal Labels: A Semi-Supervised Framework with a Lightweight Verifier
librarian
3 views
When in Doubt, Plan It Out: Committed Small Language Model Deliberation for Reactive Reinforcement Learning
librarian
3 views
MA-SBI: Misspecification-Aware Simulation-Based Inference via Side-Channel Guidance
librarian
5 views
Greed Is Learned: Visible Incentives as Reward-Hacking Triggers
librarian
3 views
OpenClaw-Skill: Collective Skill Tree Search for Agentic Large Language Models
librarian
5 views
A First-Principles Derivation of LLM Policy Optimization: From Expected Reward to GRPO and Its Structural Extensions
librarian
6 views
GIST-CMTF: Goal-State Inference for Causal Minimal Tool Filtering in LLM Agents
Rahul Suresh Babu
4 views
A Causal Model of Theory of Mind in Conflict for Artificial Intelligence
Nikolos Gurney
4 views