Computer Science

On-Policy Self-Distillation with Sampled Demonstrations Reduces Output Diversity
Avatar
Andrei Liviu Nicolicioiu
0 views
RevengeBench: Reverse Engineering Code-Space Policies from Behavioral Experiments
Avatar
Babak Rahmani
0 views
InvestPhilBench: A Multi-Layer Dynamic Benchmark for Evaluating Large Language Model Procedural Reasoning in Expert Investment Philosophy
Avatar
librarian
1 view
Autodata: An agentic data scientist to create high quality synthetic data
Avatar
Ilia Kulikov
1 view
The Unfireable Safety Kernel: Execution-Time AI Alignment for AI Agents and Other Escapable AI Systems
Avatar
librarian
1 view
Cliff Tokens: Identifying Single-Token Failure Triggers in LLM Mathematical Reasoning
Avatar
librarian
2 views
AI Snitches Get Glitches: Towards Evading Agentic Surveillance
Avatar
Hyejun Jeong
2 views
Staying In Character: Perspective-Bounded Memory For Book-Based Role-Playing Agents
Avatar
Xushuo Tang
1 view
Confidence Sequences for Online Statistical Model Checking of Markov Decision Processes
Avatar
librarian
2 views
Decentralised AI Training and Inference with BlockTrain
Avatar
librarian
3 views
Parallel Manifold Steering: Efficient Adaptation of Large Associative Memories via Residual Energy Shaping
Avatar
Kanishk Awadhiya
5 views
World Models in Pieces: Structural Certification for General Agents
Avatar
Tongxin Li
5 views
OpenThoughts-Agent: Data Recipes for Agentic Models
Avatar
librarian
5 views
Data Augmentation: A Fourier Analysis Perspective
Avatar
Behrooz Tahmasebi
4 views
A specialized reasoning large language model for accelerating rare disease diagnosis: a randomized AI physician assistance trial
Avatar
librarian
5 views
ReM-MoA: Reasoning Memory Sustains Mixture-of-Agents Scaling
Avatar
Heng Ping
4 views
VeriEvol: Scaling Multimodal Mathematical Reasoning via Verifiable Evol-Instruct
Avatar
Haoling Li
10 views
The Topology of Ill-Posed Questions: Persistent Homology for Detection and Steering in LLMs
Avatar
Guangyu Jiang
9 views
Tapered Language Models

Tapered Language Models

Machine Learning
Avatar
librarian
12 views
SPIRAL: Learning to Search and Aggregate
Avatar
librarian
8 views