Computer Science

Right in the Right Way: LM Training with Verifiable Rewards and Human Demonstrations
Mehul Damani
0 views
Optimal Resource Utilization for Autonomous Laboratory Orchestrators
Austin McDannald
1 view
Agentic generation of verifiable rules for deterministic, self-expanding reaction classification
librarian
1 view
Theoria: Rewrite-Acceptability Verification over Informal Reasoning States
librarian
1 view
AutoMem: Automated Learning of Memory as a Cognitive Skill
librarian
1 view
Is One Layer Enough? Training A Single Transformer Layer Can Match Full-Parameter RL Training
Zijian Zhang
1 view
Self-Evolving Agents with Anytime-Valid Certificates
librarian
1 view
Graph-Native Reinforcement Learning Enables Traceable Scientific Hypothesis Generation through Conceptual Recombination
librarian
1 view
Reinforcement Learning with Metacognitive Feedback Elicits Faithful Uncertainty Expression in LLMs
librarian
4 views
Evo-PI: Aligning Medical Reasoning via Evolving Principle-Guided Supervision
Xianda Zheng
3 views
A Self-Evolving Agentic System for Automated Generation and Execution of Biological Protocols
librarian
5 views
QVal: Cheaply Evaluating Dense Supervision Signals for Long-Horizon LLM Agents
Sergio Hernández-Gutiérrez
14 views
Harnessing Textual Refusal Directions for Multimodal Safety
librarian
7 views
An Agentic AI Framework to Accelerate Scientific Discovery in Plant Phenotyping
Renan Souza
4 views
RAISE: LLM-based Automated Heuristic Design with Robust Adversary Instance Search
librarian
5 views
TreeAgent: A Generalizable Multi-Agent Framework for Automated Bias Labeling in Forestry via Compiled Expert Rules and Vision-Language Models
librarian
5 views
AxDafny: Agentic Verified Code Generation in Dafny
librarian
13 views
TabPATE: Differentially Private Tabular In-Context Learning Without Public Data
Dariush Wahdany
5 views
Evil Spectra: How Optimisers can Amplify or Suppress Emergent Misalignment
Jason Brown
5 views
The FIL Hypothesis: Inductive Biases Help with Kernel Engineering
librarian
5 views
Whose Side Is Your Agent On? Multi-Party Principal Loyalty in LLM Agents
librarian
6 views
Linguistic Firewall: Geometry as Defense in Multi-Agent Systems Routing
Dvir Alsheich
7 views