Artificial Intelligence

How Uncertainty Estimation Scales with Sampling in Reasoning Models
librarian
13 views
D5P4: Partition Determinantal Point Process for Diversity in Parallel Discrete Diffusion Decoding
librarian
11 views
cuGenOpt: A GPU-Accelerated General-Purpose Metaheuristic Framework for Combinatorial Optimization
Yuyang Liu
11 views
Box Maze: A Process-Control Architecture for Reliable LLM Reasoning
Zou Qiang
11 views
dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models
librarian
12 views
MANAR: Memory-augmented Attention with Navigational Abstract Conceptual Representation
librarian
11 views
Memento-Skills: Let Agents Design Agents
librarian
33 views
Reasoning over mathematical objects: on-policy reward modeling and test time aggregation
librarian
11 views
Facts as First Class Objects: Knowledge Objects for Persistent LLM Memory
Oliver Zahn
10 views
RPMS: Enhancing LLM-Based Embodied Planning through Rule-Augmented Memory Synergy
Zhenhang Yuan
11 views
AgentFactory: A Self-Evolving Framework Through Executable Subagent Accumulation and Reuse
Zhang Zhang
12 views
When Only the Final Text Survives: Implicit Execution Tracing for Multi-Agent Attribution
librarian
12 views
Contrastive Reasoning Alignment: Reinforcement Learning from Hidden Representations
Haozheng Luo
18 views
Towards Safer Large Reasoning Models by Promoting Safety Decision-Making before Chain-of-Thought Generation
librarian
14 views
Proactive Knowledge Inquiry in Doctor-Patient Dialogue: Stateful Extraction, Belief Updating, and Path-Aware Action Planning
librarian
11 views
Machines acquire scientific taste from institutional traces
librarian
20 views
Anticipatory Planning for Multimodal AI Agents
librarian
14 views
Internalizing Agency from Reflective Experience
librarian
14 views
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models
librarian
15 views
Via Negativa for AI Alignment: Why Negative Constraints Are Structurally Superior to Positive Preferences
Quan Cheng
15 views
Adaptive Theory of Mind for LLM-based Multi-Agent Coordination
Chunjiang Mu
13 views
TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas
Jian Ai
13 views
Talk, Evaluate, Diagnose: User-aware Agent Evaluation with Automated Error Analysis
librarian
18 views
Portfolio of Solving Strategies in CEGAR-based Object Packing and Scheduling for Sequential 3D Printing
Pavel Surynek
30 views
Compiling Temporal Numeric Planning into Discrete PDDL+: Extended Version
librarian
17 views
Increasing intelligence in AI agents can worsen collective outcomes
librarian
28 views
On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents
librarian
30 views
TopoBench: Benchmarking LLMs on Hard Topological Reasoning
Mayug Maniparambil
30 views
Examining Reasoning LLMs-as-Judges in Non-Verifiable LLM Post-Training
librarian
27 views
FAME: Formal Abstract Minimal Explanation for Neural Networks
librarian
31 views
Emulating Clinician Cognition via Self-Evolving Deep Clinical Research
Ruiyang Ren
37 views