Machine Learning

TRACE: Turn-level Reward Assignment via Credit Estimation for Long-Horizon Agents

Machine Learning

librarian

2 views

MxGPS: Multiplex Graph Transformers for a Power Grid Foundation Model

Machine Learning

librarian

2 views

Transforming Rank: How Architecture Navigates the Spectral Pathologies of Depth

Machine Learning

librarian

2 views

Leveraging unlabelled data for generalizable neural population decoding

Machine Learning

librarian

2 views

How to Tame Grokking: Representation Geometry as a Control Signal

Machine Learning

librarian

6 views

An Exact Instrument for State Usage in Selective State-Space Models, and the Input-Driven Migration It Reveals

Machine Learning

librarian

7 views

CDFM: Towards a General-Purpose Causal Discovery Foundation Model

Machine Learning

librarian

6 views

Inside the Unfair Judge: A Mechanistic Interpretability Account of LLM-as-Judge Bias

Machine Learning

Xiuying Chen

7 views

Invariant Learning Dynamics of Transformers in Inductive Reasoning Tasks

Machine Learning

librarian

7 views

Requential Coding: Pushing the Limits of Model Compression with Self-Generated Training Data

Machine Learning

librarian

5 views

Dimensionality Reduction Meets Network Science: Sensemaking on UMAP's kNN Graph

Machine Learning

librarian

31 views

EdgeRefine: Privacy-Utility Balance for Graphs via Jaccard Sampling under Edge Differential Privacy

Machine Learning

librarian

19 views

BiSCo-LLM: Lookup-Free Binary Spherical Coding for Extreme Low-Bit Large Language Model Compression

Machine Learning

librarian

23 views

Latent Memory Palace: Reasoning for Control as Autoregressive Variational Inference

Machine Learning

Chuning Zhu

20 views

Super Weights in LLMs and the Failure of Selective Training

Machine Learning

librarian

16 views

SLORR: Simple and Efficient In-Training Low-Rank Regularization

Machine Learning

librarian

20 views

How Data Shapes RoPE Frequency Usage: From Positional Scale Matching to Length Generalization

Machine Learning

librarian

25 views

Single-Rollout Asynchronous Optimization for Agentic Reinforcement Learning

Machine Learning

librarian

24 views

Agon: Competitive Cross-Model RL with Implicit Rival Grading of Reasoning

Machine Learning

Vladislav Beliaev

18 views

Selective Timestep Weighting and Advantage-Based Replay for Sample-Efficient Diffusion RLHF

Machine Learning

Eric Zhu

18 views

The Key to Going Linear: Analysis-Driven Transformer Linearization

Machine Learning

librarian

22 views

Graph Convolutional Attention: A Spectral Perspective on Graph Denoising and Diffusion

Machine Learning

librarian

46 views

Canopy: A Heterograph Foundation Model for Metabolic Engineering

Machine Learning

librarian

27 views

Physics-Informed Neural Embeddings of PDE Solution Families

Machine Learning

Leonid Sarieddine

29 views

Quantitative Gaussian-Process limits of Tensor Programs

Machine Learning

librarian

25 views

TILDE: TILt-based Distributional Erasure for Concept Unlearning

Machine Learning

librarian

17 views

CompactionRL: Reinforcement Learning with Context Compaction for Long-Horizon Agents

Machine Learning

librarian

104 views

Beyond Adam: SOAP and Muon for Faster, Label-Efficient Training of Machine Learning Interatomic Potentials

Machine Learning

librarian

39 views

DecompRL: Solving Harder Problems by Learning Modular Code Generation

Machine Learning

librarian

33 views

Neuron-Aware Data Selection for Annotation-Free LLM Self-Distillation

Machine Learning

librarian

33 views

DemoPSD: Disagreement-Modulated Policy Self-Distillation

Machine Learning

Yunhe Li

65 views

Program-as-Weights: A Programming Paradigm for Fuzzy Functions

Machine Learning

librarian

32 views
