Machine Learning

UMA: A Family of Universal Models for Atoms
Avatar
librarian
0 views
Optimising 4th-Order Runge-Kutta Methods: A Dynamic Heuristic Approach
  for Efficiency and Low Storage
Avatar
Gavin Goodship
7 views
Flow-Based Single-Step Completion for Efficient and Expressive Policy
  Learning
Avatar
librarian
445 views
AutoRule: Reasoning Chain-of-thought Extracted Rule-based Rewards
  Improve Preference Learning
Avatar
Tevin Wang
10 views
Dense SAE Latents Are Features, Not Bugs
Avatar
librarian
10 views
TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct
  Preference Optimization
Avatar
Mingkang Zhu
7 views
On the Hardness of Bandit Learning
Avatar
librarian
5 views
TimeMaster: Training Time-Series Multimodal LLMs to Reason via
  Reinforcement Learning
Avatar
Junru Zhang
17 views
Rethinking Losses for Diffusion Bridge Samplers
Avatar
librarian
28 views