📊
Hot点·热榜
首页
综合
科技
娱乐
社区
购物
财经
更多 ▾
开发
AI
设计
🔍
✕
← 返回
cs.LG updates on arXiv.org
AI
更新于 2026-05-15 01:27
共 100 条
1
MathlibPR: Pull Request Merge-Readiness Benchmark for Formal Mathematical Libraries
↗
2
Block-R1: Rethinking the Role of Block Size in Multi-domain Reinforcement Learning for Diffusion Large Language Models
↗
3
How Do Transformers Learn to Associate Tokens: Gradient Leading Terms Bring Mechanistic Interpretability
↗
4
Cascaded Flow Matching for Heterogeneous Tabular Data with Mixed-Type Features
↗
5
Stochastic Dimension-Free Zeroth-Order Estimator for High-Dimensional and High-Order PINNs
↗
6
Parallel Scan Recurrent Neural Quantum States for Scalable Variational Monte Carlo
↗
7
CR-Net: Scaling Parameter-Efficient Training with Cross-Layer Low-Rank Structure
↗
8
Multi-Rollout On-Policy Distillation via Peer Successes and Failures
↗
9
Plan Before You Trade: Inference-Time Optimization for RL Trading Agents
↗
10
Population Risk Bounds for Kolmogorov-Arnold Networks Trained by DP-SGD with Correlated Noise
↗
11
Runtime Monitoring of Perception-Based Autonomous Systems via Embedding Temporal Logic
↗
12
Learning to Decide with AI Assistance under Human-Alignment
↗
13
OceanCBM: A Concept Bottleneck Model for Mechanistic Interpretability in Ocean Forecasting
↗
14
Towards Robust Federated Multimodal Graph Learning under Modality Heterogeneity
↗
15
Discrete Diffusion for Complex and Congested Multi-Agent Path Finding with Sparse Social Attention
↗
16
scShapeBench: Discovering geometry from high dimensional scRNAseq data
↗
17
ArcVQ-VAE: A Spherical Vector Quantization Framework with ArcCosine Additive Margin
↗
18
ODRPO: Ordinal Decompositions of Discrete Rewards for Robust Policy Optimization
↗
19
Efficient distributional regression trees learning algorithms for calibrated non-parametric probabilistic forecasts
↗
20
Parallel-in-Time Training of Recurrent Neural Networks for Dynamical Systems Reconstruction
↗
21
Energy Scaling Laws for Diffusion Models: Quantifying Compute in Image Generation
↗
22
A Unified Perspective for Learning Graph Representations Across Multi-Level Abstractions
↗
23
TS-Haystack: A Multi-Task Retrieval Benchmark for Long-Context Time-Series Reasoning
↗
24
IGT-OMD: Implicit Gradient Transport for Decision-Focused Learning under Delayed Feedback
↗
25
A Call to Lagrangian Action: Learning Population Mechanics from Temporal Snapshots
↗
26
Modeling Heterophily in Multiplex Graphs: An Adaptive Approach for Node Classification
↗
27
Do Activation Verbalization Methods Convey Privileged Information?
↗
28
UFO: A Domain-Unification-Free Operator Framework for Generalized Operator Learning
↗
29
SubspaceAD: Training-Free Few-Shot Anomaly Detection via Subspace Modeling
↗
30
Do Fair Models Reason Fairly? Counterfactual Explanation Consistency for Procedural Fairness in Credit Decisions
↗
31
When and Why is Optimistic Multiplicative Weights Slow? The Geometry of Energy Dissipation
↗
32
Early Data Exposure Improves Robustness to Subsequent Fine-Tuning
↗
33
Continual Learning with Multilingual Foundation Model
↗
34
A Resampling-Based Framework for Network Structure Learning in High-Dimensional Data
↗
35
Characterizing Universal Object Representations Across Vision Models
↗
36
Spectral Energy Centroid: a Metric for Improving Performance and Analyzing Spectral Bias in Implicit Neural Representations
↗
37
Tighter Learning Guarantees on Digital Computers via Concentration of Measure on Finite Spaces
↗
38
Layer-wise Representation Dynamics: An Empirical Investigation Across Embedders and Base LLMs
↗
39
RDMA: Cost Effective Agent-Driven Rare Disease Mining from Electronic Health Records
↗
40
Scaling Laws for Mixture Pretraining Under Data Constraints
↗
41
Latent-Augmented Discrete Diffusion Models
↗
42
Before the Last Token: Diagnosing Final-Token Safety Probe Failures
↗
43
High-Order Epistasis Detection Using Factorization Machine with Quadratic Optimization Annealing and MDR-Based Evaluation
↗
44
From Generalist to Specialist Representation
↗
45
Pragmatic Curiosity: A Unified Framework for Hybrid Learning and Optimization via Active Inference
↗
46
ConRetroBert: EMA Stabilized Dual Encoders for Template-Based Single-Step Retrosynthesis
↗
47
Data Agent: Learning to Select Data via End-to-End Dynamic Optimization
↗
48
Learning with Rare Success but Rich Feedback via Reflection-Enhanced Self-Distillation
↗
49
(Sparse) Attention to the Details: Preserving Spectral Fidelity in ML-based Weather Forecasting Models
↗
50
Low-Rank Adapters Initialization via Gradient Surgery for Continual Learning
↗
51
DataMaster: Data-Centric Autonomous AI Research
↗
52
Constraint-Aware Flow Matching: Decision Aligned End-to-End Training for Constrained Sampling
↗
53
A3 : an Analytical Low-Rank Approximation Framework for Attention
↗
54
Predicting Channel Closures in the Lightning Network with Machine Learning
↗
55
Differentially Private Nonparametric Confidence Intervals Under Minimal Distributional Assumptions
↗
56
Multi-Quantile Regression for Extreme Precipitation Downscaling
↗
57
DiscoverLLM: From Executing Intents to Discovering Them
↗
58
State-Space NTK Collapse Near Bifurcations
↗
59
TStore: Rethinking AI Model Hub with Tensor-Centric Compression
↗
60
Inference-Time Machine Unlearning via Gated Activation Redirection
↗
61
LLMs for Secure Hardware Design and Related Problems: Opportunities and Challenges
↗
62
WriteSAE: Sparse Autoencoders for Recurrent State
↗
63
Proximal-Based Generative Modeling for Bayesian Inverse Problems
↗
64
Graph-Based Financial Fraud Detection with Calibrated Risk Scoring and Structural Regularization
↗
65
Context-Aware Web Attack Detection in Open-Source SIEM Systems via MITRE ATT&CK-Enriched Behavioral Profiling
↗
66
ToolMol: Evolutionary Agentic Framework for Multi-objective Drug Discovery
↗
67
On the Limits of Latent Reuse in Diffusion Models
↗
68
Identifying the nonlinear string dynamics with port-Hamiltonian neural networks
↗
69
Reframing preprocessing selection as model-internal calibration in near-infrared spectroscopy: A large-scale benchmark of operator-adaptive PLS and Ridge models
↗
70
From Heuristics to Analytics: Forecasting Effort and Progress in Online Learning
↗
71
Humanwashing -- It Should Leave You Feeling Dirty
↗
72
SoK: A Comprehensive Analysis of the Current Status of Neural Tangent Generalization Attacks with Research Directions
↗
73
R-DMesh: Video-Guided 3D Animation via Rectified Dynamic Mesh Flow
↗
74
Emergent and Subliminal Misalignment Through the Lens of Data-Mediated Transfer
↗
75
Causal Fine-Tuning under Latent Confounded Shift
↗
76
Pitfalls of Unlabeled Disagreement-Based Drift Detection in Streaming Tree Ensembles
↗
77
A Faster Generalized Two-Stage Approximate Top-K
↗
78
Discrete MeanFlow: One-Step Generation via Conditional Transition Kernels
↗
79
Centralized Adaptive Sampling for Reliable Co-Training of Independent Multi-Agent Policies
↗
80
Neurodata Without Boredom: Benchmarking Agentic AI for Data Reuse
↗
81
Beyond Softmax: A Natural Parameterization for Categorical Random Variables
↗
82
Correcting Influence: Unboxing LLM Outputs with Orthogonal Latent Spaces
↗
83
Higher-order Linear Attention
↗
84
AGOP as Explanation: From Feature Learning to Per-Sample Attribution in Image Classifiers
↗
85
QSMOTE-PGM/kPGM: QSMOTE Based PGM and kPGM for Imbalanced Dataset Classification
↗
86
Training Large Language Models to Predict Clinical Events
↗
87
Perceptrons and localization of attention's mean-field landscape
↗
88
Hessian Matching for Machine-Learned Coarse-Grained Molecular Dynamics
↗
89
Auditing Sybil: Explaining Deep Lung Cancer Risk Prediction Through Generative Interventional Attributions
↗
90
Orthrus: Memory-Efficient Parallel Token Generation via Dual-View Diffusion
↗
91
Diffusion-Inspired Reconfiguration of Transformers for Uncertainty Calibration
↗
92
Quantifying Potential Observation Missingness in Inverse Reinforcement Learning
↗
93
Physics-informed neural particle flow for the Bayesian update step
↗
94
Discrete Stochastic Localization for Non-autoregressive Generation
↗
95
Delightful Distributed Policy Gradient
↗
96
Bayesian Model Merging
↗
97
Filter-then-Weight: Online Data Selection and Reweighting for LLM Fine-Tuning
↗
98
Multitask Multimodal Fusion with Tabular Foundation Models for Peak and Durability Prediction of Pertussis Booster Response
↗
99
Multi-Dimensional Behavioral Evaluation of Agentic Stock Prediction Systems Using Large Language Model Judges with Closed-Loop Reinforcement Learning Feedback
↗
100
SMA: Submodular Modality Aligner For Data Efficient Multimodal Learning
↗
🏠
全部
📡
综合
💻
科技
🎬
娱乐
💬
社区
↑