← 返回
cs.LG updates on arXiv.org

cs.LG updates on arXiv.org

AI
更新于 2026-05-15 01:27 共 100 条
  1. 1 MathlibPR: Pull Request Merge-Readiness Benchmark for Formal Mathematical Libraries
  2. 2 Block-R1: Rethinking the Role of Block Size in Multi-domain Reinforcement Learning for Diffusion Large Language Models
  3. 3 How Do Transformers Learn to Associate Tokens: Gradient Leading Terms Bring Mechanistic Interpretability
  4. 4 Cascaded Flow Matching for Heterogeneous Tabular Data with Mixed-Type Features
  5. 5 Stochastic Dimension-Free Zeroth-Order Estimator for High-Dimensional and High-Order PINNs
  6. 6 Parallel Scan Recurrent Neural Quantum States for Scalable Variational Monte Carlo
  7. 7 CR-Net: Scaling Parameter-Efficient Training with Cross-Layer Low-Rank Structure
  8. 8 Multi-Rollout On-Policy Distillation via Peer Successes and Failures
  9. 9 Plan Before You Trade: Inference-Time Optimization for RL Trading Agents
  10. 10 Population Risk Bounds for Kolmogorov-Arnold Networks Trained by DP-SGD with Correlated Noise
  11. 11 Runtime Monitoring of Perception-Based Autonomous Systems via Embedding Temporal Logic
  12. 12 Learning to Decide with AI Assistance under Human-Alignment
  13. 13 OceanCBM: A Concept Bottleneck Model for Mechanistic Interpretability in Ocean Forecasting
  14. 14 Towards Robust Federated Multimodal Graph Learning under Modality Heterogeneity
  15. 15 Discrete Diffusion for Complex and Congested Multi-Agent Path Finding with Sparse Social Attention
  16. 16 scShapeBench: Discovering geometry from high dimensional scRNAseq data
  17. 17 ArcVQ-VAE: A Spherical Vector Quantization Framework with ArcCosine Additive Margin
  18. 18 ODRPO: Ordinal Decompositions of Discrete Rewards for Robust Policy Optimization
  19. 19 Efficient distributional regression trees learning algorithms for calibrated non-parametric probabilistic forecasts
  20. 20 Parallel-in-Time Training of Recurrent Neural Networks for Dynamical Systems Reconstruction
  21. 21 Energy Scaling Laws for Diffusion Models: Quantifying Compute in Image Generation
  22. 22 A Unified Perspective for Learning Graph Representations Across Multi-Level Abstractions
  23. 23 TS-Haystack: A Multi-Task Retrieval Benchmark for Long-Context Time-Series Reasoning
  24. 24 IGT-OMD: Implicit Gradient Transport for Decision-Focused Learning under Delayed Feedback
  25. 25 A Call to Lagrangian Action: Learning Population Mechanics from Temporal Snapshots
  26. 26 Modeling Heterophily in Multiplex Graphs: An Adaptive Approach for Node Classification
  27. 27 Do Activation Verbalization Methods Convey Privileged Information?
  28. 28 UFO: A Domain-Unification-Free Operator Framework for Generalized Operator Learning
  29. 29 SubspaceAD: Training-Free Few-Shot Anomaly Detection via Subspace Modeling
  30. 30 Do Fair Models Reason Fairly? Counterfactual Explanation Consistency for Procedural Fairness in Credit Decisions
  31. 31 When and Why is Optimistic Multiplicative Weights Slow? The Geometry of Energy Dissipation
  32. 32 Early Data Exposure Improves Robustness to Subsequent Fine-Tuning
  33. 33 Continual Learning with Multilingual Foundation Model
  34. 34 A Resampling-Based Framework for Network Structure Learning in High-Dimensional Data
  35. 35 Characterizing Universal Object Representations Across Vision Models
  36. 36 Spectral Energy Centroid: a Metric for Improving Performance and Analyzing Spectral Bias in Implicit Neural Representations
  37. 37 Tighter Learning Guarantees on Digital Computers via Concentration of Measure on Finite Spaces
  38. 38 Layer-wise Representation Dynamics: An Empirical Investigation Across Embedders and Base LLMs
  39. 39 RDMA: Cost Effective Agent-Driven Rare Disease Mining from Electronic Health Records
  40. 40 Scaling Laws for Mixture Pretraining Under Data Constraints
  41. 41 Latent-Augmented Discrete Diffusion Models
  42. 42 Before the Last Token: Diagnosing Final-Token Safety Probe Failures
  43. 43 High-Order Epistasis Detection Using Factorization Machine with Quadratic Optimization Annealing and MDR-Based Evaluation
  44. 44 From Generalist to Specialist Representation
  45. 45 Pragmatic Curiosity: A Unified Framework for Hybrid Learning and Optimization via Active Inference
  46. 46 ConRetroBert: EMA Stabilized Dual Encoders for Template-Based Single-Step Retrosynthesis
  47. 47 Data Agent: Learning to Select Data via End-to-End Dynamic Optimization
  48. 48 Learning with Rare Success but Rich Feedback via Reflection-Enhanced Self-Distillation
  49. 49 (Sparse) Attention to the Details: Preserving Spectral Fidelity in ML-based Weather Forecasting Models
  50. 50 Low-Rank Adapters Initialization via Gradient Surgery for Continual Learning
  51. 51 DataMaster: Data-Centric Autonomous AI Research
  52. 52 Constraint-Aware Flow Matching: Decision Aligned End-to-End Training for Constrained Sampling
  53. 53 A3 : an Analytical Low-Rank Approximation Framework for Attention
  54. 54 Predicting Channel Closures in the Lightning Network with Machine Learning
  55. 55 Differentially Private Nonparametric Confidence Intervals Under Minimal Distributional Assumptions
  56. 56 Multi-Quantile Regression for Extreme Precipitation Downscaling
  57. 57 DiscoverLLM: From Executing Intents to Discovering Them
  58. 58 State-Space NTK Collapse Near Bifurcations
  59. 59 TStore: Rethinking AI Model Hub with Tensor-Centric Compression
  60. 60 Inference-Time Machine Unlearning via Gated Activation Redirection
  61. 61 LLMs for Secure Hardware Design and Related Problems: Opportunities and Challenges
  62. 62 WriteSAE: Sparse Autoencoders for Recurrent State
  63. 63 Proximal-Based Generative Modeling for Bayesian Inverse Problems
  64. 64 Graph-Based Financial Fraud Detection with Calibrated Risk Scoring and Structural Regularization
  65. 65 Context-Aware Web Attack Detection in Open-Source SIEM Systems via MITRE ATT&CK-Enriched Behavioral Profiling
  66. 66 ToolMol: Evolutionary Agentic Framework for Multi-objective Drug Discovery
  67. 67 On the Limits of Latent Reuse in Diffusion Models
  68. 68 Identifying the nonlinear string dynamics with port-Hamiltonian neural networks
  69. 69 Reframing preprocessing selection as model-internal calibration in near-infrared spectroscopy: A large-scale benchmark of operator-adaptive PLS and Ridge models
  70. 70 From Heuristics to Analytics: Forecasting Effort and Progress in Online Learning
  71. 71 Humanwashing -- It Should Leave You Feeling Dirty
  72. 72 SoK: A Comprehensive Analysis of the Current Status of Neural Tangent Generalization Attacks with Research Directions
  73. 73 R-DMesh: Video-Guided 3D Animation via Rectified Dynamic Mesh Flow
  74. 74 Emergent and Subliminal Misalignment Through the Lens of Data-Mediated Transfer
  75. 75 Causal Fine-Tuning under Latent Confounded Shift
  76. 76 Pitfalls of Unlabeled Disagreement-Based Drift Detection in Streaming Tree Ensembles
  77. 77 A Faster Generalized Two-Stage Approximate Top-K
  78. 78 Discrete MeanFlow: One-Step Generation via Conditional Transition Kernels
  79. 79 Centralized Adaptive Sampling for Reliable Co-Training of Independent Multi-Agent Policies
  80. 80 Neurodata Without Boredom: Benchmarking Agentic AI for Data Reuse
  81. 81 Beyond Softmax: A Natural Parameterization for Categorical Random Variables
  82. 82 Correcting Influence: Unboxing LLM Outputs with Orthogonal Latent Spaces
  83. 83 Higher-order Linear Attention
  84. 84 AGOP as Explanation: From Feature Learning to Per-Sample Attribution in Image Classifiers
  85. 85 QSMOTE-PGM/kPGM: QSMOTE Based PGM and kPGM for Imbalanced Dataset Classification
  86. 86 Training Large Language Models to Predict Clinical Events
  87. 87 Perceptrons and localization of attention's mean-field landscape
  88. 88 Hessian Matching for Machine-Learned Coarse-Grained Molecular Dynamics
  89. 89 Auditing Sybil: Explaining Deep Lung Cancer Risk Prediction Through Generative Interventional Attributions
  90. 90 Orthrus: Memory-Efficient Parallel Token Generation via Dual-View Diffusion
  91. 91 Diffusion-Inspired Reconfiguration of Transformers for Uncertainty Calibration
  92. 92 Quantifying Potential Observation Missingness in Inverse Reinforcement Learning
  93. 93 Physics-informed neural particle flow for the Bayesian update step
  94. 94 Discrete Stochastic Localization for Non-autoregressive Generation
  95. 95 Delightful Distributed Policy Gradient
  96. 96 Bayesian Model Merging
  97. 97 Filter-then-Weight: Online Data Selection and Reweighting for LLM Fine-Tuning
  98. 98 Multitask Multimodal Fusion with Tabular Foundation Models for Peak and Durability Prediction of Pertussis Booster Response
  99. 99 Multi-Dimensional Behavioral Evaluation of Agentic Stock Prediction Systems Using Large Language Model Judges with Closed-Loop Reinforcement Learning Feedback
  100. 100 SMA: Submodular Modality Aligner For Data Efficient Multimodal Learning