2025-06-20 |
No Free Lunch: Rethinking Internal Feedback for LLM Reasoning |
Yanzhi Zhang et.al. |
2506.17219v1 |
null |
2025-06-20 |
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens |
Zeyuan Yang et.al. |
2506.17218v1 |
null |
2025-06-20 |
Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting |
Tianjiao Yu et.al. |
2506.17212v1 |
null |
2025-06-20 |
BREAD: Branched Rollouts from Expert Anchors Bridge SFT & RL for Reasoning |
Xuechen Zhang et.al. |
2506.17211v1 |
null |
2025-06-20 |
DreamCube: 3D Panorama Generation via Multi-plane Synchronization |
Yukun Huang et.al. |
2506.17206v1 |
null |
2025-06-20 |
Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning |
Guozheng Ma et.al. |
2506.17204v1 |
null |
2025-06-20 |
UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation |
Teng Li et.al. |
2506.17202v1 |
null |
2025-06-20 |
Tighter Error Bounds for the qDRIFT Algorithm |
I. J. David et.al. |
2506.17199v1 |
null |
2025-06-20 |
Dex1B: Learning with 1B Demonstrations for Dexterous Manipulation |
Jianglong Ye et.al. |
2506.17198v1 |
null |
2025-06-20 |
Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres |
Samuel Howard et.al. |
2506.17197v1 |
null |
2025-06-20 |
Detecting LLM-Generated Short Answers and Effects on Learner Performance |
Shambhavi Bhushan et.al. |
2506.17196v1 |
null |
2025-06-20 |
Facial Landmark Visualization and Emotion Recognition Through Neural Networks |
Israel Juárez-Jiménez et.al. |
2506.17191v1 |
null |
2025-06-20 |
Optimal Implicit Bias in Linear Regression |
Kanumuri Nithin Varma et.al. |
2506.17187v1 |
null |
2025-06-20 |
YASMOT: Yet another stereo image multi-object tracker |
Ketil Malde et.al. |
2506.17186v1 |
null |
2025-06-20 |
A Common Pool of Privacy Problems: Legal and Technical Lessons from a Large-Scale Web-Scraped Machine Learning Dataset |
Rachel Hong et.al. |
2506.17185v1 |
null |
2025-06-20 |
Variational Learning of Disentangled Representations |
Yuli Slavutsky et.al. |
2506.17182v1 |
null |
2025-06-20 |
Deep generative models as the probability transformation functions |
Vitalii Bondar et.al. |
2506.17171v1 |
null |
2025-06-20 |
Continual Learning with Columnar Spiking Neural Networks |
Denis Larionov et.al. |
2506.17169v1 |
null |
2025-06-20 |
Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model |
Side Liu et.al. |
2506.17162v1 |
null |
2025-06-20 |
Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation |
Qing Xu et.al. |
2506.17159v1 |
null |
2025-06-20 |
Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity |
Samin Yeasar Arnob et.al. |
2506.17155v1 |
null |
2025-06-20 |
Profile monitoring of random functions with Gaussian process basis expansions |
Takayuki Iguchi et.al. |
2506.17153v1 |
null |
2025-06-20 |
Do We Need Large VLMs for Spotting Soccer Actions? |
Ritabrata Chakraborty et.al. |
2506.17144v1 |
null |
2025-06-20 |
MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification |
David Jacob Drexlin et.al. |
2506.17140v1 |
null |
2025-06-20 |
Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion Models |
Michael Plainer et.al. |
2506.17139v1 |
null |
2025-06-20 |
Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations |
Dongdong Meng et.al. |
2506.17136v1 |
null |
2025-06-20 |
Robust Training with Data Augmentation for Medical Imaging Classification |
Josué Martínez-Martínez et.al. |
2506.17133v1 |
null |
2025-06-20 |
Chain-of-Trust: A Progressive Trust Evaluation Framework Enabled by Generative AI |
Botao Zhu et.al. |
2506.17130v1 |
null |
2025-06-20 |
Rapid and Continuous Trust Evaluation for Effective Task Collaboration Through Siamese Model |
Botao Zhu et.al. |
2506.17128v1 |
null |
2025-06-20 |
Large Language Model Unlearning for Source Code |
Xue Jiang et.al. |
2506.17125v1 |
null |