| 2025-06-20 | VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning | Zhangyang Qi et al. | 2506.17221v1 | null |
| 2025-06-20 | Emergent Temporal Correspondences from Video Diffusion Transformers | Jisu Nam et al. | 2506.17220v1 | link |
| 2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et al. | 2506.17219v1 | null |
| 2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et al. | 2506.17218v1 | null |
| 2025-06-20 | Bias hardened estimators of patchy screening profiles | Noah Sailer et al. | 2506.17217v1 | null |
| 2025-06-20 | Hierarchical constraints on gravitational waves from horizonless compact objects | Rajrupa Mondal et al. | 2506.17215v1 | null |
| 2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et al. | 2506.17212v1 | null |
| 2025-06-20 | DreamCube: 3D Panorama Generation via Multi-plane Synchronization | Yukun Huang et al. | 2506.17206v1 | null |
| 2025-06-20 | Efficient Implementation of Multi-sensor Adaptive Birth Samplers for Labeled Random Finite Set Tracking | Jennifer Bondarchuk et al. | 2506.17205v1 | null |
| 2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et al. | 2506.17197v1 | null |
| 2025-06-20 | Detecting LLM-Generated Short Answers and Effects on Learner Performance | Shambhavi Bhushan et al. | 2506.17196v1 | null |
| 2025-06-20 | Gravitational lensing observables in stationary and axisymmetric solutions in general relativity | Matteo Luca Ruggiero et al. | 2506.17192v1 | null |
| 2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et al. | 2506.17186v1 | null |
| 2025-06-20 | Variational Learning of Disentangled Representations | Yuli Slavutsky et al. | 2506.17182v1 | null |
| 2025-06-20 | Feedback cooling scheme for an optically levitated oscillator with controlled cross-talk | J. M. H. Gosling et al. | 2506.17172v1 | null |
| 2025-06-20 | Partition function for several Ising model interface structures | Alessio Squarcini et al. | 2506.17170v1 | null |
| 2025-06-20 | Scaling limits for sample autocovariance operators of Hilbert space-valued linear processes | Marie-Christine Düker et al. | 2506.17168v1 | null |
| 2025-06-20 | Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model | Side Liu et al. | 2506.17162v1 | null |
| 2025-06-20 | Profile monitoring of random functions with Gaussian process basis expansions | Takayuki Iguchi et al. | 2506.17153v1 | null |
| 2025-06-20 | Fully Self-Consistent Semiclassical Gravity | R. Muciño et al. | 2506.17149v1 | null |
| 2025-06-20 | Do We Need Large VLMs for Spotting Soccer Actions? | Ritabrata Chakraborty et al. | 2506.17144v1 | null |
| 2025-06-20 | On the Theory of Conditional Feature Alignment for Unsupervised Domain-Adaptive Counting | Zhuonan Liang et al. | 2506.17137v1 | null |
| 2025-06-20 | Dynamic Watermark Generation for Digital Images using Perimeter Gated SPAD Imager PUFs | Md Sakibur Sajal et al. | 2506.17134v1 | null |
| 2025-06-20 | Robust Training with Data Augmentation for Medical Imaging Classification | Josué Martínez-Martínez et al. | 2506.17133v1 | null |
| 2025-06-20 | Rapid and Continuous Trust Evaluation for Effective Task Collaboration Through Siamese Model | Botao Zhu et al. | 2506.17128v1 | null |
| 2025-06-20 | Reassessing Code Authorship Attribution in the Era of Language Models | Atish Kumar Dipongkor et al. | 2506.17120v1 | null |
| 2025-06-20 | RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking | Teng Guo et al. | 2506.17119v1 | null |
| 2025-06-20 | MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation | Shoubin Yu et al. | 2506.17113v1 | null |
| 2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et al. | 2506.17110v1 | null |
| 2025-06-20 | Searching for a Hidden Markov Anomaly over Multiple Processes | Levli Citron et al. | 2506.17108v1 | null |