2025-06-20 |
Emergent Temporal Correspondences from Video Diffusion Transformers |
Jisu Nam et.al. |
2506.17220v1 |
link |
2025-06-20 |
No Free Lunch: Rethinking Internal Feedback for LLM Reasoning |
Yanzhi Zhang et.al. |
2506.17219v1 |
null |
2025-06-20 |
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens |
Zeyuan Yang et.al. |
2506.17218v1 |
null |
2025-06-20 |
DreamCube: 3D Panorama Generation via Multi-plane Synchronization |
Yukun Huang et.al. |
2506.17206v1 |
null |
2025-06-20 |
UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation |
Teng Li et.al. |
2506.17202v1 |
null |
2025-06-20 |
Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres |
Samuel Howard et.al. |
2506.17197v1 |
null |
2025-06-20 |
Gravitational lensing observables in stationary and axisymmetric solutions in general relativity |
Matteo Luca Ruggiero et.al. |
2506.17192v1 |
null |
2025-06-20 |
Facial Landmark Visualization and Emotion Recognition Through Neural Networks |
Israel Juárez-Jiménez et.al. |
2506.17191v1 |
null |
2025-06-20 |
YASMOT: Yet another stereo image multi-object tracker |
Ketil Malde et.al. |
2506.17186v1 |
null |
2025-06-20 |
Variational Learning of Disentangled Representations |
Yuli Slavutsky et.al. |
2506.17182v1 |
null |
2025-06-20 |
High-accuracy inference using HfO$_x$S$_y$/HfS$_2$ Memristors |
Aferdita Xhameni et.al. |
2506.17174v1 |
null |
2025-06-20 |
Deep generative models as the probability transformation functions |
Vitalii Bondar et.al. |
2506.17171v1 |
null |
2025-06-20 |
Proportional Sensitivity in Generative Adversarial Network (GAN)-Augmented Brain Tumor Classification Using Convolutional Neural Network |
Mahin Montasir Afif et.al. |
2506.17165v1 |
null |
2025-06-20 |
Walking Fingerprinting Using Wrist Accelerometry During Activities of Daily Living in NHANES |
Lily Koffman et.al. |
2506.17160v1 |
null |
2025-06-20 |
Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation |
Qing Xu et.al. |
2506.17159v1 |
null |
2025-06-20 |
Shock formation in 1D conservation laws II: Vanishing viscosity |
John Anderson et.al. |
2506.17156v1 |
null |
2025-06-20 |
Do We Need Large VLMs for Spotting Soccer Actions? |
Ritabrata Chakraborty et.al. |
2506.17144v1 |
null |
2025-06-20 |
MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification |
David Jacob Drexlin et.al. |
2506.17140v1 |
null |
2025-06-20 |
Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations |
Dongdong Meng et.al. |
2506.17136v1 |
null |
2025-06-20 |
Dynamic Watermark Generation for Digital Images using Perimeter Gated SPAD Imager PUFs |
Md Sakibur Sajal et.al. |
2506.17134v1 |
null |
2025-06-20 |
Robust Training with Data Augmentation for Medical Imaging Classification |
Josué Martínez-Martínez et.al. |
2506.17133v1 |
null |
2025-06-20 |
Real-time Broadband RFI Excision for the Upgraded GMRT |
Ruta Kale et.al. |
2506.17131v1 |
null |
2025-06-20 |
Large Average Subtensor Problem: Ground-State, Algorithms, and Algorithmic Barriers |
Abhishek Hegade K. R. et.al. |
2506.17118v1 |
null |
2025-06-20 |
Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping |
Teng Guo et.al. |
2506.17110v1 |
null |
2025-06-20 |
Open-Path Methane Sensing via Backscattered Light in a Nonlinear Interferometer |
Jinghan Dong et.al. |
2506.17107v1 |
null |
2025-06-20 |
Neural Polar Decoders for DNA Data Storage |
Ziv Aharoni et.al. |
2506.17076v1 |
null |
2025-06-20 |
Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion |
Wang Zhao et.al. |
2506.17074v1 |
null |
2025-06-20 |
Client Selection Strategies for Federated Semantic Communications in Heterogeneous IoT Networks |
Samer Lahoud et.al. |
2506.17063v1 |
null |
2025-06-20 |
From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers |
Jingtong Su et.al. |
2506.17052v1 |
null |
2025-06-20 |
Navigating the Deep: Signature Extraction on Deep Neural Networks |
Haolin Liu et.al. |
2506.17047v1 |
null |