2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et al. | 2505.18153v1 | null
2025-05-23 | Embracing Contradiction: Theoretical Inconsistency Will Not Impede the Road of Building Responsible AI Systems | Gordon Dai et al. | 2505.18139v1 | null
2025-05-23 | VideoGameBench: Can Vision-Language Models complete popular video games? | Alex L. Zhang et al. | 2505.18134v1 | null
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et al. | 2505.18129v1 | null
2025-05-23 | Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking | Cheng-Yen Yang et al. | 2505.18111v1 | null
2025-05-23 | DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation | Junhao Chen et al. | 2505.18078v1 | null
2025-05-23 | Asymptotically optimal regret in communicating Markov decision processes | Victor Boone et al. | 2505.18064v1 | null
2025-05-23 | Learning with Restricted Boltzmann Machines: Asymptotics of AMP and GD in High Dimensions | Yizhou Xu et al. | 2505.18046v1 | null
2025-05-23 | Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation | Li Zhong et al. | 2505.18039v1 | null
2025-05-23 | Efficient Conditional Gradient Methods for Solving Stochastic Convex Bilevel Optimization Problems | Khanh-Hung Giang-Tran et al. | 2505.18037v1 | null
2025-05-23 | Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons | Hazhar Rahmani et al. | 2505.18030v1 | null
2025-05-23 | A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency | Xiaobao Wei et al. | 2505.18024v1 | null
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et al. | 2505.18015v1 | null
2025-05-23 | DiFache: Efficient and Scalable Caching on Disaggregated Memory using Decentralized Coherence | Hanze Zhang et al. | 2505.18013v1 | null
2025-05-23 | TRACE for Tracking the Emergence of Semantic Representations in Transformers | Nura Aljaafari et al. | 2505.17998v1 | null
2025-05-23 | Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation | Zhihua Liu et al. | 2505.17994v1 | null
2025-05-23 | Canonical Pose Reconstruction from Single Depth Image for 3D Non-rigid Pose Recovery on Limited Datasets | Fahd Alhamazani et al. | 2505.17992v1 | null
2025-05-23 | A Principled Bayesian Framework for Training Binary and Spiking Neural Networks | James A. Walker et al. | 2505.17962v1 | null
2025-05-23 | Mind the Domain Gap: Measuring the Domain Gap Between Real-World and Synthetic Point Clouds for Automated Driving Development | Nguyen Duc et al. | 2505.17959v1 | null
2025-05-23 | The impact of compact object deformation on thin accretion disk properties | Shokoufe Faraji et al. | 2505.17924v1 | null
2025-05-23 | Object-level Cross-view Geo-localization with Location Enhancement and Multi-Head Cross Attention | Zheyang Huang et al. | 2505.17911v1 | null
2025-05-23 | Tracking phase entanglement during propagation of downconverted photons | Rounak Chatterjee et al. | 2505.17906v1 | null
2025-05-23 | Geometric Shape Modelling and Volume Estimation of Dry Bulk Cargo Piles using a Single Image | Debanshu Ratha et al. | 2505.17896v1 | null
2025-05-23 | DataRater: Meta-Learned Dataset Curation | Dan A. Calian et al. | 2505.17895v1 | null
2025-05-23 | A model-free approach to control barrier functions using funnel control | Lukas Lanza et al. | 2505.17887v1 | null
2025-05-23 | Track Anything Annotate: Video annotation and dataset generation of computer vision models | Nikita Ivanov et al. | 2505.17884v1 | null
2025-05-23 | FastCAV: Efficient Computation of Concept Activation Vectors for Explaining Deep Neural Networks | Laines Schmalwasser et al. | 2505.17883v1 | null
2025-05-23 | Semi-Supervised Multi-Label Feature Selection with Consistent Sparse Graph Learning | Yan Zhong et al. | 2505.17875v1 | null
2025-05-23 | BLAST: Balanced Sampling Time Series Corpus for Universal Forecasting Models | Zezhi Shao et al. | 2505.17871v1 | null
2025-05-23 | Best Group Identification in Multi-Objective Bandits | Mohammad Shahverdikondori et al. | 2505.17869v1 | null