2025-05-23 |
REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders |
Savya Khosla et.al. |
2505.18153v1 |
null |
2025-05-23 |
WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions |
Zizhang Li et.al. |
2505.18151v1 |
null |
2025-05-23 |
Embracing Contradiction: Theoretical Inconsistency Will Not Impede the Road of Building Responsible AI Systems |
Gordon Dai et.al. |
2505.18139v1 |
null |
2025-05-23 |
Graph-Linguistic Fusion: Using Language Models for Wikidata Vandalism Detection |
Mykola Trokhymovych et.al. |
2505.18136v1 |
null |
2025-05-23 |
VideoGameBench: Can Vision-Language Models complete popular video games? |
Alex L. Zhang et.al. |
2505.18134v1 |
null |
2025-05-23 |
One RL to See Them All: Visual Triple Unified Reinforcement Learning |
Yan Ma et.al. |
2505.18129v1 |
null |
2025-05-23 |
Frankentext: Stitching random text fragments into long-form narratives |
Chau Minh Pham et.al. |
2505.18128v1 |
null |
2025-05-23 |
Multiparty entanglement loops in quantum spin liquids |
Liuke Lyu et.al. |
2505.18124v1 |
null |
2025-05-23 |
Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking |
Cheng-Yen Yang et.al. |
2505.18111v1 |
null |
2025-05-23 |
Zeta functions of K3 categories over finite fields |
Asher Auel et.al. |
2505.18104v1 |
null |
2025-05-23 |
How Can I Publish My LLM Benchmark Without Giving the True Answers Away? |
Takashi Ishida et.al. |
2505.18102v1 |
null |
2025-05-23 |
DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations |
Ziqiao Peng et.al. |
2505.18096v1 |
null |
2025-05-23 |
Rotational Multi-material 3D Printing of Soft Robotic Matter with Asymmetrical Embedded Pneumatics |
Jackson K. Wilt et.al. |
2505.18095v1 |
null |
2025-05-23 |
Beyond flat-panel displays, applications of stereographic and holographic devices in 3D microscopy data analysis |
Yong Wan et.al. |
2505.18075v1 |
null |
2025-05-23 |
Assessing the performance of 8 AI chatbots in bibliographic reference retrieval: Grok and DeepSeek outperform ChatGPT, but none are fully accurate |
Álvaro Cabezas-Clavijo et.al. |
2505.18059v1 |
null |
2025-05-23 |
A Foundation Model Framework for Multi-View MRI Classification of Extramural Vascular Invasion and Mesorectal Fascia Invasion in Rectal Cancer |
Yumeng Zhang et.al. |
2505.18058v1 |
null |
2025-05-23 |
SHARDeg: A Benchmark for Skeletal Human Action Recognition in Degraded Scenarios |
Simon Malzard et.al. |
2505.18048v1 |
null |
2025-05-23 |
Learning with Restricted Boltzmann Machines: Asymptotics of AMP and GD in High Dimensions |
Yizhou Xu et.al. |
2505.18046v1 |
null |
2025-05-23 |
Modelling multiwavelength afterglows of the VHE-GRB population |
Monica Barnard et.al. |
2505.18041v1 |
null |
2025-05-23 |
Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation |
Li Zhong et.al. |
2505.18039v1 |
null |
2025-05-23 |
Efficient Conditional Gradient Methods for Solving Stochastic Convex Bilevel Optimization Problems |
Khanh-Hung Giang-Tran et.al. |
2505.18037v1 |
null |
2025-05-23 |
CAMME: Adaptive Deepfake Image Detection with Multi-Modal Cross-Attention |
Naseem Khan et.al. |
2505.18035v1 |
null |
2025-05-23 |
Mahalanobis++: Improving OOD Detection via Feature Normalization |
Maximilian Mueller et.al. |
2505.18032v1 |
null |
2025-05-23 |
Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons |
Hazhar Rahmani et.al. |
2505.18030v1 |
null |
2025-05-23 |
3D Face Reconstruction Error Decomposed: A Modular Benchmark for Fair and Fast Method Evaluation |
Evangelos Sariyanidi et.al. |
2505.18025v1 |
null |
2025-05-23 |
A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency |
Xiaobao Wei et.al. |
2505.18024v1 |
null |
2025-05-23 |
RemoteSAM: Towards Segment Anything for Earth Observation |
Liang Yao et.al. |
2505.18022v1 |
null |
2025-05-23 |
Building Floor Number Estimation from Crowdsourced Street-Level Images: Munich Dataset and Baseline Method |
Yao Sun et.al. |
2505.18021v1 |
null |
2025-05-23 |
SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification |
Shashank Agnihotri et.al. |
2505.18015v1 |
null |
2025-05-23 |
DiFache: Efficient and Scalable Caching on Disaggregated Memory using Decentralized Coherence |
Hanze Zhang et.al. |
2505.18013v1 |
null |