Skip to content

3D Object Tracking

3D Object Tracking

Publish Date Title Authors PDF Code
2025-05-23 REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders Savya Khosla et.al. 2505.18153v1 null
2025-05-23 WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions Zizhang Li et.al. 2505.18151v1 null
2025-05-23 Embracing Contradiction: Theoretical Inconsistency Will Not Impede the Road of Building Responsible AI Systems Gordon Dai et.al. 2505.18139v1 null
2025-05-23 VideoGameBench: Can Vision-Language Models complete popular video games? Alex L. Zhang et.al. 2505.18134v1 null
2025-05-23 One RL to See Them All: Visual Triple Unified Reinforcement Learning Yan Ma et.al. 2505.18129v1 null
2025-05-23 Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking Cheng-Yen Yang et.al. 2505.18111v1 null
2025-05-23 DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations Ziqiao Peng et.al. 2505.18096v1 null
2025-05-23 Rotational Multi-material 3D Printing of Soft Robotic Matter with Asymmetrical Embedded Pneumatics Jackson K. Wilt et.al. 2505.18095v1 null
2025-05-23 DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation Junhao Chen et.al. 2505.18078v1 null
2025-05-23 Beyond flat-panel displays, applications of stereographic and holographic devices in 3D microscopy data analysis Yong Wan et.al. 2505.18075v1 null
2025-05-23 Asymptotically optimal regret in communicating Markov decision processes Victor Boone et.al. 2505.18064v1 null
2025-05-23 SHARDeg: A Benchmark for Skeletal Human Action Recognition in Degraded Scenarios Simon Malzard et.al. 2505.18048v1 null
2025-05-23 Learning with Restricted Boltzmann Machines: Asymptotics of AMP and GD in High Dimensions Yizhou Xu et.al. 2505.18046v1 null
2025-05-23 Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation Li Zhong et.al. 2505.18039v1 null
2025-05-23 Efficient Conditional Gradient Methods for Solving Stochastic Convex Bilevel Optimization Problems Khanh-Hung Giang-Tran et.al. 2505.18037v1 null
2025-05-23 Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons Hazhar Rahmani et.al. 2505.18030v1 null
2025-05-23 3D Face Reconstruction Error Decomposed: A Modular Benchmark for Fair and Fast Method Evaluation Evangelos Sariyanidi et.al. 2505.18025v1 null
2025-05-23 A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency Xiaobao Wei et.al. 2505.18024v1 null
2025-05-23 Building Floor Number Estimation from Crowdsourced Street-Level Images: Munich Dataset and Baseline Method Yao Sun et.al. 2505.18021v1 null
2025-05-23 SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification Shashank Agnihotri et.al. 2505.18015v1 null
2025-05-23 DiFache: Efficient and Scalable Caching on Disaggregated Memory using Decentralized Coherence Hanze Zhang et.al. 2505.18013v1 null
2025-05-23 TRACE for Tracking the Emergence of Semantic Representations in Transformers Nura Aljaafari et.al. 2505.17998v1 null
2025-05-23 A 1.8 m class pathfinder Raman LIDAR for the Northern Site of the Cherenkov Telescope Array Observatory -- Performance Pedro Jose Bauza-Ruiz et.al. 2505.17996v1 null
2025-05-23 Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation Zhihua Liu et.al. 2505.17994v1 null
2025-05-23 Canonical Pose Reconstruction from Single Depth Image for 3D Non-rigid Pose Recovery on Limited Datasets Fahd Alhamazani et.al. 2505.17992v1 null
2025-05-23 To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models Simone Gaisbauer et.al. 2505.17973v1 null
2025-05-23 Explainable Anatomy-Guided AI for Prostate MRI: Foundation Models and In Silico Clinical Trials for Virtual Biopsy-based Risk Assessment Danial Khan et.al. 2505.17971v1 null
2025-05-23 Is Single-View Mesh Reconstruction Ready for Robotics? Frederik Nolte et.al. 2505.17966v1 null
2025-05-23 A Principled Bayesian Framework for Training Binary and Spiking Neural Networks James A. Walker et.al. 2505.17962v1 null
2025-05-23 Mind the Domain Gap: Measuring the Domain Gap Between Real-World and Synthetic Point Clouds for Automated Driving Development Nguyen Duc et.al. 2505.17959v1 null