arxiv-daily
Automated deployment @ 2025-05-26 11:48:23 Asia/Shanghai
Welcome to contribute! Add your topics and keywords in
topic.yml
. You can also view historical data through the storage.
3D Vision
3D Object Detection
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions | Zizhang Li et.al. | 2505.18151v1 | null |
2025-05-23 | Embracing Contradiction: Theoretical Inconsistency Will Not Impede the Road of Building Responsible AI Systems | Gordon Dai et.al. | 2505.18139v1 | null |
2025-05-23 | Graph-Linguistic Fusion: Using Language Models for Wikidata Vandalism Detection | Mykola Trokhymovych et.al. | 2505.18136v1 | null |
2025-05-23 | VideoGameBench: Can Vision-Language Models complete popular video games? | Alex L. Zhang et.al. | 2505.18134v1 | null |
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et.al. | 2505.18129v1 | null |
2025-05-23 | Frankentext: Stitching random text fragments into long-form narratives | Chau Minh Pham et.al. | 2505.18128v1 | null |
2025-05-23 | Multiparty entanglement loops in quantum spin liquids | Liuke Lyu et.al. | 2505.18124v1 | null |
2025-05-23 | Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking | Cheng-Yen Yang et.al. | 2505.18111v1 | null |
2025-05-23 | Zeta functions of K3 categories over finite fields | Asher Auel et.al. | 2505.18104v1 | null |
2025-05-23 | How Can I Publish My LLM Benchmark Without Giving the True Answers Away? | Takashi Ishida et.al. | 2505.18102v1 | null |
2025-05-23 | DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations | Ziqiao Peng et.al. | 2505.18096v1 | null |
2025-05-23 | Rotational Multi-material 3D Printing of Soft Robotic Matter with Asymmetrical Embedded Pneumatics | Jackson K. Wilt et.al. | 2505.18095v1 | null |
2025-05-23 | Beyond flat-panel displays, applications of stereographic and holographic devices in 3D microscopy data analysis | Yong Wan et.al. | 2505.18075v1 | null |
2025-05-23 | Assessing the performance of 8 AI chatbots in bibliographic reference retrieval: Grok and DeepSeek outperform ChatGPT, but none are fully accurate | Álvaro Cabezas-Clavijo et.al. | 2505.18059v1 | null |
2025-05-23 | A Foundation Model Framework for Multi-View MRI Classification of Extramural Vascular Invasion and Mesorectal Fascia Invasion in Rectal Cancer | Yumeng Zhang et.al. | 2505.18058v1 | null |
2025-05-23 | SHARDeg: A Benchmark for Skeletal Human Action Recognition in Degraded Scenarios | Simon Malzard et.al. | 2505.18048v1 | null |
2025-05-23 | Learning with Restricted Boltzmann Machines: Asymptotics of AMP and GD in High Dimensions | Yizhou Xu et.al. | 2505.18046v1 | null |
2025-05-23 | Modelling multiwavelength afterglows of the VHE-GRB population | Monica Barnard et.al. | 2505.18041v1 | null |
2025-05-23 | Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation | Li Zhong et.al. | 2505.18039v1 | null |
2025-05-23 | Efficient Conditional Gradient Methods for Solving Stochastic Convex Bilevel Optimization Problems | Khanh-Hung Giang-Tran et.al. | 2505.18037v1 | null |
2025-05-23 | CAMME: Adaptive Deepfake Image Detection with Multi-Modal Cross-Attention | Naseem Khan et.al. | 2505.18035v1 | null |
2025-05-23 | Mahalanobis++: Improving OOD Detection via Feature Normalization | Maximilian Mueller et.al. | 2505.18032v1 | null |
2025-05-23 | Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons | Hazhar Rahmani et.al. | 2505.18030v1 | null |
2025-05-23 | 3D Face Reconstruction Error Decomposed: A Modular Benchmark for Fair and Fast Method Evaluation | Evangelos Sariyanidi et.al. | 2505.18025v1 | null |
2025-05-23 | A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency | Xiaobao Wei et.al. | 2505.18024v1 | null |
2025-05-23 | RemoteSAM: Towards Segment Anything for Earth Observation | Liang Yao et.al. | 2505.18022v1 | null |
2025-05-23 | Building Floor Number Estimation from Crowdsourced Street-Level Images: Munich Dataset and Baseline Method | Yao Sun et.al. | 2505.18021v1 | null |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015v1 | null |
2025-05-23 | DiFache: Efficient and Scalable Caching on Disaggregated Memory using Decentralized Coherence | Hanze Zhang et.al. | 2505.18013v1 | null |
Point Cloud Completion
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | Generative Distribution Embeddings | Nic Fishman et.al. | 2505.18150v1 | null |
2025-05-23 | First Finish Search: Efficient Test-Time Scaling in Large Language Models | Aradhye Agarwal et.al. | 2505.18149v1 | null |
2025-05-23 | TokBench: Evaluating Your Visual Tokenizer before Visual Generation | Junfeng Wu et.al. | 2505.18142v1 | null |
2025-05-23 | Embracing Contradiction: Theoretical Inconsistency Will Not Impede the Road of Building Responsible AI Systems | Gordon Dai et.al. | 2505.18139v1 | null |
2025-05-23 | VideoGameBench: Can Vision-Language Models complete popular video games? | Alex L. Zhang et.al. | 2505.18134v1 | null |
2025-05-23 | Comment on "Geometry of the Grosse-Wulkenhaar model" | Dragan Prekrat et.al. | 2505.18123v1 | null |
2025-05-23 | ProgRM: Build Better GUI Agents with Progress Rewards | Danyang Zhang et.al. | 2505.18121v1 | null |
2025-05-23 | Zeta functions of K3 categories over finite fields | Asher Auel et.al. | 2505.18104v1 | null |
2025-05-23 | How Can I Publish My LLM Benchmark Without Giving the True Answers Away? | Takashi Ishida et.al. | 2505.18102v1 | null |
2025-05-23 | Ballistic macroscopic fluctuation theory via mapping to point particles | Jitendra Kethepalli et.al. | 2505.18093v1 | null |
2025-05-23 | QwenLong-CPRS: Towards $\infty$-LLMs with Dynamic Context Optimization | Weizhou Shen et.al. | 2505.18092v1 | null |
2025-05-23 | Explaining the extra crystal field mode in ACeX2 | Allen O. Scheie et.al. | 2505.18089v1 | null |
2025-05-23 | Stable Reinforcement Learning for Efficient Reasoning | Muzhi Dai et.al. | 2505.18086v1 | null |
2025-05-23 | First astrometric constraints on parity-violation in the gravitational wave background | Santiago Jaraba et.al. | 2505.18085v1 | null |
2025-05-23 | The Noether formalism for constructing conserved quantities in teleparallel equivalents of general relativity | E. D. Emtsova et.al. | 2505.18084v1 | null |
2025-05-23 | Bayesian Deep Learning for Discrete Choice | Daniel F. Villarraga et.al. | 2505.18077v1 | null |
2025-05-23 | Modelling multiwavelength afterglows of the VHE-GRB population | Monica Barnard et.al. | 2505.18041v1 | null |
2025-05-23 | Rethinking Climate Econometrics: Data Cleaning, Flexible Trend Controls, and Predictive Validation | Christof Schötz et.al. | 2505.18033v1 | null |
2025-05-23 | Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons | Hazhar Rahmani et.al. | 2505.18030v1 | null |
2025-05-23 | 3D Face Reconstruction Error Decomposed: A Modular Benchmark for Fair and Fast Method Evaluation | Evangelos Sariyanidi et.al. | 2505.18025v1 | null |
2025-05-23 | LLM assisted web application functional requirements generation: A case study of four popular LLMs over a Mess Management System | Rashmi Gupta et.al. | 2505.18019v1 | null |
2025-05-23 | ExoGait-MS: Learning Periodic Dynamics with Multi-Scale Graph Network for Exoskeleton Gait Recognition | Lijiang Liu et.al. | 2505.18018v1 | null |
2025-05-23 | Pressure tuning of competing interactions on a honeycomb lattice | Piyush Sakrikar et.al. | 2505.18016v1 | null |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015v1 | null |
2025-05-23 | On the geometric $k$-colored crossing number of $K_n$ | Benedikt Hahn et.al. | 2505.18014v1 | null |
2025-05-23 | Empathic network learning for multi-expert emergency decision-making under incomplete and inconsistent information | Simin Shen et.al. | 2505.18009v1 | null |
2025-05-23 | A 1.8 m class pathfinder Raman LIDAR for the Northern Site of the Cherenkov Telescope Array Observatory -- Performance | Pedro Jose Bauza-Ruiz et.al. | 2505.17996v1 | null |
2025-05-23 | Dark Matter EFT Landscape Probed by QUEST-DMC | QUEST-DMC Collaboration et.al. | 2505.17995v1 | null |
2025-05-23 | Finding d-Cuts in Claw-free Graphs | Jungho Ahn et.al. | 2505.17993v1 | null |
Point Cloud
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | Generative Distribution Embeddings | Nic Fishman et.al. | 2505.18150v1 | null |
2025-05-23 | Comment on "Geometry of the Grosse-Wulkenhaar model" | Dragan Prekrat et.al. | 2505.18123v1 | null |
2025-05-23 | Zeta functions of K3 categories over finite fields | Asher Auel et.al. | 2505.18104v1 | null |
2025-05-23 | Ballistic macroscopic fluctuation theory via mapping to point particles | Jitendra Kethepalli et.al. | 2505.18093v1 | null |
2025-05-23 | QwenLong-CPRS: Towards $\infty$-LLMs with Dynamic Context Optimization | Weizhou Shen et.al. | 2505.18092v1 | null |
2025-05-23 | Explaining the extra crystal field mode in ACeX2 | Allen O. Scheie et.al. | 2505.18089v1 | null |
2025-05-23 | First astrometric constraints on parity-violation in the gravitational wave background | Santiago Jaraba et.al. | 2505.18085v1 | null |
2025-05-23 | Bayesian Deep Learning for Discrete Choice | Daniel F. Villarraga et.al. | 2505.18077v1 | null |
2025-05-23 | Rethinking Climate Econometrics: Data Cleaning, Flexible Trend Controls, and Predictive Validation | Christof Schötz et.al. | 2505.18033v1 | null |
2025-05-23 | 3D Face Reconstruction Error Decomposed: A Modular Benchmark for Fair and Fast Method Evaluation | Evangelos Sariyanidi et.al. | 2505.18025v1 | null |
2025-05-23 | Pressure tuning of competing interactions on a honeycomb lattice | Piyush Sakrikar et.al. | 2505.18016v1 | null |
2025-05-23 | A 1.8 m class pathfinder Raman LIDAR for the Northern Site of the Cherenkov Telescope Array Observatory -- Performance | Pedro Jose Bauza-Ruiz et.al. | 2505.17996v1 | null |
2025-05-23 | Outcome-based Reinforcement Learning to Predict the Future | Benjamin Turtel et.al. | 2505.17989v1 | null |
2025-05-23 | Inflaton Dynamics in Higher-Derivative Scalar-Tensor Theories of Gravity | Sam E. Brady et.al. | 2505.17986v1 | null |
2025-05-23 | Mind the Domain Gap: Measuring the Domain Gap Between Real-World and Synthetic Point Clouds for Automated Driving Development | Nguyen Duc et.al. | 2505.17959v1 | null |
2025-05-23 | The Upward-Driven Disk, a Steadily Forced Chaotic Pendulum | Leo Maas et.al. | 2505.17957v1 | null |
2025-05-23 | Counting quadratic points on Fano varieties | Francesca Balestrieri et.al. | 2505.17940v1 | null |
2025-05-23 | Isospectrality and non-locality of generalized Dirac combs | Giuliano Angelone et.al. | 2505.17920v1 | null |
2025-05-23 | Tunability of the magnetic properties in Ni intercalated transition metal dichalcogenide NbSe$_2$ | Xujia Gong et.al. | 2505.17916v1 | null |
2025-05-23 | Promptable cancer segmentation using minimal expert-curated data | Lynn Karam et.al. | 2505.17915v1 | null |
2025-05-23 | Legendrian doubles, twist spuns, and clusters | James Hughes et.al. | 2505.17901v1 | null |
2025-05-23 | DataRater: Meta-Learned Dataset Curation | Dan A. Calian et.al. | 2505.17895v1 | null |
2025-05-23 | Anisotropic spin-polarized conductivity in collinear altermagnets | Mingbo Dou et.al. | 2505.17888v1 | null |
2025-05-23 | LLM4SP: Large Language Models for Scatterer Prediction via Synesthesia of Machines | Zengrui Han et.al. | 2505.17879v1 | null |
2025-05-23 | Robust Distributed Estimation: Extending Gossip Algorithms to Ranking and Trimmed Means | Anna Van Elst et.al. | 2505.17836v1 | null |
2025-05-23 | Quantifying uncertainty in spectral clusterings: expectations for perturbed and incomplete data | Jürgen Dölz et.al. | 2505.17819v1 | null |
2025-05-23 | Light-Driven Bound State of Interacting Impurities in a Dirac-Like Bath | Vinayak M. Kulkarni et.al. | 2505.17811v1 | null |
2025-05-23 | Hyperparameter Optimization via Interacting with Probabilistic Circuits | Jonas Seng et.al. | 2505.17804v1 | null |
2025-05-23 | Anytime-valid simultaneous lower confidence bounds for the true discovery proportion | Friederike Preusse et.al. | 2505.17803v1 | null |
Visual Localization
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | UNJOIN: Enhancing Multi-Table Text-to-SQL Generation via Schema Simplification | Poojah Ganesan et.al. | 2505.18122v1 | null |
2025-05-23 | Bidirectional Knowledge Distillation for Enhancing Sequential Recommendation with Large Language Models | Jiongran Wu et.al. | 2505.18120v1 | null |
2025-05-23 | Facility Location with Public Locations and Private Doubly-Peaked Costs | Richard Cole et.al. | 2505.18114v1 | null |
2025-05-23 | Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking | Cheng-Yen Yang et.al. | 2505.18111v1 | null |
2025-05-23 | ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework | Lisheng Huang et.al. | 2505.18105v1 | null |
2025-05-23 | Assessing the performance of 8 AI chatbots in bibliographic reference retrieval: Grok and DeepSeek outperform ChatGPT, but none are fully accurate | Álvaro Cabezas-Clavijo et.al. | 2505.18059v1 | null |
2025-05-23 | SpikeGen: Generative Framework for Visual Spike Stream Processing | Gaole Dai et.al. | 2505.18049v1 | null |
2025-05-23 | Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation | Li Zhong et.al. | 2505.18039v1 | null |
2025-05-23 | AI Literacy for Legal AI Systems: A practical approach | Gizem Gultekin-Varkonyi et.al. | 2505.18006v1 | null |
2025-05-23 | Revisiting Feature Interactions from the Perspective of Quadratic Neural Networks for Click-through Rate Prediction | Honghao Li et.al. | 2505.17999v1 | null |
2025-05-23 | A 1.8 m class pathfinder Raman LIDAR for the Northern Site of the Cherenkov Telescope Array Observatory -- Performance | Pedro Jose Bauza-Ruiz et.al. | 2505.17996v1 | null |
2025-05-23 | AVerImaTeC: A Dataset for Automatic Verification of Image-Text Claims with Evidence from the Web | Rui Cao et.al. | 2505.17978v1 | null |
2025-05-23 | To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models | Simone Gaisbauer et.al. | 2505.17973v1 | null |
2025-05-23 | The Effects of Climate and Weather on Economic Output: Evidence from Global Subnational Data | Jinchi Dong et.al. | 2505.17946v1 | null |
2025-05-23 | Enhancing CTR Prediction with De-correlated Expert Networks | Jiancheng Wang et.al. | 2505.17925v1 | null |
2025-05-23 | Object-level Cross-view Geo-localization with Location Enhancement and Multi-Head Cross Attention | Zheyang Huang et.al. | 2505.17911v1 | null |
2025-05-23 | Superplatforms Have to Attack AI Agents | Jianghao Lin et.al. | 2505.17861v1 | null |
2025-05-23 | Out of the Shadows: Exploring a Latent Space for Neural Network Verification | Lukas Koller et.al. | 2505.17854v1 | null |
2025-05-23 | The NEXT-100 Detector | NEXT Collaboration et.al. | 2505.17848v1 | null |
2025-05-23 | The bridge function as a functional of the radial distribution function: Operator learning and application | Martin Panholzer et.al. | 2505.17840v1 | null |
2025-05-23 | Low-Resource NMT: A Case Study on the Written and Spoken Languages in Hong Kong | Hei Yi Mak et.al. | 2505.17816v1 | null |
2025-05-23 | VIBE: Vector Index Benchmark for Embeddings | Elias Jääsaari et.al. | 2505.17810v1 | null |
2025-05-23 | DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval | Yuxin Yang et.al. | 2505.17796v1 | null |
2025-05-23 | RECIPE-TKG: From Sparse History to Structured Reasoning for LLM-based Temporal Knowledge Graph Completion | Ömer Faruk Akgül et.al. | 2505.17794v1 | null |
2025-05-23 | Resolving Conflicting Evidence in Automated Fact-Checking: A Study on Retrieval-Augmented LLMs | Ziyu Ge et.al. | 2505.17762v1 | null |
2025-05-23 | Modeling Ranking Properties with In-Context Learning | Nilanjan Sinhababu et.al. | 2505.17736v1 | null |
2025-05-23 | RQR3D: Reparametrizing the regression targets for BEV-based 3D object detection | Ozsel Kilinc et.al. | 2505.17732v1 | null |
2025-05-23 | Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery | Ming Hu et.al. | 2505.17677v1 | null |
2025-05-23 | Stereotype Detection in Natural Language Processing | Alessandra Teresa Cignarella et.al. | 2505.17642v1 | null |
3D Object Tracking
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions | Zizhang Li et.al. | 2505.18151v1 | null |
2025-05-23 | Embracing Contradiction: Theoretical Inconsistency Will Not Impede the Road of Building Responsible AI Systems | Gordon Dai et.al. | 2505.18139v1 | null |
2025-05-23 | VideoGameBench: Can Vision-Language Models complete popular video games? | Alex L. Zhang et.al. | 2505.18134v1 | null |
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et.al. | 2505.18129v1 | null |
2025-05-23 | Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking | Cheng-Yen Yang et.al. | 2505.18111v1 | null |
2025-05-23 | DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations | Ziqiao Peng et.al. | 2505.18096v1 | null |
2025-05-23 | Rotational Multi-material 3D Printing of Soft Robotic Matter with Asymmetrical Embedded Pneumatics | Jackson K. Wilt et.al. | 2505.18095v1 | null |
2025-05-23 | DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation | Junhao Chen et.al. | 2505.18078v1 | null |
2025-05-23 | Beyond flat-panel displays, applications of stereographic and holographic devices in 3D microscopy data analysis | Yong Wan et.al. | 2505.18075v1 | null |
2025-05-23 | Asymptotically optimal regret in communicating Markov decision processes | Victor Boone et.al. | 2505.18064v1 | null |
2025-05-23 | SHARDeg: A Benchmark for Skeletal Human Action Recognition in Degraded Scenarios | Simon Malzard et.al. | 2505.18048v1 | null |
2025-05-23 | Learning with Restricted Boltzmann Machines: Asymptotics of AMP and GD in High Dimensions | Yizhou Xu et.al. | 2505.18046v1 | null |
2025-05-23 | Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation | Li Zhong et.al. | 2505.18039v1 | null |
2025-05-23 | Efficient Conditional Gradient Methods for Solving Stochastic Convex Bilevel Optimization Problems | Khanh-Hung Giang-Tran et.al. | 2505.18037v1 | null |
2025-05-23 | Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons | Hazhar Rahmani et.al. | 2505.18030v1 | null |
2025-05-23 | 3D Face Reconstruction Error Decomposed: A Modular Benchmark for Fair and Fast Method Evaluation | Evangelos Sariyanidi et.al. | 2505.18025v1 | null |
2025-05-23 | A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency | Xiaobao Wei et.al. | 2505.18024v1 | null |
2025-05-23 | Building Floor Number Estimation from Crowdsourced Street-Level Images: Munich Dataset and Baseline Method | Yao Sun et.al. | 2505.18021v1 | null |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015v1 | null |
2025-05-23 | DiFache: Efficient and Scalable Caching on Disaggregated Memory using Decentralized Coherence | Hanze Zhang et.al. | 2505.18013v1 | null |
2025-05-23 | TRACE for Tracking the Emergence of Semantic Representations in Transformers | Nura Aljaafari et.al. | 2505.17998v1 | null |
2025-05-23 | A 1.8 m class pathfinder Raman LIDAR for the Northern Site of the Cherenkov Telescope Array Observatory -- Performance | Pedro Jose Bauza-Ruiz et.al. | 2505.17996v1 | null |
2025-05-23 | Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation | Zhihua Liu et.al. | 2505.17994v1 | null |
2025-05-23 | Canonical Pose Reconstruction from Single Depth Image for 3D Non-rigid Pose Recovery on Limited Datasets | Fahd Alhamazani et.al. | 2505.17992v1 | null |
2025-05-23 | To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models | Simone Gaisbauer et.al. | 2505.17973v1 | null |
2025-05-23 | Explainable Anatomy-Guided AI for Prostate MRI: Foundation Models and In Silico Clinical Trials for Virtual Biopsy-based Risk Assessment | Danial Khan et.al. | 2505.17971v1 | null |
2025-05-23 | Is Single-View Mesh Reconstruction Ready for Robotics? | Frederik Nolte et.al. | 2505.17966v1 | null |
2025-05-23 | A Principled Bayesian Framework for Training Binary and Spiking Neural Networks | James A. Walker et.al. | 2505.17962v1 | null |
2025-05-23 | Mind the Domain Gap: Measuring the Domain Gap Between Real-World and Synthetic Point Clouds for Automated Driving Development | Nguyen Duc et.al. | 2505.17959v1 | null |
Point Cloud Registration
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | Generative Distribution Embeddings | Nic Fishman et.al. | 2505.18150v1 | null |
2025-05-23 | Comment on "Geometry of the Grosse-Wulkenhaar model" | Dragan Prekrat et.al. | 2505.18123v1 | null |
2025-05-23 | Zeta functions of K3 categories over finite fields | Asher Auel et.al. | 2505.18104v1 | null |
2025-05-23 | Ballistic macroscopic fluctuation theory via mapping to point particles | Jitendra Kethepalli et.al. | 2505.18093v1 | null |
2025-05-23 | QwenLong-CPRS: Towards $\infty$-LLMs with Dynamic Context Optimization | Weizhou Shen et.al. | 2505.18092v1 | null |
2025-05-23 | Explaining the extra crystal field mode in ACeX2 | Allen O. Scheie et.al. | 2505.18089v1 | null |
2025-05-23 | First astrometric constraints on parity-violation in the gravitational wave background | Santiago Jaraba et.al. | 2505.18085v1 | null |
2025-05-23 | Bayesian Deep Learning for Discrete Choice | Daniel F. Villarraga et.al. | 2505.18077v1 | null |
2025-05-23 | Rethinking Climate Econometrics: Data Cleaning, Flexible Trend Controls, and Predictive Validation | Christof Schötz et.al. | 2505.18033v1 | null |
2025-05-23 | 3D Face Reconstruction Error Decomposed: A Modular Benchmark for Fair and Fast Method Evaluation | Evangelos Sariyanidi et.al. | 2505.18025v1 | null |
2025-05-23 | Pressure tuning of competing interactions on a honeycomb lattice | Piyush Sakrikar et.al. | 2505.18016v1 | null |
2025-05-23 | A 1.8 m class pathfinder Raman LIDAR for the Northern Site of the Cherenkov Telescope Array Observatory -- Performance | Pedro Jose Bauza-Ruiz et.al. | 2505.17996v1 | null |
2025-05-23 | Outcome-based Reinforcement Learning to Predict the Future | Benjamin Turtel et.al. | 2505.17989v1 | null |
2025-05-23 | Inflaton Dynamics in Higher-Derivative Scalar-Tensor Theories of Gravity | Sam E. Brady et.al. | 2505.17986v1 | null |
2025-05-23 | To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models | Simone Gaisbauer et.al. | 2505.17973v1 | null |
2025-05-23 | Mind the Domain Gap: Measuring the Domain Gap Between Real-World and Synthetic Point Clouds for Automated Driving Development | Nguyen Duc et.al. | 2505.17959v1 | null |
2025-05-23 | The Upward-Driven Disk, a Steadily Forced Chaotic Pendulum | Leo Maas et.al. | 2505.17957v1 | null |
2025-05-23 | Counting quadratic points on Fano varieties | Francesca Balestrieri et.al. | 2505.17940v1 | null |
2025-05-23 | Isospectrality and non-locality of generalized Dirac combs | Giuliano Angelone et.al. | 2505.17920v1 | null |
2025-05-23 | Tunability of the magnetic properties in Ni intercalated transition metal dichalcogenide NbSe$_2$ | Xujia Gong et.al. | 2505.17916v1 | null |
2025-05-23 | Promptable cancer segmentation using minimal expert-curated data | Lynn Karam et.al. | 2505.17915v1 | null |
2025-05-23 | Legendrian doubles, twist spuns, and clusters | James Hughes et.al. | 2505.17901v1 | null |
2025-05-23 | DataRater: Meta-Learned Dataset Curation | Dan A. Calian et.al. | 2505.17895v1 | null |
2025-05-23 | Anisotropic spin-polarized conductivity in collinear altermagnets | Mingbo Dou et.al. | 2505.17888v1 | null |
2025-05-23 | LLM4SP: Large Language Models for Scatterer Prediction via Synesthesia of Machines | Zengrui Han et.al. | 2505.17879v1 | null |
2025-05-23 | Robust Distributed Estimation: Extending Gossip Algorithms to Ranking and Trimmed Means | Anna Van Elst et.al. | 2505.17836v1 | null |
2025-05-23 | Quantifying uncertainty in spectral clusterings: expectations for perturbed and incomplete data | Jürgen Dölz et.al. | 2505.17819v1 | null |
2025-05-23 | Light-Driven Bound State of Interacting Impurities in a Dirac-Like Bath | Vinayak M. Kulkarni et.al. | 2505.17811v1 | null |
2025-05-23 | Hyperparameter Optimization via Interacting with Probabilistic Circuits | Jonas Seng et.al. | 2505.17804v1 | null |
Point Cloud Matching
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | Generative Distribution Embeddings | Nic Fishman et.al. | 2505.18150v1 | null |
2025-05-23 | First Finish Search: Efficient Test-Time Scaling in Large Language Models | Aradhye Agarwal et.al. | 2505.18149v1 | null |
2025-05-23 | INN-FF: A Scalable and Efficient Machine Learning Potential for Molecular Dynamics | Taskin Mehereen et.al. | 2505.18141v1 | null |
2025-05-23 | Comment on "Geometry of the Grosse-Wulkenhaar model" | Dragan Prekrat et.al. | 2505.18123v1 | null |
2025-05-23 | UNJOIN: Enhancing Multi-Table Text-to-SQL Generation via Schema Simplification | Poojah Ganesan et.al. | 2505.18122v1 | null |
2025-05-23 | Bridging Supervised Learning and Reinforcement Learning in Math Reasoning | Huayu Chen et.al. | 2505.18116v1 | null |
2025-05-23 | Zeta functions of K3 categories over finite fields | Asher Auel et.al. | 2505.18104v1 | null |
2025-05-23 | Ballistic macroscopic fluctuation theory via mapping to point particles | Jitendra Kethepalli et.al. | 2505.18093v1 | null |
2025-05-23 | QwenLong-CPRS: Towards $\infty$-LLMs with Dynamic Context Optimization | Weizhou Shen et.al. | 2505.18092v1 | null |
2025-05-23 | Explaining the extra crystal field mode in ACeX2 | Allen O. Scheie et.al. | 2505.18089v1 | null |
2025-05-23 | Early-Exit Graph Neural Networks | Andrea Giuseppe Di Francesco et.al. | 2505.18088v1 | null |
2025-05-23 | First astrometric constraints on parity-violation in the gravitational wave background | Santiago Jaraba et.al. | 2505.18085v1 | null |
2025-05-23 | Bayesian Deep Learning for Discrete Choice | Daniel F. Villarraga et.al. | 2505.18077v1 | null |
2025-05-23 | Posted Pricing and Competition in Large Markets | José Correa et.al. | 2505.18061v1 | null |
2025-05-23 | Semantic Correspondence: Unified Benchmarking and a Strong Baseline | Kaiyan Zhang et.al. | 2505.18060v1 | link |
2025-05-23 | BOTM: Echocardiography Segmentation via Bi-directional Optimal Token Matching | Zhihua Liu et.al. | 2505.18052v1 | null |
2025-05-23 | Rethinking Climate Econometrics: Data Cleaning, Flexible Trend Controls, and Predictive Validation | Christof Schötz et.al. | 2505.18033v1 | null |
2025-05-23 | 3D Face Reconstruction Error Decomposed: A Modular Benchmark for Fair and Fast Method Evaluation | Evangelos Sariyanidi et.al. | 2505.18025v1 | null |
2025-05-23 | A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency | Xiaobao Wei et.al. | 2505.18024v1 | null |
2025-05-23 | Pressure tuning of competing interactions on a honeycomb lattice | Piyush Sakrikar et.al. | 2505.18016v1 | null |
2025-05-23 | A 1.8 m class pathfinder Raman LIDAR for the Northern Site of the Cherenkov Telescope Array Observatory -- Performance | Pedro Jose Bauza-Ruiz et.al. | 2505.17996v1 | null |
2025-05-23 | Finding d-Cuts in Claw-free Graphs | Jungho Ahn et.al. | 2505.17993v1 | null |
2025-05-23 | Outcome-based Reinforcement Learning to Predict the Future | Benjamin Turtel et.al. | 2505.17989v1 | null |
2025-05-23 | Towards Revealing the Effectiveness of Small-Scale Fine-tuning in R1-style Reinforcement Learning | Yutong Chen et.al. | 2505.17988v1 | null |
2025-05-23 | Inflaton Dynamics in Higher-Derivative Scalar-Tensor Theories of Gravity | Sam E. Brady et.al. | 2505.17986v1 | null |
2025-05-23 | Positive codegree thresholds for perfect matchings in hypergraphs | Richard Mycroft et.al. | 2505.17981v1 | null |
2025-05-23 | To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models | Simone Gaisbauer et.al. | 2505.17973v1 | null |
2025-05-23 | SVD-Free Low-Rank Adaptive Gradient Optimization for Large Language Models | Ionut-Vlad Modoranu et.al. | 2505.17967v1 | null |
2025-05-23 | A Principled Bayesian Framework for Training Binary and Spiking Neural Networks | James A. Walker et.al. | 2505.17962v1 | null |
Point Cloud Segmentation
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | Generative Distribution Embeddings | Nic Fishman et.al. | 2505.18150v1 | null |
2025-05-23 | Frankentext: Stitching random text fragments into long-form narratives | Chau Minh Pham et.al. | 2505.18128v1 | null |
2025-05-23 | Comment on "Geometry of the Grosse-Wulkenhaar model" | Dragan Prekrat et.al. | 2505.18123v1 | null |
2025-05-23 | From Temporal to Spatial: Designing Spatialized Interactions with Segmented-audios in Immersive Environments for Active Engagement with Performing Arts Intangible Cultural Heritage | Yuqi Wang et.al. | 2505.18112v1 | null |
2025-05-23 | Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking | Cheng-Yen Yang et.al. | 2505.18111v1 | null |
2025-05-23 | F-ANcGAN: An Attention-Enhanced Cycle Consistent Generative Adversarial Architecture for Synthetic Image Generation of Nanoparticles | Varun Ajith et.al. | 2505.18106v1 | null |
2025-05-23 | Zeta functions of K3 categories over finite fields | Asher Auel et.al. | 2505.18104v1 | null |
2025-05-23 | Ballistic macroscopic fluctuation theory via mapping to point particles | Jitendra Kethepalli et.al. | 2505.18093v1 | null |
2025-05-23 | QwenLong-CPRS: Towards $\infty$-LLMs with Dynamic Context Optimization | Weizhou Shen et.al. | 2505.18092v1 | null |
2025-05-23 | Explaining the extra crystal field mode in ACeX2 | Allen O. Scheie et.al. | 2505.18089v1 | null |
2025-05-23 | CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays | Hyungyung Lee et.al. | 2505.18087v1 | null |
2025-05-23 | First astrometric constraints on parity-violation in the gravitational wave background | Santiago Jaraba et.al. | 2505.18085v1 | null |
2025-05-23 | Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding | Xiaoyi Zhang et.al. | 2505.18079v1 | null |
2025-05-23 | Bayesian Deep Learning for Discrete Choice | Daniel F. Villarraga et.al. | 2505.18077v1 | null |
2025-05-23 | BOTM: Echocardiography Segmentation via Bi-directional Optimal Token Matching | Zhihua Liu et.al. | 2505.18052v1 | null |
2025-05-23 | LookWhere? Efficient Visual Recognition by Learning Where to Look and What to See from Self-Supervision | Anthony Fuller et.al. | 2505.18051v1 | null |
2025-05-23 | Rethinking Climate Econometrics: Data Cleaning, Flexible Trend Controls, and Predictive Validation | Christof Schötz et.al. | 2505.18033v1 | null |
2025-05-23 | 3D Face Reconstruction Error Decomposed: A Modular Benchmark for Fair and Fast Method Evaluation | Evangelos Sariyanidi et.al. | 2505.18025v1 | null |
2025-05-23 | RemoteSAM: Towards Segment Anything for Earth Observation | Liang Yao et.al. | 2505.18022v1 | null |
2025-05-23 | Pressure tuning of competing interactions on a honeycomb lattice | Piyush Sakrikar et.al. | 2505.18016v1 | null |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015v1 | null |
2025-05-23 | Classification of assembly tasks combining multiple primitive actions using Transformers and xLSTMs | Miguel Neves et.al. | 2505.18012v1 | null |
2025-05-23 | A 1.8 m class pathfinder Raman LIDAR for the Northern Site of the Cherenkov Telescope Array Observatory -- Performance | Pedro Jose Bauza-Ruiz et.al. | 2505.17996v1 | null |
2025-05-23 | Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation | Zhihua Liu et.al. | 2505.17994v1 | null |
2025-05-23 | Outcome-based Reinforcement Learning to Predict the Future | Benjamin Turtel et.al. | 2505.17989v1 | null |
2025-05-23 | Inflaton Dynamics in Higher-Derivative Scalar-Tensor Theories of Gravity | Sam E. Brady et.al. | 2505.17986v1 | null |
2025-05-23 | MR-EEGWaveNet: Multiresolutional EEGWaveNet for Seizure Detection from Long EEG Recordings | Kazi Mahmudul Hassan et.al. | 2505.17972v1 | null |
2025-05-23 | Explainable Anatomy-Guided AI for Prostate MRI: Foundation Models and In Silico Clinical Trials for Virtual Biopsy-based Risk Assessment | Danial Khan et.al. | 2505.17971v1 | null |
2025-05-23 | Mind the Domain Gap: Measuring the Domain Gap Between Real-World and Synthetic Point Clouds for Automated Driving Development | Nguyen Duc et.al. | 2505.17959v1 | null |
Keypoint Detection
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Semantic Correspondence: Unified Benchmarking and a Strong Baseline | Kaiyan Zhang et.al. | 2505.18060v1 | link |
2025-05-23 | Contrastive Distillation of Emotion Knowledge from LLMs for Zero-Shot Emotion Recognition | Minxue Niu et.al. | 2505.18040v1 | null |
2025-05-23 | Directed Semi-Simplicial Learning with Applications to Brain Activity Decoding | Manuel Lecha et.al. | 2505.17939v1 | null |
2025-05-23 | ICPL-ReID: Identity-Conditional Prompt Learning for Multi-Spectral Object Re-Identification | Shihao Li et.al. | 2505.17821v1 | null |
2025-05-23 | RQR3D: Reparametrizing the regression targets for BEV-based 3D object detection | Ozsel Kilinc et.al. | 2505.17732v1 | null |
2025-05-23 | MinkUNeXt-SI: Improving point cloud-based place recognition including spherical coordinates and LiDAR intensity | Judith Vilella-Cantos et.al. | 2505.17591v1 | null |
2025-05-23 | PoseBH: Prototypical Multi-Dataset Training Beyond Human Pose Estimation | Uyoung Jeong et.al. | 2505.17475v1 | link |
2025-05-23 | PawPrint: Whose Footprints Are These? Identifying Animal Individuals by Their Footprints | Inpyo Song et.al. | 2505.17445v1 | null |
2025-05-22 | LengthLogD: A Length-Stratified Ensemble Framework for Enhanced Peptide Lipophilicity Prediction via Multi-Scale Feature Integration | Shuang Wu et.al. | 2505.17198v1 | null |
2025-05-22 | Towards Texture- And Shape-Independent 3D Keypoint Estimation in Birds | Valentin Schmuker et.al. | 2505.16633v1 | null |
2025-05-22 | Motion Matters: Compact Gaussian Streaming for Free-Viewpoint Video Reconstruction | Jiacong Chen et.al. | 2505.16533v1 | null |
2025-05-22 | TAT-VPR: Ternary Adaptive Transformer for Dynamic and Efficient Visual Place Recognition | Oliver Grainge et.al. | 2505.16447v1 | null |
2025-05-22 | RE-TRIP : Reflectivity Instance Augmented Triangle Descriptor for 3D Place Recognition | Yechan Park et.al. | 2505.16165v1 | link |
2025-05-22 | GMatch: Geometry-Constrained Feature Matching for RGB-D Object Pose Estimation | Ming Yang et.al. | 2505.16144v1 | null |
2025-05-21 | Mouse Lockbox Dataset: Behavior Recognition for Mice Solving Lockboxes | Patrik Reiske et.al. | 2505.15408v1 | null |
2025-05-21 | On the Relevance of Clinical Assessment Tasks for the Automatic Detection of Parkinson's Disease Medication State from Speech | David Gimeno-Gómez et.al. | 2505.15378v1 | null |
2025-05-20 | UPTor: Unified 3D Human Pose Dynamics and Trajectory Prediction for Human-Robot Interaction | Nisarga Nilavadi et.al. | 2505.14866v1 | null |
2025-05-20 | AudSemThinker: Enhancing Audio-Language Models through Reasoning over Semantics of Sound | Gijs Wijngaard et.al. | 2505.14142v1 | link |
2025-05-20 | Place Recognition: A Comprehensive Review, Current Challenges and Future Directions | Zhenyu Li et.al. | 2505.14068v2 | link |
2025-05-20 | Active-Spin-State-Derived Descriptor for Hydrogen Evolution Reaction Catalysis | Yu Tan et.al. | 2505.13786v1 | null |
2025-05-19 | RECON: Robust symmetry discovery via Explicit Canonical Orientation Normalization | Alonso Urbano et.al. | 2505.13289v1 | null |
2025-05-19 | The Way Up: A Dataset for Hold Usage Detection in Sport Climbing | Anna Maschek et.al. | 2505.12854v1 | null |
2025-05-19 | Chain-Talker: Chain Understanding and Rendering for Empathetic Conversational Speech Synthesis | Yifan Hu et.al. | 2505.12597v1 | link |
2025-05-18 | DS-ProGen: A Dual-Structure Deep Language Model for Functional Protein Design | Yanting Li et.al. | 2505.12511v1 | null |
2025-05-18 | SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving | Muleilan Pei et.al. | 2505.12246v1 | null |
2025-05-17 | Understanding the Capabilities of Molecular Graph Neural Networks in Materials Science Through Multimodal Learning and Physical Context Encoding | Can Polat et.al. | 2505.12137v1 | null |
2025-05-17 | Keypoints as Dynamic Centroids for Unified Human Pose and Segmentation | Niaz Ahmad et.al. | 2505.12130v1 | null |
2025-05-17 | Prediction of Novel CXCR7 Inhibitors Using QSAR Modeling and Validation via Molecular Docking | Belaguppa Manjunath Ashwin Desai et.al. | 2505.12055v1 | null |
2025-05-17 | Continuous Domain Generalization | Zekun Cai et.al. | 2505.13519v1 | null |
2025-05-17 | Accelerating the Search for Superconductors Using Machine Learning | Suhas Adiga et.al. | 2505.11964v1 | link |
Image Matching
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions | Zizhang Li et.al. | 2505.18151v1 | null |
2025-05-23 | Generative Distribution Embeddings | Nic Fishman et.al. | 2505.18150v1 | null |
2025-05-23 | First Finish Search: Efficient Test-Time Scaling in Large Language Models | Aradhye Agarwal et.al. | 2505.18149v1 | null |
2025-05-23 | TokBench: Evaluating Your Visual Tokenizer before Visual Generation | Junfeng Wu et.al. | 2505.18142v1 | null |
2025-05-23 | INN-FF: A Scalable and Efficient Machine Learning Potential for Molecular Dynamics | Taskin Mehereen et.al. | 2505.18141v1 | null |
2025-05-23 | UNJOIN: Enhancing Multi-Table Text-to-SQL Generation via Schema Simplification | Poojah Ganesan et.al. | 2505.18122v1 | null |
2025-05-23 | Bridging Supervised Learning and Reinforcement Learning in Math Reasoning | Huayu Chen et.al. | 2505.18116v1 | null |
2025-05-23 | Instructify: Demystifying Metadata to Visual Instruction Tuning Data Conversion | Jacob Hansen et.al. | 2505.18115v1 | null |
2025-05-23 | Accelerating Learned Image Compression Through Modeling Neural Training Dynamics | Yichi Zhang et.al. | 2505.18107v1 | null |
2025-05-23 | F-ANcGAN: An Attention-Enhanced Cycle Consistent Generative Adversarial Architecture for Synthetic Image Generation of Nanoparticles | Varun Ajith et.al. | 2505.18106v1 | null |
2025-05-23 | Structural Dynamics of Harmful Content Dissemination on WhatsApp | Yuxin Liu et.al. | 2505.18099v1 | null |
2025-05-23 | Early-Exit Graph Neural Networks | Andrea Giuseppe Di Francesco et.al. | 2505.18088v1 | null |
2025-05-23 | DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation | Junhao Chen et.al. | 2505.18078v1 | null |
2025-05-23 | Image rotation in plasmas | Renaud Gueroult et.al. | 2505.18062v1 | null |
2025-05-23 | Posted Pricing and Competition in Large Markets | José Correa et.al. | 2505.18061v1 | null |
2025-05-23 | Semantic Correspondence: Unified Benchmarking and a Strong Baseline | Kaiyan Zhang et.al. | 2505.18060v1 | link |
2025-05-23 | A Foundation Model Framework for Multi-View MRI Classification of Extramural Vascular Invasion and Mesorectal Fascia Invasion in Rectal Cancer | Yumeng Zhang et.al. | 2505.18058v1 | null |
2025-05-23 | BOTM: Echocardiography Segmentation via Bi-directional Optimal Token Matching | Zhihua Liu et.al. | 2505.18052v1 | null |
2025-05-23 | LookWhere? Efficient Visual Recognition by Learning Where to Look and What to See from Self-Supervision | Anthony Fuller et.al. | 2505.18051v1 | null |
2025-05-23 | SpikeGen: Generative Framework for Visual Spike Stream Processing | Gaole Dai et.al. | 2505.18049v1 | null |
2025-05-23 | SHARDeg: A Benchmark for Skeletal Human Action Recognition in Degraded Scenarios | Simon Malzard et.al. | 2505.18048v1 | null |
2025-05-23 | RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration | Sudarshan Rajagopalan et.al. | 2505.18047v1 | null |
2025-05-23 | Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation | Li Zhong et.al. | 2505.18039v1 | null |
2025-05-23 | CAMME: Adaptive Deepfake Image Detection with Multi-Modal Cross-Attention | Naseem Khan et.al. | 2505.18035v1 | null |
2025-05-23 | Knot So Simple: A Minimalistic Environment for Spatial Reasoning | Zizhao Chen et.al. | 2505.18028v1 | null |
2025-05-23 | A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency | Xiaobao Wei et.al. | 2505.18024v1 | null |
2025-05-23 | RemoteSAM: Towards Segment Anything for Earth Observation | Liang Yao et.al. | 2505.18022v1 | null |
2025-05-23 | Building Floor Number Estimation from Crowdsourced Street-Level Images: Munich Dataset and Baseline Method | Yao Sun et.al. | 2505.18021v1 | null |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015v1 | null |
Computer Vision
Image Matching
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions | Zizhang Li et.al. | 2505.18151v1 | null |
2025-05-23 | Generative Distribution Embeddings | Nic Fishman et.al. | 2505.18150v1 | null |
2025-05-23 | First Finish Search: Efficient Test-Time Scaling in Large Language Models | Aradhye Agarwal et.al. | 2505.18149v1 | null |
2025-05-23 | TokBench: Evaluating Your Visual Tokenizer before Visual Generation | Junfeng Wu et.al. | 2505.18142v1 | null |
2025-05-23 | INN-FF: A Scalable and Efficient Machine Learning Potential for Molecular Dynamics | Taskin Mehereen et.al. | 2505.18141v1 | null |
2025-05-23 | UNJOIN: Enhancing Multi-Table Text-to-SQL Generation via Schema Simplification | Poojah Ganesan et.al. | 2505.18122v1 | null |
2025-05-23 | Bridging Supervised Learning and Reinforcement Learning in Math Reasoning | Huayu Chen et.al. | 2505.18116v1 | null |
2025-05-23 | Instructify: Demystifying Metadata to Visual Instruction Tuning Data Conversion | Jacob Hansen et.al. | 2505.18115v1 | null |
2025-05-23 | Accelerating Learned Image Compression Through Modeling Neural Training Dynamics | Yichi Zhang et.al. | 2505.18107v1 | null |
2025-05-23 | F-ANcGAN: An Attention-Enhanced Cycle Consistent Generative Adversarial Architecture for Synthetic Image Generation of Nanoparticles | Varun Ajith et.al. | 2505.18106v1 | null |
2025-05-23 | Structural Dynamics of Harmful Content Dissemination on WhatsApp | Yuxin Liu et.al. | 2505.18099v1 | null |
2025-05-23 | Early-Exit Graph Neural Networks | Andrea Giuseppe Di Francesco et.al. | 2505.18088v1 | null |
2025-05-23 | DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation | Junhao Chen et.al. | 2505.18078v1 | null |
2025-05-23 | Image rotation in plasmas | Renaud Gueroult et.al. | 2505.18062v1 | null |
2025-05-23 | Posted Pricing and Competition in Large Markets | José Correa et.al. | 2505.18061v1 | null |
2025-05-23 | Semantic Correspondence: Unified Benchmarking and a Strong Baseline | Kaiyan Zhang et.al. | 2505.18060v1 | link |
2025-05-23 | A Foundation Model Framework for Multi-View MRI Classification of Extramural Vascular Invasion and Mesorectal Fascia Invasion in Rectal Cancer | Yumeng Zhang et.al. | 2505.18058v1 | null |
2025-05-23 | BOTM: Echocardiography Segmentation via Bi-directional Optimal Token Matching | Zhihua Liu et.al. | 2505.18052v1 | null |
2025-05-23 | LookWhere? Efficient Visual Recognition by Learning Where to Look and What to See from Self-Supervision | Anthony Fuller et.al. | 2505.18051v1 | null |
2025-05-23 | SpikeGen: Generative Framework for Visual Spike Stream Processing | Gaole Dai et.al. | 2505.18049v1 | null |
2025-05-23 | SHARDeg: A Benchmark for Skeletal Human Action Recognition in Degraded Scenarios | Simon Malzard et.al. | 2505.18048v1 | null |
2025-05-23 | RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration | Sudarshan Rajagopalan et.al. | 2505.18047v1 | null |
2025-05-23 | Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation | Li Zhong et.al. | 2505.18039v1 | null |
2025-05-23 | CAMME: Adaptive Deepfake Image Detection with Multi-Modal Cross-Attention | Naseem Khan et.al. | 2505.18035v1 | null |
2025-05-23 | Knot So Simple: A Minimalistic Environment for Spatial Reasoning | Zizhao Chen et.al. | 2505.18028v1 | null |
2025-05-23 | A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency | Xiaobao Wei et.al. | 2505.18024v1 | null |
2025-05-23 | RemoteSAM: Towards Segment Anything for Earth Observation | Liang Yao et.al. | 2505.18022v1 | null |
2025-05-23 | Building Floor Number Estimation from Crowdsourced Street-Level Images: Munich Dataset and Baseline Method | Yao Sun et.al. | 2505.18021v1 | null |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015v1 | null |
Multi-Object Tracking
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading | Mohamed Swailem et.al. | 2505.18145v1 | null |
2025-05-23 | TokBench: Evaluating Your Visual Tokenizer before Visual Generation | Junfeng Wu et.al. | 2505.18142v1 | null |
2025-05-23 | Embracing Contradiction: Theoretical Inconsistency Will Not Impede the Road of Building Responsible AI Systems | Gordon Dai et.al. | 2505.18139v1 | null |
2025-05-23 | VideoGameBench: Can Vision-Language Models complete popular video games? | Alex L. Zhang et.al. | 2505.18134v1 | null |
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et.al. | 2505.18129v1 | null |
2025-05-23 | Instructify: Demystifying Metadata to Visual Instruction Tuning Data Conversion | Jacob Hansen et.al. | 2505.18115v1 | null |
2025-05-23 | Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking | Cheng-Yen Yang et.al. | 2505.18111v1 | null |
2025-05-23 | Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM | Zinuo Li et.al. | 2505.18110v1 | null |
2025-05-23 | CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays | Hyungyung Lee et.al. | 2505.18087v1 | null |
2025-05-23 | DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation | Junhao Chen et.al. | 2505.18078v1 | null |
2025-05-23 | Beyond flat-panel displays, applications of stereographic and holographic devices in 3D microscopy data analysis | Yong Wan et.al. | 2505.18075v1 | null |
2025-05-23 | Towards Uncertainty Aware Task Delegation and Human-AI Collaborative Decision-Making | Min Hun Lee et.al. | 2505.18066v1 | null |
2025-05-23 | Asymptotically optimal regret in communicating Markov decision processes | Victor Boone et.al. | 2505.18064v1 | null |
2025-05-23 | Semantic Correspondence: Unified Benchmarking and a Strong Baseline | Kaiyan Zhang et.al. | 2505.18060v1 | link |
2025-05-23 | A Foundation Model Framework for Multi-View MRI Classification of Extramural Vascular Invasion and Mesorectal Fascia Invasion in Rectal Cancer | Yumeng Zhang et.al. | 2505.18058v1 | null |
2025-05-23 | LookWhere? Efficient Visual Recognition by Learning Where to Look and What to See from Self-Supervision | Anthony Fuller et.al. | 2505.18051v1 | null |
2025-05-23 | SpikeGen: Generative Framework for Visual Spike Stream Processing | Gaole Dai et.al. | 2505.18049v1 | null |
2025-05-23 | RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration | Sudarshan Rajagopalan et.al. | 2505.18047v1 | null |
2025-05-23 | Learning with Restricted Boltzmann Machines: Asymptotics of AMP and GD in High Dimensions | Yizhou Xu et.al. | 2505.18046v1 | null |
2025-05-23 | Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation | Li Zhong et.al. | 2505.18039v1 | null |
2025-05-23 | Efficient Conditional Gradient Methods for Solving Stochastic Convex Bilevel Optimization Problems | Khanh-Hung Giang-Tran et.al. | 2505.18037v1 | null |
2025-05-23 | CAMME: Adaptive Deepfake Image Detection with Multi-Modal Cross-Attention | Naseem Khan et.al. | 2505.18035v1 | null |
2025-05-23 | Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons | Hazhar Rahmani et.al. | 2505.18030v1 | null |
2025-05-23 | A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency | Xiaobao Wei et.al. | 2505.18024v1 | null |
2025-05-23 | RemoteSAM: Towards Segment Anything for Earth Observation | Liang Yao et.al. | 2505.18022v1 | null |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015v1 | null |
2025-05-23 | DiFache: Efficient and Scalable Caching on Disaggregated Memory using Decentralized Coherence | Hanze Zhang et.al. | 2505.18013v1 | null |
2025-05-23 | TRACE for Tracking the Emergence of Semantic Representations in Transformers | Nura Aljaafari et.al. | 2505.17998v1 | null |
2025-05-23 | Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation | Zhihua Liu et.al. | 2505.17994v1 | null |
Object Tracking
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | Embracing Contradiction: Theoretical Inconsistency Will Not Impede the Road of Building Responsible AI Systems | Gordon Dai et.al. | 2505.18139v1 | null |
2025-05-23 | VideoGameBench: Can Vision-Language Models complete popular video games? | Alex L. Zhang et.al. | 2505.18134v1 | null |
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et.al. | 2505.18129v1 | null |
2025-05-23 | Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking | Cheng-Yen Yang et.al. | 2505.18111v1 | null |
2025-05-23 | DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation | Junhao Chen et.al. | 2505.18078v1 | null |
2025-05-23 | Asymptotically optimal regret in communicating Markov decision processes | Victor Boone et.al. | 2505.18064v1 | null |
2025-05-23 | Learning with Restricted Boltzmann Machines: Asymptotics of AMP and GD in High Dimensions | Yizhou Xu et.al. | 2505.18046v1 | null |
2025-05-23 | Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation | Li Zhong et.al. | 2505.18039v1 | null |
2025-05-23 | Efficient Conditional Gradient Methods for Solving Stochastic Convex Bilevel Optimization Problems | Khanh-Hung Giang-Tran et.al. | 2505.18037v1 | null |
2025-05-23 | Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons | Hazhar Rahmani et.al. | 2505.18030v1 | null |
2025-05-23 | A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency | Xiaobao Wei et.al. | 2505.18024v1 | null |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015v1 | null |
2025-05-23 | DiFache: Efficient and Scalable Caching on Disaggregated Memory using Decentralized Coherence | Hanze Zhang et.al. | 2505.18013v1 | null |
2025-05-23 | TRACE for Tracking the Emergence of Semantic Representations in Transformers | Nura Aljaafari et.al. | 2505.17998v1 | null |
2025-05-23 | Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation | Zhihua Liu et.al. | 2505.17994v1 | null |
2025-05-23 | Canonical Pose Reconstruction from Single Depth Image for 3D Non-rigid Pose Recovery on Limited Datasets | Fahd Alhamazani et.al. | 2505.17992v1 | null |
2025-05-23 | A Principled Bayesian Framework for Training Binary and Spiking Neural Networks | James A. Walker et.al. | 2505.17962v1 | null |
2025-05-23 | Mind the Domain Gap: Measuring the Domain Gap Between Real-World and Synthetic Point Clouds for Automated Driving Development | Nguyen Duc et.al. | 2505.17959v1 | null |
2025-05-23 | The impact of compact object deformation on thin accretion disk properties | Shokoufe Faraji et.al. | 2505.17924v1 | null |
2025-05-23 | Object-level Cross-view Geo-localization with Location Enhancement and Multi-Head Cross Attention | Zheyang Huang et.al. | 2505.17911v1 | null |
2025-05-23 | Tracking phase entanglement during propagation of downconverted photons | Rounak Chatterjee et.al. | 2505.17906v1 | null |
2025-05-23 | Geometric Shape Modelling and Volume Estimation of Dry Bulk Cargo Piles using a Single Image | Debanshu Ratha et.al. | 2505.17896v1 | null |
2025-05-23 | DataRater: Meta-Learned Dataset Curation | Dan A. Calian et.al. | 2505.17895v1 | null |
2025-05-23 | A model-free approach to control barrier functions using funnel control | Lukas Lanza et.al. | 2505.17887v1 | null |
2025-05-23 | Track Anything Annotate: Video annotation and dataset generation of computer vision models | Nikita Ivanov et.al. | 2505.17884v1 | null |
2025-05-23 | FastCAV: Efficient Computation of Concept Activation Vectors for Explaining Deep Neural Networks | Laines Schmalwasser et.al. | 2505.17883v1 | null |
2025-05-23 | Semi-Supervised Multi-Label Feature Selection with Consistent Sparse Graph Learning | Yan Zhong et.al. | 2505.17875v1 | null |
2025-05-23 | BLAST: Balanced Sampling Time Series Corpus for Universal Forecasting Models | Zezhi Shao et.al. | 2505.17871v1 | null |
2025-05-23 | Best Group Identification in Multi-Objective Bandits | Mohammad Shahverdikondori et.al. | 2505.17869v1 | null |
Semantic Segmentation
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | Fann or Flop: A Multigenre, Multiera Benchmark for Arabic Poetry Understanding in LLMs | Wafa Alghallabi et.al. | 2505.18152v1 | null |
2025-05-23 | Boosting Open Set Recognition Performance through Modulated Representation Learning | Amit Kumar Kundu et.al. | 2505.18137v1 | null |
2025-05-23 | Frankentext: Stitching random text fragments into long-form narratives | Chau Minh Pham et.al. | 2505.18128v1 | null |
2025-05-23 | TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations | Alan Arazi et.al. | 2505.18125v1 | null |
2025-05-23 | Bidirectional Knowledge Distillation for Enhancing Sequential Recommendation with Large Language Models | Jiongran Wu et.al. | 2505.18120v1 | null |
2025-05-23 | From Temporal to Spatial: Designing Spatialized Interactions with Segmented-audios in Immersive Environments for Active Engagement with Performing Arts Intangible Cultural Heritage | Yuqi Wang et.al. | 2505.18112v1 | null |
2025-05-23 | Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking | Cheng-Yen Yang et.al. | 2505.18111v1 | null |
2025-05-23 | F-ANcGAN: An Attention-Enhanced Cycle Consistent Generative Adversarial Architecture for Synthetic Image Generation of Nanoparticles | Varun Ajith et.al. | 2505.18106v1 | null |
2025-05-23 | Dynamic Dual Buffer with Divide-and-Conquer Strategy for Online Continual Learning | Congren Dai et.al. | 2505.18101v1 | null |
2025-05-23 | CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays | Hyungyung Lee et.al. | 2505.18087v1 | null |
2025-05-23 | Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding | Xiaoyi Zhang et.al. | 2505.18079v1 | null |
2025-05-23 | DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation | Junhao Chen et.al. | 2505.18078v1 | null |
2025-05-23 | Semantic Correspondence: Unified Benchmarking and a Strong Baseline | Kaiyan Zhang et.al. | 2505.18060v1 | link |
2025-05-23 | FDBPL: Faster Distillation-Based Prompt Learning for Region-Aware Vision-Language Models Adaptation | Zherui Zhang et.al. | 2505.18053v1 | null |
2025-05-23 | BOTM: Echocardiography Segmentation via Bi-directional Optimal Token Matching | Zhihua Liu et.al. | 2505.18052v1 | null |
2025-05-23 | LookWhere? Efficient Visual Recognition by Learning Where to Look and What to See from Self-Supervision | Anthony Fuller et.al. | 2505.18051v1 | null |
2025-05-23 | RemoteSAM: Towards Segment Anything for Earth Observation | Liang Yao et.al. | 2505.18022v1 | null |
2025-05-23 | LLM assisted web application functional requirements generation: A case study of four popular LLMs over a Mess Management System | Rashmi Gupta et.al. | 2505.18019v1 | null |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015v1 | null |
2025-05-23 | Classification of assembly tasks combining multiple primitive actions using Transformers and xLSTMs | Miguel Neves et.al. | 2505.18012v1 | null |
2025-05-23 | TRACE for Tracking the Emergence of Semantic Representations in Transformers | Nura Aljaafari et.al. | 2505.17998v1 | null |
2025-05-23 | Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation | Zhihua Liu et.al. | 2505.17994v1 | null |
2025-05-23 | ADLGen: Synthesizing Symbolic, Event-Triggered Sensor Sequences for Human Activity Modeling | Weihang You et.al. | 2505.17987v1 | null |
2025-05-23 | Few-Shot Learning from Gigapixel Images via Hierarchical Vision-Language Alignment and Modeling | Bryan Wong et.al. | 2505.17982v1 | null |
2025-05-23 | To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models | Simone Gaisbauer et.al. | 2505.17973v1 | null |
2025-05-23 | MR-EEGWaveNet: Multiresolutional EEGWaveNet for Seizure Detection from Long EEG Recordings | Kazi Mahmudul Hassan et.al. | 2505.17972v1 | null |
2025-05-23 | Explainable Anatomy-Guided AI for Prostate MRI: Foundation Models and In Silico Clinical Trials for Virtual Biopsy-based Risk Assessment | Danial Khan et.al. | 2505.17971v1 | null |
2025-05-23 | Mind the Domain Gap: Measuring the Domain Gap Between Real-World and Synthetic Point Clouds for Automated Driving Development | Nguyen Duc et.al. | 2505.17959v1 | null |
2025-05-23 | AutoMiSeg: Automatic Medical Image Segmentation via Test-Time Adaptation of Foundation Models | Xingjian Li et.al. | 2505.17931v1 | null |
Keypoint Detection
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Semantic Correspondence: Unified Benchmarking and a Strong Baseline | Kaiyan Zhang et.al. | 2505.18060v1 | link |
2025-05-23 | Contrastive Distillation of Emotion Knowledge from LLMs for Zero-Shot Emotion Recognition | Minxue Niu et.al. | 2505.18040v1 | null |
2025-05-23 | Directed Semi-Simplicial Learning with Applications to Brain Activity Decoding | Manuel Lecha et.al. | 2505.17939v1 | null |
2025-05-23 | ICPL-ReID: Identity-Conditional Prompt Learning for Multi-Spectral Object Re-Identification | Shihao Li et.al. | 2505.17821v1 | null |
2025-05-23 | RQR3D: Reparametrizing the regression targets for BEV-based 3D object detection | Ozsel Kilinc et.al. | 2505.17732v1 | null |
2025-05-23 | MinkUNeXt-SI: Improving point cloud-based place recognition including spherical coordinates and LiDAR intensity | Judith Vilella-Cantos et.al. | 2505.17591v1 | null |
2025-05-23 | PoseBH: Prototypical Multi-Dataset Training Beyond Human Pose Estimation | Uyoung Jeong et.al. | 2505.17475v1 | link |
2025-05-23 | PawPrint: Whose Footprints Are These? Identifying Animal Individuals by Their Footprints | Inpyo Song et.al. | 2505.17445v1 | null |
2025-05-22 | LengthLogD: A Length-Stratified Ensemble Framework for Enhanced Peptide Lipophilicity Prediction via Multi-Scale Feature Integration | Shuang Wu et.al. | 2505.17198v1 | null |
2025-05-22 | Towards Texture- And Shape-Independent 3D Keypoint Estimation in Birds | Valentin Schmuker et.al. | 2505.16633v1 | null |
2025-05-22 | Motion Matters: Compact Gaussian Streaming for Free-Viewpoint Video Reconstruction | Jiacong Chen et.al. | 2505.16533v1 | null |
2025-05-22 | TAT-VPR: Ternary Adaptive Transformer for Dynamic and Efficient Visual Place Recognition | Oliver Grainge et.al. | 2505.16447v1 | null |
2025-05-22 | RE-TRIP : Reflectivity Instance Augmented Triangle Descriptor for 3D Place Recognition | Yechan Park et.al. | 2505.16165v1 | link |
2025-05-22 | GMatch: Geometry-Constrained Feature Matching for RGB-D Object Pose Estimation | Ming Yang et.al. | 2505.16144v1 | null |
2025-05-21 | Mouse Lockbox Dataset: Behavior Recognition for Mice Solving Lockboxes | Patrik Reiske et.al. | 2505.15408v1 | null |
2025-05-21 | On the Relevance of Clinical Assessment Tasks for the Automatic Detection of Parkinson's Disease Medication State from Speech | David Gimeno-Gómez et.al. | 2505.15378v1 | null |
2025-05-20 | UPTor: Unified 3D Human Pose Dynamics and Trajectory Prediction for Human-Robot Interaction | Nisarga Nilavadi et.al. | 2505.14866v1 | null |
2025-05-20 | AudSemThinker: Enhancing Audio-Language Models through Reasoning over Semantics of Sound | Gijs Wijngaard et.al. | 2505.14142v1 | link |
2025-05-20 | Place Recognition: A Comprehensive Review, Current Challenges and Future Directions | Zhenyu Li et.al. | 2505.14068v2 | link |
2025-05-20 | Active-Spin-State-Derived Descriptor for Hydrogen Evolution Reaction Catalysis | Yu Tan et.al. | 2505.13786v1 | null |
2025-05-19 | RECON: Robust symmetry discovery via Explicit Canonical Orientation Normalization | Alonso Urbano et.al. | 2505.13289v1 | null |
2025-05-19 | The Way Up: A Dataset for Hold Usage Detection in Sport Climbing | Anna Maschek et.al. | 2505.12854v1 | null |
2025-05-19 | Chain-Talker: Chain Understanding and Rendering for Empathetic Conversational Speech Synthesis | Yifan Hu et.al. | 2505.12597v1 | link |
2025-05-18 | DS-ProGen: A Dual-Structure Deep Language Model for Functional Protein Design | Yanting Li et.al. | 2505.12511v1 | null |
2025-05-18 | SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving | Muleilan Pei et.al. | 2505.12246v1 | null |
2025-05-17 | Understanding the Capabilities of Molecular Graph Neural Networks in Materials Science Through Multimodal Learning and Physical Context Encoding | Can Polat et.al. | 2505.12137v1 | null |
2025-05-17 | Keypoints as Dynamic Centroids for Unified Human Pose and Segmentation | Niaz Ahmad et.al. | 2505.12130v1 | null |
2025-05-17 | Prediction of Novel CXCR7 Inhibitors Using QSAR Modeling and Validation via Molecular Docking | Belaguppa Manjunath Ashwin Desai et.al. | 2505.12055v1 | null |
2025-05-17 | Continuous Domain Generalization | Zekun Cai et.al. | 2505.13519v1 | null |
2025-05-17 | Accelerating the Search for Superconductors Using Machine Learning | Suhas Adiga et.al. | 2505.11964v1 | link |
Object Detection
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | Embracing Contradiction: Theoretical Inconsistency Will Not Impede the Road of Building Responsible AI Systems | Gordon Dai et.al. | 2505.18139v1 | null |
2025-05-23 | VideoGameBench: Can Vision-Language Models complete popular video games? | Alex L. Zhang et.al. | 2505.18134v1 | null |
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et.al. | 2505.18129v1 | null |
2025-05-23 | Frankentext: Stitching random text fragments into long-form narratives | Chau Minh Pham et.al. | 2505.18128v1 | null |
2025-05-23 | Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking | Cheng-Yen Yang et.al. | 2505.18111v1 | null |
2025-05-23 | Low energy calibration in DUNE far detector prototypes | Emile Lavaut et.al. | 2505.18073v1 | null |
2025-05-23 | Learning with Restricted Boltzmann Machines: Asymptotics of AMP and GD in High Dimensions | Yizhou Xu et.al. | 2505.18046v1 | null |
2025-05-23 | Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation | Li Zhong et.al. | 2505.18039v1 | null |
2025-05-23 | Efficient Conditional Gradient Methods for Solving Stochastic Convex Bilevel Optimization Problems | Khanh-Hung Giang-Tran et.al. | 2505.18037v1 | null |
2025-05-23 | Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons | Hazhar Rahmani et.al. | 2505.18030v1 | null |
2025-05-23 | A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency | Xiaobao Wei et.al. | 2505.18024v1 | null |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015v1 | null |
2025-05-23 | DiFache: Efficient and Scalable Caching on Disaggregated Memory using Decentralized Coherence | Hanze Zhang et.al. | 2505.18013v1 | null |
2025-05-23 | Measurement of branching fractions of $Λ_{c}^{+}$ decays to $Σ^{+} η$ and $Σ^{+} η'$ | BESIII Collaboration et.al. | 2505.18004v1 | null |
2025-05-23 | Dark Matter EFT Landscape Probed by QUEST-DMC | QUEST-DMC Collaboration et.al. | 2505.17995v1 | null |
2025-05-23 | Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation | Zhihua Liu et.al. | 2505.17994v1 | null |
2025-05-23 | Canonical Pose Reconstruction from Single Depth Image for 3D Non-rigid Pose Recovery on Limited Datasets | Fahd Alhamazani et.al. | 2505.17992v1 | null |
2025-05-23 | A Principled Bayesian Framework for Training Binary and Spiking Neural Networks | James A. Walker et.al. | 2505.17962v1 | null |
2025-05-23 | Mind the Domain Gap: Measuring the Domain Gap Between Real-World and Synthetic Point Clouds for Automated Driving Development | Nguyen Duc et.al. | 2505.17959v1 | null |
2025-05-23 | The impact of compact object deformation on thin accretion disk properties | Shokoufe Faraji et.al. | 2505.17924v1 | null |
2025-05-23 | Object-level Cross-view Geo-localization with Location Enhancement and Multi-Head Cross Attention | Zheyang Huang et.al. | 2505.17911v1 | null |
2025-05-23 | Geometric Shape Modelling and Volume Estimation of Dry Bulk Cargo Piles using a Single Image | Debanshu Ratha et.al. | 2505.17896v1 | null |
2025-05-23 | DataRater: Meta-Learned Dataset Curation | Dan A. Calian et.al. | 2505.17895v1 | null |
2025-05-23 | FastCAV: Efficient Computation of Concept Activation Vectors for Explaining Deep Neural Networks | Laines Schmalwasser et.al. | 2505.17883v1 | null |
2025-05-23 | Semi-Supervised Multi-Label Feature Selection with Consistent Sparse Graph Learning | Yan Zhong et.al. | 2505.17875v1 | null |
2025-05-23 | Best Group Identification in Multi-Objective Bandits | Mohammad Shahverdikondori et.al. | 2505.17869v1 | null |
2025-05-23 | DesignX: Human-Competitive Algorithm Designer for Black-Box Optimization | Hongshu Guo et.al. | 2505.17866v1 | null |
2025-05-23 | Scalable Valuation of Human Feedback through Provably Robust Model Alignment | Masahiro Fujisawa et.al. | 2505.17859v1 | null |
2025-05-23 | Measurement of event shapes in minimum-bias events from proton-proton collisions at $\sqrt{s}$ = 13 TeV | CMS Collaboration et.al. | 2505.17850v1 | null |
Image Classification
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions | Zizhang Li et.al. | 2505.18151v1 | null |
2025-05-23 | Generative Distribution Embeddings | Nic Fishman et.al. | 2505.18150v1 | null |
2025-05-23 | TokBench: Evaluating Your Visual Tokenizer before Visual Generation | Junfeng Wu et.al. | 2505.18142v1 | null |
2025-05-23 | TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations | Alan Arazi et.al. | 2505.18125v1 | null |
2025-05-23 | Instructify: Demystifying Metadata to Visual Instruction Tuning Data Conversion | Jacob Hansen et.al. | 2505.18115v1 | null |
2025-05-23 | Accelerating Learned Image Compression Through Modeling Neural Training Dynamics | Yichi Zhang et.al. | 2505.18107v1 | null |
2025-05-23 | F-ANcGAN: An Attention-Enhanced Cycle Consistent Generative Adversarial Architecture for Synthetic Image Generation of Nanoparticles | Varun Ajith et.al. | 2505.18106v1 | null |
2025-05-23 | Structural Dynamics of Harmful Content Dissemination on WhatsApp | Yuxin Liu et.al. | 2505.18099v1 | null |
2025-05-23 | Early-Exit Graph Neural Networks | Andrea Giuseppe Di Francesco et.al. | 2505.18088v1 | null |
2025-05-23 | DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation | Junhao Chen et.al. | 2505.18078v1 | null |
2025-05-23 | Image rotation in plasmas | Renaud Gueroult et.al. | 2505.18062v1 | null |
2025-05-23 | Semantic Correspondence: Unified Benchmarking and a Strong Baseline | Kaiyan Zhang et.al. | 2505.18060v1 | link |
2025-05-23 | A Foundation Model Framework for Multi-View MRI Classification of Extramural Vascular Invasion and Mesorectal Fascia Invasion in Rectal Cancer | Yumeng Zhang et.al. | 2505.18058v1 | null |
2025-05-23 | BOTM: Echocardiography Segmentation via Bi-directional Optimal Token Matching | Zhihua Liu et.al. | 2505.18052v1 | null |
2025-05-23 | LookWhere? Efficient Visual Recognition by Learning Where to Look and What to See from Self-Supervision | Anthony Fuller et.al. | 2505.18051v1 | null |
2025-05-23 | SpikeGen: Generative Framework for Visual Spike Stream Processing | Gaole Dai et.al. | 2505.18049v1 | null |
2025-05-23 | SHARDeg: A Benchmark for Skeletal Human Action Recognition in Degraded Scenarios | Simon Malzard et.al. | 2505.18048v1 | null |
2025-05-23 | RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration | Sudarshan Rajagopalan et.al. | 2505.18047v1 | null |
2025-05-23 | Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation | Li Zhong et.al. | 2505.18039v1 | null |
2025-05-23 | CAMME: Adaptive Deepfake Image Detection with Multi-Modal Cross-Attention | Naseem Khan et.al. | 2505.18035v1 | null |
2025-05-23 | Knot So Simple: A Minimalistic Environment for Spatial Reasoning | Zizhao Chen et.al. | 2505.18028v1 | null |
2025-05-23 | A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency | Xiaobao Wei et.al. | 2505.18024v1 | null |
2025-05-23 | RemoteSAM: Towards Segment Anything for Earth Observation | Liang Yao et.al. | 2505.18022v1 | null |
2025-05-23 | Building Floor Number Estimation from Crowdsourced Street-Level Images: Munich Dataset and Baseline Method | Yao Sun et.al. | 2505.18021v1 | null |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015v1 | null |
2025-05-23 | Classification of assembly tasks combining multiple primitive actions using Transformers and xLSTMs | Miguel Neves et.al. | 2505.18012v1 | null |
2025-05-23 | Clinical Validation of Deep Learning for Real-Time Tissue Oxygenation Estimation Using Spectral Imaging | Jens De Winne et.al. | 2505.18010v1 | null |
2025-05-23 | Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation | Zhihua Liu et.al. | 2505.17994v1 | null |
2025-05-23 | Canonical Pose Reconstruction from Single Depth Image for 3D Non-rigid Pose Recovery on Limited Datasets | Fahd Alhamazani et.al. | 2505.17992v1 | null |
Instance Segmentation
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | Boosting Open Set Recognition Performance through Modulated Representation Learning | Amit Kumar Kundu et.al. | 2505.18137v1 | null |
2025-05-23 | Frankentext: Stitching random text fragments into long-form narratives | Chau Minh Pham et.al. | 2505.18128v1 | null |
2025-05-23 | From Temporal to Spatial: Designing Spatialized Interactions with Segmented-audios in Immersive Environments for Active Engagement with Performing Arts Intangible Cultural Heritage | Yuqi Wang et.al. | 2505.18112v1 | null |
2025-05-23 | Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking | Cheng-Yen Yang et.al. | 2505.18111v1 | null |
2025-05-23 | F-ANcGAN: An Attention-Enhanced Cycle Consistent Generative Adversarial Architecture for Synthetic Image Generation of Nanoparticles | Varun Ajith et.al. | 2505.18106v1 | null |
2025-05-23 | CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays | Hyungyung Lee et.al. | 2505.18087v1 | null |
2025-05-23 | Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding | Xiaoyi Zhang et.al. | 2505.18079v1 | null |
2025-05-23 | BOTM: Echocardiography Segmentation via Bi-directional Optimal Token Matching | Zhihua Liu et.al. | 2505.18052v1 | null |
2025-05-23 | LookWhere? Efficient Visual Recognition by Learning Where to Look and What to See from Self-Supervision | Anthony Fuller et.al. | 2505.18051v1 | null |
2025-05-23 | RemoteSAM: Towards Segment Anything for Earth Observation | Liang Yao et.al. | 2505.18022v1 | null |
2025-05-23 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification | Shashank Agnihotri et.al. | 2505.18015v1 | null |
2025-05-23 | Classification of assembly tasks combining multiple primitive actions using Transformers and xLSTMs | Miguel Neves et.al. | 2505.18012v1 | null |
2025-05-23 | Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation | Zhihua Liu et.al. | 2505.17994v1 | null |
2025-05-23 | Few-Shot Learning from Gigapixel Images via Hierarchical Vision-Language Alignment and Modeling | Bryan Wong et.al. | 2505.17982v1 | null |
2025-05-23 | MR-EEGWaveNet: Multiresolutional EEGWaveNet for Seizure Detection from Long EEG Recordings | Kazi Mahmudul Hassan et.al. | 2505.17972v1 | null |
2025-05-23 | Explainable Anatomy-Guided AI for Prostate MRI: Foundation Models and In Silico Clinical Trials for Virtual Biopsy-based Risk Assessment | Danial Khan et.al. | 2505.17971v1 | null |
2025-05-23 | Optimizing QAOA circuit transpilation with parity twine and SWAP network encodings | J. A. Montanez-Barrera et.al. | 2505.17944v1 | null |
2025-05-23 | AutoMiSeg: Automatic Medical Image Segmentation via Test-Time Adaptation of Foundation Models | Xingjian Li et.al. | 2505.17931v1 | null |
2025-05-23 | Promptable cancer segmentation using minimal expert-curated data | Lynn Karam et.al. | 2505.17915v1 | null |
2025-05-23 | Semantic segmentation with reward | Xie Ting et.al. | 2505.17905v1 | null |
2025-05-23 | DataRater: Meta-Learned Dataset Curation | Dan A. Calian et.al. | 2505.17895v1 | null |
2025-05-23 | Track Anything Annotate: Video annotation and dataset generation of computer vision models | Nikita Ivanov et.al. | 2505.17884v1 | null |
2025-05-23 | DesignX: Human-Competitive Algorithm Designer for Black-Box Optimization | Hongshu Guo et.al. | 2505.17866v1 | null |
2025-05-23 | Generative Data Augmentation for Object Point Cloud Segmentation | Dekai Zhu et.al. | 2505.17783v1 | null |
2025-05-23 | Hephaestus Minicubes: A Global, Multi-Modal Dataset for Volcanic Unrest Monitoring | Nikolas Papadopoulos et.al. | 2505.17782v1 | null |
2025-05-23 | But what is your honest answer? Aiding LLM-judges with honest alternatives using steering vectors | Leon Eshuijs et.al. | 2505.17760v1 | null |
2025-05-23 | MetaBox-v2: A Unified Benchmark Platform for Meta-Black-Box Optimization | Zeyuan Ma et.al. | 2505.17745v1 | null |
2025-05-23 | Slot-MLLM: Object-Centric Visual Tokenization for Multimodal LLM | Donghwan Chi et.al. | 2505.17726v1 | null |
2025-05-23 | SeaLion: Semantic Part-Aware Latent Point Diffusion Models for 3D Generation | Dekai Zhu et.al. | 2505.17721v1 | null |
Few-shot Learning
Meta Learning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Linear Mixture Distributionally Robust Markov Decision Processes | Zhishuai Liu et.al. | 2505.18044v1 | null |
2025-05-23 | The bipartite structure of treatment-trial networks reveals the flow of information in network meta-analysis | Annabel L Davies et.al. | 2505.18036v1 | null |
2025-05-23 | Federated Causal Inference from Multi-Site Observational Data via Propensity Score Aggregation | Khellaf Rémi et.al. | 2505.17961v1 | null |
2025-05-23 | Evolving Machine Learning: A Survey | Ignacio Cabrera Martin et.al. | 2505.17902v1 | null |
2025-05-23 | T2I-Eval-R1: Reinforcement Learning-Driven Reasoning for Interpretable Text-to-Image Evaluation | Zi-Ao Ma et.al. | 2505.17897v1 | null |
2025-05-23 | DataRater: Meta-Learned Dataset Curation | Dan A. Calian et.al. | 2505.17895v1 | null |
2025-05-23 | DesignX: Human-Competitive Algorithm Designer for Black-Box Optimization | Hongshu Guo et.al. | 2505.17866v1 | null |
2025-05-23 | MetaBox-v2: A Unified Benchmark Platform for Meta-Black-Box Optimization | Zeyuan Ma et.al. | 2505.17745v1 | null |
2025-05-23 | MARCO: Meta-Reflection with Cross-Referencing for Code Reasoning | Yusheng Zhao et.al. | 2505.17481v1 | null |
2025-05-23 | Towards Heterogeneous Continual Graph Learning via Meta-knowledge Distillation | Guiquan Sun et.al. | 2505.17458v1 | null |
2025-05-22 | GreekBarBench: A Challenging Benchmark for Free-Text Legal Reasoning and Citations | Odysseas S. Chlapanis et.al. | 2505.17267v1 | null |
2025-05-22 | Approach to Finding a Robust Deep Learning Model | Alexey Boldyrev et.al. | 2505.17254v1 | null |
2025-05-22 | Generative AI and Creativity: A Systematic Literature Review and Meta-Analysis | Niklas Holzner et.al. | 2505.17241v1 | null |
2025-05-22 | Understanding Prompt Tuning and In-Context Learning via Meta-Learning | Tim Genewein et.al. | 2505.17010v1 | link |
2025-05-22 | AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios | Yunjia Qi et.al. | 2505.16944v1 | link |
2025-05-22 | Perceptual Quality Assessment for Embodied AI | Chunyi Li et.al. | 2505.16815v1 | link |
2025-05-22 | Action is All You Need: Dual-Flow Generative Ranking Network for Recommendation | Hao Guo et.al. | 2505.16752v1 | null |
2025-05-22 | Meta-reinforcement learning with minimum attention | Pilhwa Lee et.al. | 2505.16741v1 | null |
2025-05-22 | Beyond Induction Heads: In-Context Meta Learning Induces Multi-Phase Circuit Emergence | Gouki Minegishi et.al. | 2505.16694v1 | null |
2025-05-22 | Universal estimates for the density of states for aperiodic block subwavelength resonator systems | Habib Ammari et.al. | 2505.16677v1 | null |
2025-05-22 | Finetuning-Activated Backdoors in LLMs | Thibaud Gloaguen et.al. | 2505.16567v2 | null |
2025-05-22 | ReflectEvo: Improving Meta Introspection of Small LLMs by Learning Self-Reflection | Jiaqi Li et.al. | 2505.16475v1 | null |
2025-05-22 | Meta-Calibration of the Cosmic Magnification Coefficient: Toward Unbiased Weak Lensing Reconstruction by Counting Galaxies | Jian Qin et.al. | 2505.16420v1 | null |
2025-05-22 | A Square Peg in a Square Hole: Meta-Expert for Long-Tailed Semi-Supervised Learning | Yaxin Hou et.al. | 2505.16341v1 | link |
2025-05-22 | MetaSTH-Sleep: Towards Effective Few-Shot Sleep Stage Classification with Spatial-Temporal Hypergraph Enhanced Meta-Learning | Jingyu Li et.al. | 2505.17142v1 | null |
2025-05-22 | Fashion Industry in the Age of Generative Artificial Intelligence and Metaverse: A systematic Review | Rania Ahmed et.al. | 2505.17141v1 | null |
2025-05-22 | Meta-PerSER: Few-Shot Listener Personalized Speech Emotion Recognition via Meta-learning | Liang-Yeh Shen et.al. | 2505.16220v1 | null |
2025-05-22 | Distilling the Implicit Multi-Branch Structure in LLMs' Reasoning via Reinforcement Learning | Shicheng Xu et.al. | 2505.16142v1 | null |
2025-05-21 | Meta-Learning an In-Context Transformer Model of Human Higher Visual Cortex | Muquan Yu et.al. | 2505.15813v1 | null |
2025-05-21 | Families of tractable problems with respect to vertex-interval-membership width and its generalisations | Jessica Enright et.al. | 2505.15699v1 | null |
Few-shot Learning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Generative Distribution Embeddings | Nic Fishman et.al. | 2505.18150v1 | null |
2025-05-23 | Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find | Owen Bianchi et.al. | 2505.18148v1 | null |
2025-05-23 | Nonadiabatic reactive scattering of hydrogen on different surface facets of copper | Wojciech G. Stark et.al. | 2505.18147v1 | null |
2025-05-23 | Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading | Mohamed Swailem et.al. | 2505.18145v1 | null |
2025-05-23 | INN-FF: A Scalable and Efficient Machine Learning Potential for Molecular Dynamics | Taskin Mehereen et.al. | 2505.18141v1 | null |
2025-05-23 | Boosting Open Set Recognition Performance through Modulated Representation Learning | Amit Kumar Kundu et.al. | 2505.18137v1 | null |
2025-05-23 | Gaming Tool Preferences in Agentic LLMs | Kazem Faghih et.al. | 2505.18135v1 | null |
2025-05-23 | VideoGameBench: Can Vision-Language Models complete popular video games? | Alex L. Zhang et.al. | 2505.18134v1 | null |
2025-05-23 | BiggerGait: Unlocking Gait Recognition with Layer-wise Representations from Large Vision Models | Dingqing Ye et.al. | 2505.18132v1 | null |
2025-05-23 | Leveraging KANs for Expedient Training of Multichannel MLPs via Preconditioning and Geometric Refinement | Jonas A. Actor et.al. | 2505.18131v1 | null |
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et.al. | 2505.18129v1 | null |
2025-05-23 | Tuning Thermal Conductivity and Electron-Phonon Interactions in Carbon and Boron Nitride Moiré Diamanes via Twist Angle Manipulation | Rustam Arabov et.al. | 2505.18127v1 | null |
2025-05-23 | Reward Model Overoptimisation in Iterated RLHF | Lorenz Wolf et.al. | 2505.18126v1 | null |
2025-05-23 | TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations | Alan Arazi et.al. | 2505.18125v1 | null |
2025-05-23 | ProgRM: Build Better GUI Agents with Progress Rewards | Danyang Zhang et.al. | 2505.18121v1 | null |
2025-05-23 | Scalable Policy Maximization Under Network Interference | Aidan Gleich et.al. | 2505.18118v1 | null |
2025-05-23 | Bridging Supervised Learning and Reinforcement Learning in Math Reasoning | Huayu Chen et.al. | 2505.18116v1 | null |
2025-05-23 | Beyond Discreteness: Finite-Sample Analysis of Straight-Through Estimator for Quantization | Halyun Jeong et.al. | 2505.18113v1 | null |
2025-05-23 | Accelerating Learned Image Compression Through Modeling Neural Training Dynamics | Yichi Zhang et.al. | 2505.18107v1 | null |
2025-05-23 | F-ANcGAN: An Attention-Enhanced Cycle Consistent Generative Adversarial Architecture for Synthetic Image Generation of Nanoparticles | Varun Ajith et.al. | 2505.18106v1 | null |
2025-05-23 | How Can I Publish My LLM Benchmark Without Giving the True Answers Away? | Takashi Ishida et.al. | 2505.18102v1 | null |
2025-05-23 | Dynamic Dual Buffer with Divide-and-Conquer Strategy for Online Continual Learning | Congren Dai et.al. | 2505.18101v1 | null |
2025-05-23 | Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL | Joey Hong et.al. | 2505.18098v1 | null |
2025-05-23 | Towards more transferable adversarial attack in black-box manner | Chun Tong Lei et.al. | 2505.18097v1 | null |
2025-05-23 | Data Mixing Can Induce Phase Transitions in Knowledge Acquisition | Xinran Gu et.al. | 2505.18091v1 | null |
2025-05-23 | Evaluation of derivatives using approximate generalized parameter shift rule | Vytautas Abramavicius et.al. | 2505.18090v1 | null |
2025-05-23 | Early-Exit Graph Neural Networks | Andrea Giuseppe Di Francesco et.al. | 2505.18088v1 | null |
2025-05-23 | CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays | Hyungyung Lee et.al. | 2505.18087v1 | null |
2025-05-23 | Stable Reinforcement Learning for Efficient Reasoning | Muzhi Dai et.al. | 2505.18086v1 | null |
2025-05-23 | What Do You Need for Diverse Trajectory Stitching in Diffusion Planning? | Quentin Clark et.al. | 2505.18083v1 | null |
One-shot Learning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Generative Distribution Embeddings | Nic Fishman et.al. | 2505.18150v1 | null |
2025-05-23 | Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find | Owen Bianchi et.al. | 2505.18148v1 | null |
2025-05-23 | Nonadiabatic reactive scattering of hydrogen on different surface facets of copper | Wojciech G. Stark et.al. | 2505.18147v1 | null |
2025-05-23 | Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading | Mohamed Swailem et.al. | 2505.18145v1 | null |
2025-05-23 | INN-FF: A Scalable and Efficient Machine Learning Potential for Molecular Dynamics | Taskin Mehereen et.al. | 2505.18141v1 | null |
2025-05-23 | Boosting Open Set Recognition Performance through Modulated Representation Learning | Amit Kumar Kundu et.al. | 2505.18137v1 | null |
2025-05-23 | Gaming Tool Preferences in Agentic LLMs | Kazem Faghih et.al. | 2505.18135v1 | null |
2025-05-23 | VideoGameBench: Can Vision-Language Models complete popular video games? | Alex L. Zhang et.al. | 2505.18134v1 | null |
2025-05-23 | BiggerGait: Unlocking Gait Recognition with Layer-wise Representations from Large Vision Models | Dingqing Ye et.al. | 2505.18132v1 | null |
2025-05-23 | Leveraging KANs for Expedient Training of Multichannel MLPs via Preconditioning and Geometric Refinement | Jonas A. Actor et.al. | 2505.18131v1 | null |
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et.al. | 2505.18129v1 | null |
2025-05-23 | Tuning Thermal Conductivity and Electron-Phonon Interactions in Carbon and Boron Nitride Moiré Diamanes via Twist Angle Manipulation | Rustam Arabov et.al. | 2505.18127v1 | null |
2025-05-23 | Reward Model Overoptimisation in Iterated RLHF | Lorenz Wolf et.al. | 2505.18126v1 | null |
2025-05-23 | TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations | Alan Arazi et.al. | 2505.18125v1 | null |
2025-05-23 | ProgRM: Build Better GUI Agents with Progress Rewards | Danyang Zhang et.al. | 2505.18121v1 | null |
2025-05-23 | Scalable Policy Maximization Under Network Interference | Aidan Gleich et.al. | 2505.18118v1 | null |
2025-05-23 | Bridging Supervised Learning and Reinforcement Learning in Math Reasoning | Huayu Chen et.al. | 2505.18116v1 | null |
2025-05-23 | Beyond Discreteness: Finite-Sample Analysis of Straight-Through Estimator for Quantization | Halyun Jeong et.al. | 2505.18113v1 | null |
2025-05-23 | Accelerating Learned Image Compression Through Modeling Neural Training Dynamics | Yichi Zhang et.al. | 2505.18107v1 | null |
2025-05-23 | F-ANcGAN: An Attention-Enhanced Cycle Consistent Generative Adversarial Architecture for Synthetic Image Generation of Nanoparticles | Varun Ajith et.al. | 2505.18106v1 | null |
2025-05-23 | How Can I Publish My LLM Benchmark Without Giving the True Answers Away? | Takashi Ishida et.al. | 2505.18102v1 | null |
2025-05-23 | Dynamic Dual Buffer with Divide-and-Conquer Strategy for Online Continual Learning | Congren Dai et.al. | 2505.18101v1 | null |
2025-05-23 | Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL | Joey Hong et.al. | 2505.18098v1 | null |
2025-05-23 | Towards more transferable adversarial attack in black-box manner | Chun Tong Lei et.al. | 2505.18097v1 | null |
2025-05-23 | Data Mixing Can Induce Phase Transitions in Knowledge Acquisition | Xinran Gu et.al. | 2505.18091v1 | null |
2025-05-23 | Evaluation of derivatives using approximate generalized parameter shift rule | Vytautas Abramavicius et.al. | 2505.18090v1 | null |
2025-05-23 | Early-Exit Graph Neural Networks | Andrea Giuseppe Di Francesco et.al. | 2505.18088v1 | null |
2025-05-23 | CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays | Hyungyung Lee et.al. | 2505.18087v1 | null |
2025-05-23 | Stable Reinforcement Learning for Efficient Reasoning | Muzhi Dai et.al. | 2505.18086v1 | null |
2025-05-23 | What Do You Need for Diverse Trajectory Stitching in Diffusion Planning? | Quentin Clark et.al. | 2505.18083v1 | null |
Federated Learning
Personalized
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation | Junhao Chen et.al. | 2505.18078v1 | null |
2025-05-23 | Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals | Jia-Nan Li et.al. | 2505.18071v1 | null |
2025-05-23 | MathEDU: Towards Adaptive Feedback for Student Mathematical Problem-Solving | Wei-Ling Hsu et.al. | 2505.18056v1 | null |
2025-05-23 | ExoGait-MS: Learning Periodic Dynamics with Multi-Scale Graph Network for Exoskeleton Gait Recognition | Lijiang Liu et.al. | 2505.18018v1 | null |
2025-05-23 | Federated Causal Inference from Multi-Site Observational Data via Propensity Score Aggregation | Khellaf Rémi et.al. | 2505.17961v1 | null |
2025-05-23 | Urban Household Behavior in Indonesia: Drivers of Zero Waste Participation | Faizal Amir et.al. | 2505.17864v1 | null |
2025-05-23 | Multi-Person Interaction Generation from Two-Person Motion Priors | Wenning Xu et.al. | 2505.17860v1 | null |
2025-05-23 | PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions | Daeun Kyung et.al. | 2505.17818v1 | null |
2025-05-23 | Biomechanical Mapping of Tumor Growth: A Novel Method to Quantify Glioma Infiltration and Mass Effect | Carles López-Mateu et.al. | 2505.17715v1 | null |
2025-05-23 | Simulating Macroeconomic Expectations using LLM Agents | Jianhao Lin et.al. | 2505.17648v1 | null |
2025-05-23 | Large language model as user daily behavior data generator: balancing population diversity and individual personality | Haoxin Li et.al. | 2505.17615v1 | null |
2025-05-23 | NeUQI: Near-Optimal Uniform Quantization Parameter Initialization | Li Lin et.al. | 2505.17595v1 | null |
2025-05-23 | Reasoning Meets Personalization: Unleashing the Potential of Large Reasoning Model for Personalized Generation | Sichun Luo et.al. | 2505.17571v1 | null |
2025-05-23 | Twin-2K-500: A dataset for building digital twins of over 2,000 people based on their answers to over 500 questions | Olivier Toubia et.al. | 2505.17479v1 | null |
2025-05-23 | SecurePay: Enabling Secure and Fast Payment Processing for Platform Economy | Junru Lin et.al. | 2505.17466v1 | null |
2025-05-23 | Conversations: Love Them, Hate Them, Steer Them | Niranjan Chebrolu et.al. | 2505.17413v1 | null |
2025-05-22 | Deconfounded Warm-Start Thompson Sampling with Applications to Precision Medicine | Prateek Jaiswal et.al. | 2505.17283v1 | null |
2025-05-22 | Personalizing Student-Agent Interactions Using Log-Contextualized Retrieval Augmented Generation (RAG) | Clayton Cohn et.al. | 2505.17238v1 | null |
2025-05-22 | Secure and Private Federated Learning: Achieving Adversarial Resilience through Robust Aggregation | Kun Yang et.al. | 2505.17226v1 | null |
2025-05-22 | Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine | Adib Bazgir et.al. | 2505.16982v1 | null |
2025-05-22 | Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On | Siqi Wan et.al. | 2505.16977v1 | link |
2025-05-22 | Cracking Aegis: An Adversarial LLM-based Game for Raising Awareness of Vulnerabilities in Privacy Protection | Jiaying Fu et.al. | 2505.16954v1 | null |
2025-05-22 | PIIvot: A Lightweight NLP Anonymization Framework for Question-Anchored Tutoring Dialogues | Matthew Zent et.al. | 2505.16931v1 | null |
2025-05-22 | Scalable and Interpretable Contextual Bandits: A Literature Review and Retail Offer Prototype | Nikola Tankovic et.al. | 2505.16918v1 | null |
2025-05-22 | Redefining Clustered Federated Learning for System Identification: The Path of ClusterCraft | Ertuğrul Keçeci et.al. | 2505.16857v1 | null |
2025-05-22 | ATR-Bench: A Federated Learning Benchmark for Adaptation, Trust, and Reasoning | Tajamul Ashraf et.al. | 2505.16850v1 | null |
2025-05-22 | A modular framework for automated evaluation of procedural content generation in serious games with deep reinforcement learning agents | Eleftherios Kalafatis et.al. | 2505.16801v1 | null |
2025-05-22 | Data-Driven Breakthroughs and Future Directions in AI Infrastructure: A Comprehensive Review | Beyazit Bestami Yuksel et.al. | 2505.16771v1 | null |
2025-05-22 | WikiDBGraph: Large-Scale Database Graph of Wikidata for Collaborative Learning | Zhaomin Wu et.al. | 2505.16635v1 | null |
2025-05-22 | Steering Large Language Models for Machine Translation Personalization | Daniel Scalena et.al. | 2505.16612v1 | link |
Benchmark
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Federated Causal Inference from Multi-Site Observational Data via Propensity Score Aggregation | Khellaf Rémi et.al. | 2505.17961v1 | null |
2025-05-22 | Secure and Private Federated Learning: Achieving Adversarial Resilience through Robust Aggregation | Kun Yang et.al. | 2505.17226v1 | null |
2025-05-22 | Redefining Clustered Federated Learning for System Identification: The Path of ClusterCraft | Ertuğrul Keçeci et.al. | 2505.16857v1 | null |
2025-05-22 | ATR-Bench: A Federated Learning Benchmark for Adaptation, Trust, and Reasoning | Tajamul Ashraf et.al. | 2505.16850v1 | null |
2025-05-22 | Data-Driven Breakthroughs and Future Directions in AI Infrastructure: A Comprehensive Review | Beyazit Bestami Yuksel et.al. | 2505.16771v1 | null |
2025-05-22 | WikiDBGraph: Large-Scale Database Graph of Wikidata for Collaborative Learning | Zhaomin Wu et.al. | 2505.16635v1 | null |
2025-05-22 | From Local Patterns to Global Understanding: Cross-Stock Trend Integration for Enhanced Predictive Modeling | Yi Hu et.al. | 2505.16573v1 | null |
2025-05-22 | Performance Guaranteed Poisoning Attacks in Federated Learning: A Sliding Mode Approach | Huazi Pan et.al. | 2505.16403v1 | null |
2025-05-22 | Privacy-Aware Cyberterrorism Network Analysis using Graph Neural Networks and Federated Learning | Anas Ali et.al. | 2505.16371v1 | null |
2025-05-22 | Enhancing Federated Survival Analysis through Peer-Driven Client Reputation in Healthcare | Navid Seidi et.al. | 2505.16190v1 | null |
2025-05-22 | Multimodal Online Federated Learning with Modality Missing in Internet of Things | Heqiang Wang et.al. | 2505.16138v1 | null |
2025-05-21 | Spectroscopic study of globular and fuzzy clusters in Lenticular galaxy NGC 1023 | Miguel A. López-Santamaría et.al. | 2505.16075v1 | null |
2025-05-21 | A Federated Splitting Framework for LLMs: Security, Efficiency, and Adaptability | Zishuai Zhang et.al. | 2505.15683v1 | link |
2025-05-21 | Federated Learning with Unlabeled Clients: Personalization Can Happen in Low Dimensions | Hossein Zakerinia et.al. | 2505.15579v1 | null |
2025-05-21 | Federated Learning-Enhanced Blockchain Framework for Privacy-Preserving Intrusion Detection in Industrial IoT | Anas Ali et.al. | 2505.15376v1 | null |
2025-05-21 | Distributionally Robust Federated Learning with Client Drift Minimization | Mounssif Krouka et.al. | 2505.15371v1 | null |
2025-05-21 | Reliable Vertical Federated Learning in 5G Core Network Architecture | Mohamad Mestoukirdi et.al. | 2505.15244v1 | link |
2025-05-21 | EC-LDA : Label Distribution Inference Attack against Federated Graph Learning with Embedding Compression | Tong Cheng et.al. | 2505.15140v1 | null |
2025-05-21 | A Survey On Secure Machine Learning | Taobo Liao et.al. | 2505.15124v1 | null |
2025-05-20 | Efficient Privacy-Preserving Cross-Silo Federated Learning with Multi-Key Homomorphic Encryption | Abdullah Al Omar et.al. | 2505.14797v1 | null |
2025-05-20 | Deep Learning-Based Forecasting of Boarding Patient Counts to Address ED Overcrowding | Orhun Vural et.al. | 2505.14765v1 | null |
2025-05-20 | Keeping in Place After the Storm-Emergency Assistance and Evictions | Bilal Islah et.al. | 2505.14548v1 | null |
2025-05-20 | Federated prediction for scalable and privacy-preserved knowledge-based planning in radiotherapy | Jingyun Chen et.al. | 2505.14507v1 | null |
2025-05-20 | Path-integral molecular dynamics with actively-trained and universal machine learning force fields | A. A. Solovykh et.al. | 2505.14245v1 | null |
2025-05-20 | Federated learning in low-resource settings: A chest imaging study in Africa -- Challenges and lessons learned | Jorge Fabila et.al. | 2505.14217v1 | null |
2025-05-20 | Personalized Bayesian Federated Learning with Wasserstein Barycenter Aggregation | Ting Wei et.al. | 2505.14161v1 | null |
2025-05-20 | Non-monotonic dependence of $T_c$ on the c axis compression in the HTSC cuprate La$_{2-x}$Sr$_x$CuO$_4$ | I. A. Makarov et.al. | 2505.14048v1 | null |
2025-05-20 | FedGraM: Defending Against Untargeted Attacks in Federated Learning via Embedding Gram Matrix | Di Wu et.al. | 2505.14024v1 | null |
2025-05-19 | New insight into the variability of the Be star $π$ Aquarii: Determination of stellar and disk parameters | D. Concha et.al. | 2505.13700v1 | null |
2025-05-19 | Optimal Client Sampling in Federated Learning with Client-Level Heterogeneous Differential Privacy | Jiahao Xu et.al. | 2505.13655v1 | null |
Federated Learning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Generative Distribution Embeddings | Nic Fishman et.al. | 2505.18150v1 | null |
2025-05-23 | Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find | Owen Bianchi et.al. | 2505.18148v1 | null |
2025-05-23 | Nonadiabatic reactive scattering of hydrogen on different surface facets of copper | Wojciech G. Stark et.al. | 2505.18147v1 | null |
2025-05-23 | Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading | Mohamed Swailem et.al. | 2505.18145v1 | null |
2025-05-23 | INN-FF: A Scalable and Efficient Machine Learning Potential for Molecular Dynamics | Taskin Mehereen et.al. | 2505.18141v1 | null |
2025-05-23 | Boosting Open Set Recognition Performance through Modulated Representation Learning | Amit Kumar Kundu et.al. | 2505.18137v1 | null |
2025-05-23 | Gaming Tool Preferences in Agentic LLMs | Kazem Faghih et.al. | 2505.18135v1 | null |
2025-05-23 | VideoGameBench: Can Vision-Language Models complete popular video games? | Alex L. Zhang et.al. | 2505.18134v1 | null |
2025-05-23 | BiggerGait: Unlocking Gait Recognition with Layer-wise Representations from Large Vision Models | Dingqing Ye et.al. | 2505.18132v1 | null |
2025-05-23 | Leveraging KANs for Expedient Training of Multichannel MLPs via Preconditioning and Geometric Refinement | Jonas A. Actor et.al. | 2505.18131v1 | null |
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et.al. | 2505.18129v1 | null |
2025-05-23 | Tuning Thermal Conductivity and Electron-Phonon Interactions in Carbon and Boron Nitride Moiré Diamanes via Twist Angle Manipulation | Rustam Arabov et.al. | 2505.18127v1 | null |
2025-05-23 | Reward Model Overoptimisation in Iterated RLHF | Lorenz Wolf et.al. | 2505.18126v1 | null |
2025-05-23 | TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations | Alan Arazi et.al. | 2505.18125v1 | null |
2025-05-23 | ProgRM: Build Better GUI Agents with Progress Rewards | Danyang Zhang et.al. | 2505.18121v1 | null |
2025-05-23 | Scalable Policy Maximization Under Network Interference | Aidan Gleich et.al. | 2505.18118v1 | null |
2025-05-23 | Bridging Supervised Learning and Reinforcement Learning in Math Reasoning | Huayu Chen et.al. | 2505.18116v1 | null |
2025-05-23 | Beyond Discreteness: Finite-Sample Analysis of Straight-Through Estimator for Quantization | Halyun Jeong et.al. | 2505.18113v1 | null |
2025-05-23 | Accelerating Learned Image Compression Through Modeling Neural Training Dynamics | Yichi Zhang et.al. | 2505.18107v1 | null |
2025-05-23 | F-ANcGAN: An Attention-Enhanced Cycle Consistent Generative Adversarial Architecture for Synthetic Image Generation of Nanoparticles | Varun Ajith et.al. | 2505.18106v1 | null |
2025-05-23 | How Can I Publish My LLM Benchmark Without Giving the True Answers Away? | Takashi Ishida et.al. | 2505.18102v1 | null |
2025-05-23 | Dynamic Dual Buffer with Divide-and-Conquer Strategy for Online Continual Learning | Congren Dai et.al. | 2505.18101v1 | null |
2025-05-23 | Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL | Joey Hong et.al. | 2505.18098v1 | null |
2025-05-23 | Towards more transferable adversarial attack in black-box manner | Chun Tong Lei et.al. | 2505.18097v1 | null |
2025-05-23 | Data Mixing Can Induce Phase Transitions in Knowledge Acquisition | Xinran Gu et.al. | 2505.18091v1 | null |
2025-05-23 | Evaluation of derivatives using approximate generalized parameter shift rule | Vytautas Abramavicius et.al. | 2505.18090v1 | null |
2025-05-23 | Early-Exit Graph Neural Networks | Andrea Giuseppe Di Francesco et.al. | 2505.18088v1 | null |
2025-05-23 | CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays | Hyungyung Lee et.al. | 2505.18087v1 | null |
2025-05-23 | Stable Reinforcement Learning for Efficient Reasoning | Muzhi Dai et.al. | 2505.18086v1 | null |
2025-05-23 | What Do You Need for Diverse Trajectory Stitching in Diffusion Planning? | Quentin Clark et.al. | 2505.18083v1 | null |
Optimization
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Federated Causal Inference from Multi-Site Observational Data via Propensity Score Aggregation | Khellaf Rémi et.al. | 2505.17961v1 | null |
2025-05-22 | Secure and Private Federated Learning: Achieving Adversarial Resilience through Robust Aggregation | Kun Yang et.al. | 2505.17226v1 | null |
2025-05-22 | Redefining Clustered Federated Learning for System Identification: The Path of ClusterCraft | Ertuğrul Keçeci et.al. | 2505.16857v1 | null |
2025-05-22 | ATR-Bench: A Federated Learning Benchmark for Adaptation, Trust, and Reasoning | Tajamul Ashraf et.al. | 2505.16850v1 | null |
2025-05-22 | Data-Driven Breakthroughs and Future Directions in AI Infrastructure: A Comprehensive Review | Beyazit Bestami Yuksel et.al. | 2505.16771v1 | null |
2025-05-22 | WikiDBGraph: Large-Scale Database Graph of Wikidata for Collaborative Learning | Zhaomin Wu et.al. | 2505.16635v1 | null |
2025-05-22 | From Local Patterns to Global Understanding: Cross-Stock Trend Integration for Enhanced Predictive Modeling | Yi Hu et.al. | 2505.16573v1 | null |
2025-05-22 | Performance Guaranteed Poisoning Attacks in Federated Learning: A Sliding Mode Approach | Huazi Pan et.al. | 2505.16403v1 | null |
2025-05-22 | Privacy-Aware Cyberterrorism Network Analysis using Graph Neural Networks and Federated Learning | Anas Ali et.al. | 2505.16371v1 | null |
2025-05-22 | Enhancing Federated Survival Analysis through Peer-Driven Client Reputation in Healthcare | Navid Seidi et.al. | 2505.16190v1 | null |
2025-05-22 | Multimodal Online Federated Learning with Modality Missing in Internet of Things | Heqiang Wang et.al. | 2505.16138v1 | null |
2025-05-21 | Spectroscopic study of globular and fuzzy clusters in Lenticular galaxy NGC 1023 | Miguel A. López-Santamaría et.al. | 2505.16075v1 | null |
2025-05-21 | A Federated Splitting Framework for LLMs: Security, Efficiency, and Adaptability | Zishuai Zhang et.al. | 2505.15683v1 | link |
2025-05-21 | Federated Learning with Unlabeled Clients: Personalization Can Happen in Low Dimensions | Hossein Zakerinia et.al. | 2505.15579v1 | null |
2025-05-21 | Federated Learning-Enhanced Blockchain Framework for Privacy-Preserving Intrusion Detection in Industrial IoT | Anas Ali et.al. | 2505.15376v1 | null |
2025-05-21 | Distributionally Robust Federated Learning with Client Drift Minimization | Mounssif Krouka et.al. | 2505.15371v1 | null |
2025-05-21 | Reliable Vertical Federated Learning in 5G Core Network Architecture | Mohamad Mestoukirdi et.al. | 2505.15244v1 | link |
2025-05-21 | EC-LDA : Label Distribution Inference Attack against Federated Graph Learning with Embedding Compression | Tong Cheng et.al. | 2505.15140v1 | null |
2025-05-21 | A Survey On Secure Machine Learning | Taobo Liao et.al. | 2505.15124v1 | null |
2025-05-20 | Efficient Privacy-Preserving Cross-Silo Federated Learning with Multi-Key Homomorphic Encryption | Abdullah Al Omar et.al. | 2505.14797v1 | null |
2025-05-20 | Deep Learning-Based Forecasting of Boarding Patient Counts to Address ED Overcrowding | Orhun Vural et.al. | 2505.14765v1 | null |
2025-05-20 | Keeping in Place After the Storm-Emergency Assistance and Evictions | Bilal Islah et.al. | 2505.14548v1 | null |
2025-05-20 | Federated prediction for scalable and privacy-preserved knowledge-based planning in radiotherapy | Jingyun Chen et.al. | 2505.14507v1 | null |
2025-05-20 | Path-integral molecular dynamics with actively-trained and universal machine learning force fields | A. A. Solovykh et.al. | 2505.14245v1 | null |
2025-05-20 | Federated learning in low-resource settings: A chest imaging study in Africa -- Challenges and lessons learned | Jorge Fabila et.al. | 2505.14217v1 | null |
2025-05-20 | Personalized Bayesian Federated Learning with Wasserstein Barycenter Aggregation | Ting Wei et.al. | 2505.14161v1 | null |
2025-05-20 | Non-monotonic dependence of $T_c$ on the c axis compression in the HTSC cuprate La$_{2-x}$Sr$_x$CuO$_4$ | I. A. Makarov et.al. | 2505.14048v1 | null |
2025-05-20 | FedGraM: Defending Against Untargeted Attacks in Federated Learning via Embedding Gram Matrix | Di Wu et.al. | 2505.14024v1 | null |
2025-05-19 | New insight into the variability of the Be star $π$ Aquarii: Determination of stellar and disk parameters | D. Concha et.al. | 2505.13700v1 | null |
2025-05-19 | Optimal Client Sampling in Federated Learning with Client-Level Heterogeneous Differential Privacy | Jiahao Xu et.al. | 2505.13655v1 | null |
Framework
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Federated Causal Inference from Multi-Site Observational Data via Propensity Score Aggregation | Khellaf Rémi et.al. | 2505.17961v1 | null |
2025-05-22 | Secure and Private Federated Learning: Achieving Adversarial Resilience through Robust Aggregation | Kun Yang et.al. | 2505.17226v1 | null |
2025-05-22 | Redefining Clustered Federated Learning for System Identification: The Path of ClusterCraft | Ertuğrul Keçeci et.al. | 2505.16857v1 | null |
2025-05-22 | ATR-Bench: A Federated Learning Benchmark for Adaptation, Trust, and Reasoning | Tajamul Ashraf et.al. | 2505.16850v1 | null |
2025-05-22 | Data-Driven Breakthroughs and Future Directions in AI Infrastructure: A Comprehensive Review | Beyazit Bestami Yuksel et.al. | 2505.16771v1 | null |
2025-05-22 | WikiDBGraph: Large-Scale Database Graph of Wikidata for Collaborative Learning | Zhaomin Wu et.al. | 2505.16635v1 | null |
2025-05-22 | From Local Patterns to Global Understanding: Cross-Stock Trend Integration for Enhanced Predictive Modeling | Yi Hu et.al. | 2505.16573v1 | null |
2025-05-22 | Performance Guaranteed Poisoning Attacks in Federated Learning: A Sliding Mode Approach | Huazi Pan et.al. | 2505.16403v1 | null |
2025-05-22 | Privacy-Aware Cyberterrorism Network Analysis using Graph Neural Networks and Federated Learning | Anas Ali et.al. | 2505.16371v1 | null |
2025-05-22 | Enhancing Federated Survival Analysis through Peer-Driven Client Reputation in Healthcare | Navid Seidi et.al. | 2505.16190v1 | null |
2025-05-22 | Multimodal Online Federated Learning with Modality Missing in Internet of Things | Heqiang Wang et.al. | 2505.16138v1 | null |
2025-05-21 | Spectroscopic study of globular and fuzzy clusters in Lenticular galaxy NGC 1023 | Miguel A. López-Santamaría et.al. | 2505.16075v1 | null |
2025-05-21 | A Federated Splitting Framework for LLMs: Security, Efficiency, and Adaptability | Zishuai Zhang et.al. | 2505.15683v1 | link |
2025-05-21 | Federated Learning with Unlabeled Clients: Personalization Can Happen in Low Dimensions | Hossein Zakerinia et.al. | 2505.15579v1 | null |
2025-05-21 | Federated Learning-Enhanced Blockchain Framework for Privacy-Preserving Intrusion Detection in Industrial IoT | Anas Ali et.al. | 2505.15376v1 | null |
2025-05-21 | Distributionally Robust Federated Learning with Client Drift Minimization | Mounssif Krouka et.al. | 2505.15371v1 | null |
2025-05-21 | Reliable Vertical Federated Learning in 5G Core Network Architecture | Mohamad Mestoukirdi et.al. | 2505.15244v1 | link |
2025-05-21 | EC-LDA : Label Distribution Inference Attack against Federated Graph Learning with Embedding Compression | Tong Cheng et.al. | 2505.15140v1 | null |
2025-05-21 | A Survey On Secure Machine Learning | Taobo Liao et.al. | 2505.15124v1 | null |
2025-05-20 | Efficient Privacy-Preserving Cross-Silo Federated Learning with Multi-Key Homomorphic Encryption | Abdullah Al Omar et.al. | 2505.14797v1 | null |
2025-05-20 | Deep Learning-Based Forecasting of Boarding Patient Counts to Address ED Overcrowding | Orhun Vural et.al. | 2505.14765v1 | null |
2025-05-20 | Keeping in Place After the Storm-Emergency Assistance and Evictions | Bilal Islah et.al. | 2505.14548v1 | null |
2025-05-20 | Federated prediction for scalable and privacy-preserved knowledge-based planning in radiotherapy | Jingyun Chen et.al. | 2505.14507v1 | null |
2025-05-20 | Path-integral molecular dynamics with actively-trained and universal machine learning force fields | A. A. Solovykh et.al. | 2505.14245v1 | null |
2025-05-20 | Federated learning in low-resource settings: A chest imaging study in Africa -- Challenges and lessons learned | Jorge Fabila et.al. | 2505.14217v1 | null |
2025-05-20 | Personalized Bayesian Federated Learning with Wasserstein Barycenter Aggregation | Ting Wei et.al. | 2505.14161v1 | null |
2025-05-20 | Non-monotonic dependence of $T_c$ on the c axis compression in the HTSC cuprate La$_{2-x}$Sr$_x$CuO$_4$ | I. A. Makarov et.al. | 2505.14048v1 | null |
2025-05-20 | FedGraM: Defending Against Untargeted Attacks in Federated Learning via Embedding Gram Matrix | Di Wu et.al. | 2505.14024v1 | null |
2025-05-19 | New insight into the variability of the Be star $π$ Aquarii: Determination of stellar and disk parameters | D. Concha et.al. | 2505.13700v1 | null |
2025-05-19 | Optimal Client Sampling in Federated Learning with Client-Level Heterogeneous Differential Privacy | Jiahao Xu et.al. | 2505.13655v1 | null |
Dataset
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Federated Causal Inference from Multi-Site Observational Data via Propensity Score Aggregation | Khellaf Rémi et.al. | 2505.17961v1 | null |
2025-05-22 | Secure and Private Federated Learning: Achieving Adversarial Resilience through Robust Aggregation | Kun Yang et.al. | 2505.17226v1 | null |
2025-05-22 | Redefining Clustered Federated Learning for System Identification: The Path of ClusterCraft | Ertuğrul Keçeci et.al. | 2505.16857v1 | null |
2025-05-22 | ATR-Bench: A Federated Learning Benchmark for Adaptation, Trust, and Reasoning | Tajamul Ashraf et.al. | 2505.16850v1 | null |
2025-05-22 | Data-Driven Breakthroughs and Future Directions in AI Infrastructure: A Comprehensive Review | Beyazit Bestami Yuksel et.al. | 2505.16771v1 | null |
2025-05-22 | WikiDBGraph: Large-Scale Database Graph of Wikidata for Collaborative Learning | Zhaomin Wu et.al. | 2505.16635v1 | null |
2025-05-22 | From Local Patterns to Global Understanding: Cross-Stock Trend Integration for Enhanced Predictive Modeling | Yi Hu et.al. | 2505.16573v1 | null |
2025-05-22 | Performance Guaranteed Poisoning Attacks in Federated Learning: A Sliding Mode Approach | Huazi Pan et.al. | 2505.16403v1 | null |
2025-05-22 | Privacy-Aware Cyberterrorism Network Analysis using Graph Neural Networks and Federated Learning | Anas Ali et.al. | 2505.16371v1 | null |
2025-05-22 | Enhancing Federated Survival Analysis through Peer-Driven Client Reputation in Healthcare | Navid Seidi et.al. | 2505.16190v1 | null |
2025-05-22 | Multimodal Online Federated Learning with Modality Missing in Internet of Things | Heqiang Wang et.al. | 2505.16138v1 | null |
2025-05-21 | Spectroscopic study of globular and fuzzy clusters in Lenticular galaxy NGC 1023 | Miguel A. López-Santamaría et.al. | 2505.16075v1 | null |
2025-05-21 | A Federated Splitting Framework for LLMs: Security, Efficiency, and Adaptability | Zishuai Zhang et.al. | 2505.15683v1 | link |
2025-05-21 | Federated Learning with Unlabeled Clients: Personalization Can Happen in Low Dimensions | Hossein Zakerinia et.al. | 2505.15579v1 | null |
2025-05-21 | Federated Learning-Enhanced Blockchain Framework for Privacy-Preserving Intrusion Detection in Industrial IoT | Anas Ali et.al. | 2505.15376v1 | null |
2025-05-21 | Distributionally Robust Federated Learning with Client Drift Minimization | Mounssif Krouka et.al. | 2505.15371v1 | null |
2025-05-21 | Reliable Vertical Federated Learning in 5G Core Network Architecture | Mohamad Mestoukirdi et.al. | 2505.15244v1 | link |
2025-05-21 | EC-LDA : Label Distribution Inference Attack against Federated Graph Learning with Embedding Compression | Tong Cheng et.al. | 2505.15140v1 | null |
2025-05-21 | A Survey On Secure Machine Learning | Taobo Liao et.al. | 2505.15124v1 | null |
2025-05-20 | Efficient Privacy-Preserving Cross-Silo Federated Learning with Multi-Key Homomorphic Encryption | Abdullah Al Omar et.al. | 2505.14797v1 | null |
2025-05-20 | Deep Learning-Based Forecasting of Boarding Patient Counts to Address ED Overcrowding | Orhun Vural et.al. | 2505.14765v1 | null |
2025-05-20 | Keeping in Place After the Storm-Emergency Assistance and Evictions | Bilal Islah et.al. | 2505.14548v1 | null |
2025-05-20 | Federated prediction for scalable and privacy-preserved knowledge-based planning in radiotherapy | Jingyun Chen et.al. | 2505.14507v1 | null |
2025-05-20 | Path-integral molecular dynamics with actively-trained and universal machine learning force fields | A. A. Solovykh et.al. | 2505.14245v1 | null |
2025-05-20 | Federated learning in low-resource settings: A chest imaging study in Africa -- Challenges and lessons learned | Jorge Fabila et.al. | 2505.14217v1 | null |
2025-05-20 | Personalized Bayesian Federated Learning with Wasserstein Barycenter Aggregation | Ting Wei et.al. | 2505.14161v1 | null |
2025-05-20 | Non-monotonic dependence of $T_c$ on the c axis compression in the HTSC cuprate La$_{2-x}$Sr$_x$CuO$_4$ | I. A. Makarov et.al. | 2505.14048v1 | null |
2025-05-20 | FedGraM: Defending Against Untargeted Attacks in Federated Learning via Embedding Gram Matrix | Di Wu et.al. | 2505.14024v1 | null |
2025-05-19 | New insight into the variability of the Be star $π$ Aquarii: Determination of stellar and disk parameters | D. Concha et.al. | 2505.13700v1 | null |
2025-05-19 | Optimal Client Sampling in Federated Learning with Client-Level Heterogeneous Differential Privacy | Jiahao Xu et.al. | 2505.13655v1 | null |
Heterogeneous
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Federated Causal Inference from Multi-Site Observational Data via Propensity Score Aggregation | Khellaf Rémi et.al. | 2505.17961v1 | null |
2025-05-22 | Secure and Private Federated Learning: Achieving Adversarial Resilience through Robust Aggregation | Kun Yang et.al. | 2505.17226v1 | null |
2025-05-22 | Redefining Clustered Federated Learning for System Identification: The Path of ClusterCraft | Ertuğrul Keçeci et.al. | 2505.16857v1 | null |
2025-05-22 | ATR-Bench: A Federated Learning Benchmark for Adaptation, Trust, and Reasoning | Tajamul Ashraf et.al. | 2505.16850v1 | null |
2025-05-22 | Data-Driven Breakthroughs and Future Directions in AI Infrastructure: A Comprehensive Review | Beyazit Bestami Yuksel et.al. | 2505.16771v1 | null |
2025-05-22 | WikiDBGraph: Large-Scale Database Graph of Wikidata for Collaborative Learning | Zhaomin Wu et.al. | 2505.16635v1 | null |
2025-05-22 | From Local Patterns to Global Understanding: Cross-Stock Trend Integration for Enhanced Predictive Modeling | Yi Hu et.al. | 2505.16573v1 | null |
2025-05-22 | Performance Guaranteed Poisoning Attacks in Federated Learning: A Sliding Mode Approach | Huazi Pan et.al. | 2505.16403v1 | null |
2025-05-22 | Privacy-Aware Cyberterrorism Network Analysis using Graph Neural Networks and Federated Learning | Anas Ali et.al. | 2505.16371v1 | null |
2025-05-22 | Enhancing Federated Survival Analysis through Peer-Driven Client Reputation in Healthcare | Navid Seidi et.al. | 2505.16190v1 | null |
2025-05-22 | Multimodal Online Federated Learning with Modality Missing in Internet of Things | Heqiang Wang et.al. | 2505.16138v1 | null |
2025-05-21 | Spectroscopic study of globular and fuzzy clusters in Lenticular galaxy NGC 1023 | Miguel A. López-Santamaría et.al. | 2505.16075v1 | null |
2025-05-21 | A Federated Splitting Framework for LLMs: Security, Efficiency, and Adaptability | Zishuai Zhang et.al. | 2505.15683v1 | link |
2025-05-21 | Federated Learning with Unlabeled Clients: Personalization Can Happen in Low Dimensions | Hossein Zakerinia et.al. | 2505.15579v1 | null |
2025-05-21 | Federated Learning-Enhanced Blockchain Framework for Privacy-Preserving Intrusion Detection in Industrial IoT | Anas Ali et.al. | 2505.15376v1 | null |
2025-05-21 | Distributionally Robust Federated Learning with Client Drift Minimization | Mounssif Krouka et.al. | 2505.15371v1 | null |
2025-05-21 | Reliable Vertical Federated Learning in 5G Core Network Architecture | Mohamad Mestoukirdi et.al. | 2505.15244v1 | link |
2025-05-21 | EC-LDA : Label Distribution Inference Attack against Federated Graph Learning with Embedding Compression | Tong Cheng et.al. | 2505.15140v1 | null |
2025-05-21 | A Survey On Secure Machine Learning | Taobo Liao et.al. | 2505.15124v1 | null |
2025-05-20 | Efficient Privacy-Preserving Cross-Silo Federated Learning with Multi-Key Homomorphic Encryption | Abdullah Al Omar et.al. | 2505.14797v1 | null |
2025-05-20 | Deep Learning-Based Forecasting of Boarding Patient Counts to Address ED Overcrowding | Orhun Vural et.al. | 2505.14765v1 | null |
2025-05-20 | Keeping in Place After the Storm-Emergency Assistance and Evictions | Bilal Islah et.al. | 2505.14548v1 | null |
2025-05-20 | Federated prediction for scalable and privacy-preserved knowledge-based planning in radiotherapy | Jingyun Chen et.al. | 2505.14507v1 | null |
2025-05-20 | Path-integral molecular dynamics with actively-trained and universal machine learning force fields | A. A. Solovykh et.al. | 2505.14245v1 | null |
2025-05-20 | Federated learning in low-resource settings: A chest imaging study in Africa -- Challenges and lessons learned | Jorge Fabila et.al. | 2505.14217v1 | null |
2025-05-20 | Personalized Bayesian Federated Learning with Wasserstein Barycenter Aggregation | Ting Wei et.al. | 2505.14161v1 | null |
2025-05-20 | Non-monotonic dependence of $T_c$ on the c axis compression in the HTSC cuprate La$_{2-x}$Sr$_x$CuO$_4$ | I. A. Makarov et.al. | 2505.14048v1 | null |
2025-05-20 | FedGraM: Defending Against Untargeted Attacks in Federated Learning via Embedding Gram Matrix | Di Wu et.al. | 2505.14024v1 | null |
2025-05-19 | New insight into the variability of the Be star $π$ Aquarii: Determination of stellar and disk parameters | D. Concha et.al. | 2505.13700v1 | null |
2025-05-19 | Optimal Client Sampling in Federated Learning with Client-Level Heterogeneous Differential Privacy | Jiahao Xu et.al. | 2505.13655v1 | null |
Asynchronous
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Federated Causal Inference from Multi-Site Observational Data via Propensity Score Aggregation | Khellaf Rémi et.al. | 2505.17961v1 | null |
2025-05-22 | Secure and Private Federated Learning: Achieving Adversarial Resilience through Robust Aggregation | Kun Yang et.al. | 2505.17226v1 | null |
2025-05-22 | Redefining Clustered Federated Learning for System Identification: The Path of ClusterCraft | Ertuğrul Keçeci et.al. | 2505.16857v1 | null |
2025-05-22 | ATR-Bench: A Federated Learning Benchmark for Adaptation, Trust, and Reasoning | Tajamul Ashraf et.al. | 2505.16850v1 | null |
2025-05-22 | Data-Driven Breakthroughs and Future Directions in AI Infrastructure: A Comprehensive Review | Beyazit Bestami Yuksel et.al. | 2505.16771v1 | null |
2025-05-22 | WikiDBGraph: Large-Scale Database Graph of Wikidata for Collaborative Learning | Zhaomin Wu et.al. | 2505.16635v1 | null |
2025-05-22 | From Local Patterns to Global Understanding: Cross-Stock Trend Integration for Enhanced Predictive Modeling | Yi Hu et.al. | 2505.16573v1 | null |
2025-05-22 | Performance Guaranteed Poisoning Attacks in Federated Learning: A Sliding Mode Approach | Huazi Pan et.al. | 2505.16403v1 | null |
2025-05-22 | Privacy-Aware Cyberterrorism Network Analysis using Graph Neural Networks and Federated Learning | Anas Ali et.al. | 2505.16371v1 | null |
2025-05-22 | Enhancing Federated Survival Analysis through Peer-Driven Client Reputation in Healthcare | Navid Seidi et.al. | 2505.16190v1 | null |
2025-05-22 | Multimodal Online Federated Learning with Modality Missing in Internet of Things | Heqiang Wang et.al. | 2505.16138v1 | null |
2025-05-21 | Spectroscopic study of globular and fuzzy clusters in Lenticular galaxy NGC 1023 | Miguel A. López-Santamaría et.al. | 2505.16075v1 | null |
2025-05-21 | A Federated Splitting Framework for LLMs: Security, Efficiency, and Adaptability | Zishuai Zhang et.al. | 2505.15683v1 | link |
2025-05-21 | Federated Learning with Unlabeled Clients: Personalization Can Happen in Low Dimensions | Hossein Zakerinia et.al. | 2505.15579v1 | null |
2025-05-21 | Federated Learning-Enhanced Blockchain Framework for Privacy-Preserving Intrusion Detection in Industrial IoT | Anas Ali et.al. | 2505.15376v1 | null |
2025-05-21 | Distributionally Robust Federated Learning with Client Drift Minimization | Mounssif Krouka et.al. | 2505.15371v1 | null |
2025-05-21 | Reliable Vertical Federated Learning in 5G Core Network Architecture | Mohamad Mestoukirdi et.al. | 2505.15244v1 | link |
2025-05-21 | EC-LDA : Label Distribution Inference Attack against Federated Graph Learning with Embedding Compression | Tong Cheng et.al. | 2505.15140v1 | null |
2025-05-21 | A Survey On Secure Machine Learning | Taobo Liao et.al. | 2505.15124v1 | null |
2025-05-20 | Efficient Privacy-Preserving Cross-Silo Federated Learning with Multi-Key Homomorphic Encryption | Abdullah Al Omar et.al. | 2505.14797v1 | null |
2025-05-20 | Deep Learning-Based Forecasting of Boarding Patient Counts to Address ED Overcrowding | Orhun Vural et.al. | 2505.14765v1 | null |
2025-05-20 | Keeping in Place After the Storm-Emergency Assistance and Evictions | Bilal Islah et.al. | 2505.14548v1 | null |
2025-05-20 | Federated prediction for scalable and privacy-preserved knowledge-based planning in radiotherapy | Jingyun Chen et.al. | 2505.14507v1 | null |
2025-05-20 | Path-integral molecular dynamics with actively-trained and universal machine learning force fields | A. A. Solovykh et.al. | 2505.14245v1 | null |
2025-05-20 | Federated learning in low-resource settings: A chest imaging study in Africa -- Challenges and lessons learned | Jorge Fabila et.al. | 2505.14217v1 | null |
2025-05-20 | Personalized Bayesian Federated Learning with Wasserstein Barycenter Aggregation | Ting Wei et.al. | 2505.14161v1 | null |
2025-05-20 | Non-monotonic dependence of $T_c$ on the c axis compression in the HTSC cuprate La$_{2-x}$Sr$_x$CuO$_4$ | I. A. Makarov et.al. | 2505.14048v1 | null |
2025-05-20 | FedGraM: Defending Against Untargeted Attacks in Federated Learning via Embedding Gram Matrix | Di Wu et.al. | 2505.14024v1 | null |
2025-05-19 | New insight into the variability of the Be star $π$ Aquarii: Determination of stellar and disk parameters | D. Concha et.al. | 2505.13700v1 | null |
2025-05-19 | Optimal Client Sampling in Federated Learning with Client-Level Heterogeneous Differential Privacy | Jiahao Xu et.al. | 2505.13655v1 | null |
Unsupervised Learning
Unsupervised Learning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Generative Distribution Embeddings | Nic Fishman et.al. | 2505.18150v1 | null |
2025-05-23 | Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find | Owen Bianchi et.al. | 2505.18148v1 | null |
2025-05-23 | Nonadiabatic reactive scattering of hydrogen on different surface facets of copper | Wojciech G. Stark et.al. | 2505.18147v1 | null |
2025-05-23 | Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading | Mohamed Swailem et.al. | 2505.18145v1 | null |
2025-05-23 | INN-FF: A Scalable and Efficient Machine Learning Potential for Molecular Dynamics | Taskin Mehereen et.al. | 2505.18141v1 | null |
2025-05-23 | Boosting Open Set Recognition Performance through Modulated Representation Learning | Amit Kumar Kundu et.al. | 2505.18137v1 | null |
2025-05-23 | Gaming Tool Preferences in Agentic LLMs | Kazem Faghih et.al. | 2505.18135v1 | null |
2025-05-23 | VideoGameBench: Can Vision-Language Models complete popular video games? | Alex L. Zhang et.al. | 2505.18134v1 | null |
2025-05-23 | BiggerGait: Unlocking Gait Recognition with Layer-wise Representations from Large Vision Models | Dingqing Ye et.al. | 2505.18132v1 | null |
2025-05-23 | Leveraging KANs for Expedient Training of Multichannel MLPs via Preconditioning and Geometric Refinement | Jonas A. Actor et.al. | 2505.18131v1 | null |
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et.al. | 2505.18129v1 | null |
2025-05-23 | Tuning Thermal Conductivity and Electron-Phonon Interactions in Carbon and Boron Nitride Moiré Diamanes via Twist Angle Manipulation | Rustam Arabov et.al. | 2505.18127v1 | null |
2025-05-23 | Reward Model Overoptimisation in Iterated RLHF | Lorenz Wolf et.al. | 2505.18126v1 | null |
2025-05-23 | TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations | Alan Arazi et.al. | 2505.18125v1 | null |
2025-05-23 | ProgRM: Build Better GUI Agents with Progress Rewards | Danyang Zhang et.al. | 2505.18121v1 | null |
2025-05-23 | Scalable Policy Maximization Under Network Interference | Aidan Gleich et.al. | 2505.18118v1 | null |
2025-05-23 | Bridging Supervised Learning and Reinforcement Learning in Math Reasoning | Huayu Chen et.al. | 2505.18116v1 | null |
2025-05-23 | Beyond Discreteness: Finite-Sample Analysis of Straight-Through Estimator for Quantization | Halyun Jeong et.al. | 2505.18113v1 | null |
2025-05-23 | Accelerating Learned Image Compression Through Modeling Neural Training Dynamics | Yichi Zhang et.al. | 2505.18107v1 | null |
2025-05-23 | F-ANcGAN: An Attention-Enhanced Cycle Consistent Generative Adversarial Architecture for Synthetic Image Generation of Nanoparticles | Varun Ajith et.al. | 2505.18106v1 | null |
2025-05-23 | How Can I Publish My LLM Benchmark Without Giving the True Answers Away? | Takashi Ishida et.al. | 2505.18102v1 | null |
2025-05-23 | Dynamic Dual Buffer with Divide-and-Conquer Strategy for Online Continual Learning | Congren Dai et.al. | 2505.18101v1 | null |
2025-05-23 | Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL | Joey Hong et.al. | 2505.18098v1 | null |
2025-05-23 | Towards more transferable adversarial attack in black-box manner | Chun Tong Lei et.al. | 2505.18097v1 | null |
2025-05-23 | Data Mixing Can Induce Phase Transitions in Knowledge Acquisition | Xinran Gu et.al. | 2505.18091v1 | null |
2025-05-23 | Evaluation of derivatives using approximate generalized parameter shift rule | Vytautas Abramavicius et.al. | 2505.18090v1 | null |
2025-05-23 | Early-Exit Graph Neural Networks | Andrea Giuseppe Di Francesco et.al. | 2505.18088v1 | null |
2025-05-23 | CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays | Hyungyung Lee et.al. | 2505.18087v1 | null |
2025-05-23 | Stable Reinforcement Learning for Efficient Reasoning | Muzhi Dai et.al. | 2505.18086v1 | null |
2025-05-23 | What Do You Need for Diverse Trajectory Stitching in Diffusion Planning? | Quentin Clark et.al. | 2505.18083v1 | null |
Face Reenactment
Face Reenactment
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find | Owen Bianchi et.al. | 2505.18148v1 | null |
2025-05-23 | TokBench: Evaluating Your Visual Tokenizer before Visual Generation | Junfeng Wu et.al. | 2505.18142v1 | null |
2025-05-23 | Bidirectional Knowledge Distillation for Enhancing Sequential Recommendation with Large Language Models | Jiongran Wu et.al. | 2505.18120v1 | null |
2025-05-23 | From Temporal to Spatial: Designing Spatialized Interactions with Segmented-audios in Immersive Environments for Active Engagement with Performing Arts Intangible Cultural Heritage | Yuqi Wang et.al. | 2505.18112v1 | null |
2025-05-23 | DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations | Ziqiao Peng et.al. | 2505.18096v1 | null |
2025-05-23 | MathEDU: Towards Adaptive Feedback for Student Mathematical Problem-Solving | Wei-Ling Hsu et.al. | 2505.18056v1 | null |
2025-05-23 | Linear Mixture Distributionally Robust Markov Decision Processes | Zhishuai Liu et.al. | 2505.18044v1 | null |
2025-05-23 | A novel parameter-free and locking-free enriched Galerkin method for linear elasticity | Shuai Su et.al. | 2505.18042v1 | null |
2025-05-23 | 3D Face Reconstruction Error Decomposed: A Modular Benchmark for Fair and Fast Method Evaluation | Evangelos Sariyanidi et.al. | 2505.18025v1 | null |
2025-05-23 | ExoGait-MS: Learning Periodic Dynamics with Multi-Scale Graph Network for Exoskeleton Gait Recognition | Lijiang Liu et.al. | 2505.18018v1 | null |
2025-05-23 | Few-Shot Learning from Gigapixel Images via Hierarchical Vision-Language Alignment and Modeling | Bryan Wong et.al. | 2505.17982v1 | null |
2025-05-23 | DiffusionReward: Enhancing Blind Face Restoration through Reward Feedback Learning | Bin Wu et.al. | 2505.17910v1 | null |
2025-05-23 | TransDF: Time-Series Forecasting Needs Transformed Label Alignment | Hao Wang et.al. | 2505.17847v1 | null |
2025-05-23 | Automated Testing of the GUI of a Real-Life Engineering Software using Large Language Models | Tim Rosenbach et.al. | 2505.17839v1 | null |
2025-05-23 | Imagine Beyond! Distributionally Robust Auto-Encoding for State Space Coverage in Online Reinforcement Learning | Nicolas Castanet et.al. | 2505.17830v1 | null |
2025-05-23 | Temporal Consistency Constrained Transferable Adversarial Attacks with Background Mixup for Action Recognition | Ping Li et.al. | 2505.17807v1 | null |
2025-05-23 | Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour | Bálint Gyevnár et.al. | 2505.17801v1 | null |
2025-05-23 | A Distributionally-Robust Framework for Nuisance in Causal Effect Estimation | Akira Tanimoto et.al. | 2505.17717v1 | null |
2025-05-23 | The Third Pillar of Causal Analysis? A Measurement Perspective on Causal Representations | Dingling Yao et.al. | 2505.17708v1 | null |
2025-05-23 | CIKT: A Collaborative and Iterative Knowledge Tracing Framework with Large Language Models | Runze Li et.al. | 2505.17705v1 | null |
2025-05-23 | MIDB: Multilingual Instruction Data Booster for Enhancing Multilingual Instruction Synthesis | Yilun Liu et.al. | 2505.17671v1 | null |
2025-05-23 | Enhancing Large Vision-Language Models with Layout Modality for Table Question Answering on Japanese Annual Securities Reports | Hayato Aida et.al. | 2505.17625v1 | null |
2025-05-23 | Navigate the Unknown: Enhancing LLM Reasoning with Intrinsic Motivation Guided Exploration | Jingtong Gao et.al. | 2505.17621v1 | null |
2025-05-23 | Scaling Image and Video Generation via Test-Time Evolutionary Search | Haoran He et.al. | 2505.17618v1 | null |
2025-05-23 | Decoupled Visual Interpretation and Linguistic Reasoning for Math Problem Solving | Zixian Guo et.al. | 2505.17609v1 | null |
2025-05-23 | Wolf Hidden in Sheep's Conversations: Toward Harmless Data-Based Backdoor Attacks for Jailbreaking Large Language Models | Jiawei Kong et.al. | 2505.17601v1 | null |
2025-05-23 | Dynamic Text Bundling Supervision for Zero-Shot Inference on Text-Attributed Graphs | Yusheng Zhao et.al. | 2505.17599v1 | null |
2025-05-23 | NeUQI: Near-Optimal Uniform Quantization Parameter Initialization | Li Lin et.al. | 2505.17595v1 | null |
2025-05-23 | ProTAL: A Drag-and-Link Video Programming Framework for Temporal Action Localization | Yuchen He et.al. | 2505.17555v1 | null |
2025-05-23 | FreqU-FNet: Frequency-Aware U-Net for Imbalanced Medical Image Segmentation | Ruiqi Xing et.al. | 2505.17544v1 | null |
Talking Faces
Talking Faces
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations | Ziqiao Peng et.al. | 2505.18096v1 | null |
2025-05-23 | Heterogeneous Transmission of Analog Radio and Digital Coherent Signals Over Multi-Span Metro and PON for Bandwidth-Efficient Fronthaul in mmWave Centralized RAN Networks [Invited] | Devika Dass et.al. | 2505.17785v1 | null |
2025-05-23 | A Fully Generative Motivational Interviewing Counsellor Chatbot for Moving Smokers Towards the Decision to Quit | Zafarullah Mahmood et.al. | 2505.17362v1 | null |
2025-05-22 | LCSR predictions for $B \to K$ Hadronic Matrix Elements | Dayanand Mishra et.al. | 2505.16426v1 | null |
2025-05-21 | Language Specific Knowledge: Do Models Know Better in X than in English? | Ishika Agarwal et.al. | 2505.14990v1 | null |
2025-05-20 | Communication with Multiple Senders | Kailin Chen et.al. | 2505.14639v1 | null |
2025-05-20 | From Unaligned to Aligned: Scaling Multilingual LLMs with Multi-Way Parallel Corpora | Yingli Shen et.al. | 2505.14045v1 | null |
2025-05-20 | The Hidden Dangers of Outdated Software: A Cyber Security Perspective | Gogulakrishnan Thiyagarajan et.al. | 2505.13922v1 | null |
2025-05-19 | To Bias or Not to Bias: Detecting bias in News with bias-detector | Himel Ghosh et.al. | 2505.13010v1 | link |
2025-05-19 | Basis light-front quantization approach to deuteron | Chandan Mondal et.al. | 2505.12889v1 | null |
2025-05-19 | Language Models That Walk the Talk: A Framework for Formal Fairness Certificates | Danqing Chen et.al. | 2505.12767v1 | null |
2025-05-18 | What are they talking about? Benchmarking Large Language Models for Knowledge-Grounded Discussion Summarization | Weixiao Zhou et.al. | 2505.12474v1 | link |
2025-05-17 | Two-Photon Fusion Results at BESIII | Max Lellmann et.al. | 2505.12142v1 | null |
2025-05-17 | AR Secretary Agent: Real-time Memory Augmentation via LLM-powered Augmented Reality Glasses | Raphaël A. El Haddad et.al. | 2505.11888v1 | null |
2025-05-17 | Reputational cheap talk versus prior information | Allen Vong et.al. | 2505.11877v1 | null |
2025-05-16 | Talk to Your Slides: Language-Driven Agents for Efficient Slide Editing | Kyudan Jung et.al. | 2505.11604v2 | link |
2025-05-16 | Einstein Telescope an Cosmic Explorer | Matteo Di Giovanni et.al. | 2505.11033v1 | null |
2025-05-13 | Aspects of massive gauge fields | Anamaria Hell et.al. | 2505.08962v1 | null |
2025-05-13 | Simultaneous sweet-spot locking of gradiometric fluxonium qubits | Denis Bénâtre et.al. | 2505.08769v1 | null |
2025-05-13 | AI and Generative AI Transforming Disaster Management: A Survey of Damage Assessment and Response Techniques | Aman Raj et.al. | 2505.08202v1 | null |
2025-05-12 | Towards Actionable Pedagogical Feedback: A Multi-Perspective Analysis of Mathematics Teaching and Tutoring Dialogue | Jannatun Naim et.al. | 2505.07161v1 | link |
2025-05-11 | Polar Duality and the Donoho--Stark Uncertainty Principle | Maurice de Gosson et.al. | 2505.07037v1 | null |
2025-05-10 | VTutor: An Animated Pedagogical Agent SDK that Provide Real Time Multi-Model Feedback | Eason Chen et.al. | 2505.06676v1 | null |
2025-05-10 | RADE: A Neural Codec for Transmitting Speech over HF Radio Channels | David Rowe et.al. | 2505.06671v1 | null |
2025-05-10 | NeuroPal: A Clinically-Informed Multimodal LLM Assistant for Mental Health Combining Sleep Chronotherapy, Cognitive Behavioral Reframing, and Adaptive Phytochemical Intervention | Xiaoran Han et.al. | 2505.06640v1 | null |
2025-05-09 | An empathic GPT-based chatbot to talk about mental disorders with Spanish teenagers | Alba María Mármol-Romero et.al. | 2505.05828v1 | null |
2025-05-08 | Recent results of (semi)leptonic decays of charm hadrons at BESIII | Xiang Pan et.al. | 2505.05123v1 | null |
2025-05-07 | Accelerating Audio Research with Robotic Dummy Heads | Austin Lu et.al. | 2505.04548v1 | null |
2025-05-07 | Proceedings The 13th International Workshop on Theorem proving components for Educational software | Julien Narboux et.al. | 2505.04677v1 | null |
2025-05-07 | Jet Quenching in Heavy-Ion Collisions at RHIC and the LHC experiments | Nihar Ranjan Sahoo et.al. | 2505.04325v1 | null |
Graph Neural Network
Graph Neural Network
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | Generative Distribution Embeddings | Nic Fishman et.al. | 2505.18150v1 | null |
2025-05-23 | INN-FF: A Scalable and Efficient Machine Learning Potential for Molecular Dynamics | Taskin Mehereen et.al. | 2505.18141v1 | null |
2025-05-23 | Graph-Linguistic Fusion: Using Language Models for Wikidata Vandalism Detection | Mykola Trokhymovych et.al. | 2505.18136v1 | null |
2025-05-23 | Joint Encryption and Error Correction for Secure Quantum Communication | Nitin Jha et.al. | 2505.18133v1 | null |
2025-05-23 | Leveraging KANs for Expedient Training of Multichannel MLPs via Preconditioning and Geometric Refinement | Jonas A. Actor et.al. | 2505.18131v1 | null |
2025-05-23 | Scalable Policy Maximization Under Network Interference | Aidan Gleich et.al. | 2505.18118v1 | null |
2025-05-23 | Multi-Modal Spectral Parametrization Method (MMSPM) for analyzing EEG activity with distinct scaling regimes | Frigyes Samuel Racz et.al. | 2505.18117v1 | null |
2025-05-23 | Beyond Discreteness: Finite-Sample Analysis of Straight-Through Estimator for Quantization | Halyun Jeong et.al. | 2505.18113v1 | null |
2025-05-23 | Accelerating Learned Image Compression Through Modeling Neural Training Dynamics | Yichi Zhang et.al. | 2505.18107v1 | null |
2025-05-23 | F-ANcGAN: An Attention-Enhanced Cycle Consistent Generative Adversarial Architecture for Synthetic Image Generation of Nanoparticles | Varun Ajith et.al. | 2505.18106v1 | null |
2025-05-23 | Structural Dynamics of Harmful Content Dissemination on WhatsApp | Yuxin Liu et.al. | 2505.18099v1 | null |
2025-05-23 | Early-Exit Graph Neural Networks | Andrea Giuseppe Di Francesco et.al. | 2505.18088v1 | null |
2025-05-23 | An Iterative Framework for Generative Backmapping of Coarse Grained Proteins | Georgios Kementzidis et.al. | 2505.18082v1 | null |
2025-05-23 | Backpropagation-Free Metropolis-Adjusted Langevin Algorithm | Adam D. Cobb et.al. | 2505.18081v1 | null |
2025-05-23 | AFD-STA: Adaptive Filtering Denoising with Spatiotemporal Attention for Chaotic System Prediction | Chunlin Gong et.al. | 2505.18080v1 | null |
2025-05-23 | Emergence of Hebbian Dynamics in Regularized Non-Local Learners | David Koplow et.al. | 2505.18069v1 | null |
2025-05-23 | Preferential attachment and power-law degree distributions in heterogeneous multilayer hypergraphs | Francesco Di Lauro et.al. | 2505.18068v1 | null |
2025-05-23 | Virtual retractions in free constructions | Ashot Minasyan et.al. | 2505.18054v1 | null |
2025-05-23 | Learning with Restricted Boltzmann Machines: Asymptotics of AMP and GD in High Dimensions | Yizhou Xu et.al. | 2505.18046v1 | null |
2025-05-23 | The bipartite structure of treatment-trial networks reveals the flow of information in network meta-analysis | Annabel L Davies et.al. | 2505.18036v1 | null |
2025-05-23 | Structured Thinking Matters: Improving LLMs Generalization in Causal Inference Tasks | Wentao Sun et.al. | 2505.18034v1 | null |
2025-05-23 | Multiplexed multipartite quantum repeater rates in the stationary regime | Julia A. Kunzelmann et.al. | 2505.18031v1 | null |
2025-05-23 | Near optimal edge partitioning via intersecting families | Alexander Yakunin et.al. | 2505.18026v1 | null |
2025-05-23 | Time to Spike? Understanding the Representational Power of Spiking Neural Networks in Discrete Time | Duc Anh Nguyen et.al. | 2505.18023v1 | null |
2025-05-23 | Building Floor Number Estimation from Crowdsourced Street-Level Images: Munich Dataset and Baseline Method | Yao Sun et.al. | 2505.18021v1 | null |
2025-05-23 | ExoGait-MS: Learning Periodic Dynamics with Multi-Scale Graph Network for Exoskeleton Gait Recognition | Lijiang Liu et.al. | 2505.18018v1 | null |
2025-05-23 | On the geometric $k$-colored crossing number of $K_n$ | Benedikt Hahn et.al. | 2505.18014v1 | null |
2025-05-23 | Clinical Validation of Deep Learning for Real-Time Tissue Oxygenation Estimation Using Spectral Imaging | Jens De Winne et.al. | 2505.18010v1 | null |
2025-05-23 | Empathic network learning for multi-expert emergency decision-making under incomplete and inconsistent information | Simin Shen et.al. | 2505.18009v1 | null |
Transfer Learning
Transfer Learning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Generative Distribution Embeddings | Nic Fishman et.al. | 2505.18150v1 | null |
2025-05-23 | Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find | Owen Bianchi et.al. | 2505.18148v1 | null |
2025-05-23 | Nonadiabatic reactive scattering of hydrogen on different surface facets of copper | Wojciech G. Stark et.al. | 2505.18147v1 | null |
2025-05-23 | Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading | Mohamed Swailem et.al. | 2505.18145v1 | null |
2025-05-23 | INN-FF: A Scalable and Efficient Machine Learning Potential for Molecular Dynamics | Taskin Mehereen et.al. | 2505.18141v1 | null |
2025-05-23 | Effect of Fluorine doping on the electrocatalytic properties of Nb2O5 for H2O2 electrogeneration | Aline B. Trench et.al. | 2505.18140v1 | null |
2025-05-23 | Boosting Open Set Recognition Performance through Modulated Representation Learning | Amit Kumar Kundu et.al. | 2505.18137v1 | null |
2025-05-23 | Gaming Tool Preferences in Agentic LLMs | Kazem Faghih et.al. | 2505.18135v1 | null |
2025-05-23 | VideoGameBench: Can Vision-Language Models complete popular video games? | Alex L. Zhang et.al. | 2505.18134v1 | null |
2025-05-23 | BiggerGait: Unlocking Gait Recognition with Layer-wise Representations from Large Vision Models | Dingqing Ye et.al. | 2505.18132v1 | null |
2025-05-23 | Leveraging KANs for Expedient Training of Multichannel MLPs via Preconditioning and Geometric Refinement | Jonas A. Actor et.al. | 2505.18131v1 | null |
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et.al. | 2505.18129v1 | null |
2025-05-23 | Tuning Thermal Conductivity and Electron-Phonon Interactions in Carbon and Boron Nitride Moiré Diamanes via Twist Angle Manipulation | Rustam Arabov et.al. | 2505.18127v1 | null |
2025-05-23 | Reward Model Overoptimisation in Iterated RLHF | Lorenz Wolf et.al. | 2505.18126v1 | null |
2025-05-23 | TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations | Alan Arazi et.al. | 2505.18125v1 | null |
2025-05-23 | ProgRM: Build Better GUI Agents with Progress Rewards | Danyang Zhang et.al. | 2505.18121v1 | null |
2025-05-23 | Bidirectional Knowledge Distillation for Enhancing Sequential Recommendation with Large Language Models | Jiongran Wu et.al. | 2505.18120v1 | null |
2025-05-23 | Scalable Policy Maximization Under Network Interference | Aidan Gleich et.al. | 2505.18118v1 | null |
2025-05-23 | Bridging Supervised Learning and Reinforcement Learning in Math Reasoning | Huayu Chen et.al. | 2505.18116v1 | null |
2025-05-23 | Beyond Discreteness: Finite-Sample Analysis of Straight-Through Estimator for Quantization | Halyun Jeong et.al. | 2505.18113v1 | null |
2025-05-23 | Accelerating Learned Image Compression Through Modeling Neural Training Dynamics | Yichi Zhang et.al. | 2505.18107v1 | null |
2025-05-23 | F-ANcGAN: An Attention-Enhanced Cycle Consistent Generative Adversarial Architecture for Synthetic Image Generation of Nanoparticles | Varun Ajith et.al. | 2505.18106v1 | null |
2025-05-23 | How Can I Publish My LLM Benchmark Without Giving the True Answers Away? | Takashi Ishida et.al. | 2505.18102v1 | null |
2025-05-23 | Dynamic Dual Buffer with Divide-and-Conquer Strategy for Online Continual Learning | Congren Dai et.al. | 2505.18101v1 | null |
2025-05-23 | A new generation of effective core potentials: Selected lanthanides and heavy elements II | Omar Madany et.al. | 2505.18100v1 | null |
2025-05-23 | Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL | Joey Hong et.al. | 2505.18098v1 | null |
2025-05-23 | Towards more transferable adversarial attack in black-box manner | Chun Tong Lei et.al. | 2505.18097v1 | null |
2025-05-23 | Data Mixing Can Induce Phase Transitions in Knowledge Acquisition | Xinran Gu et.al. | 2505.18091v1 | null |
2025-05-23 | Evaluation of derivatives using approximate generalized parameter shift rule | Vytautas Abramavicius et.al. | 2505.18090v1 | null |
2025-05-23 | Early-Exit Graph Neural Networks | Andrea Giuseppe Di Francesco et.al. | 2505.18088v1 | null |
Reinforcement Learning
Reinforcement Learning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et.al. | 2505.18129v1 | null |
2025-05-23 | Reward Model Overoptimisation in Iterated RLHF | Lorenz Wolf et.al. | 2505.18126v1 | null |
2025-05-23 | Bridging Supervised Learning and Reinforcement Learning in Math Reasoning | Huayu Chen et.al. | 2505.18116v1 | null |
2025-05-23 | Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL | Joey Hong et.al. | 2505.18098v1 | null |
2025-05-23 | Stable Reinforcement Learning for Efficient Reasoning | Muzhi Dai et.al. | 2505.18086v1 | null |
2025-05-23 | What Do You Need for Diverse Trajectory Stitching in Diffusion Planning? | Quentin Clark et.al. | 2505.18083v1 | null |
2025-05-23 | Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals | Jia-Nan Li et.al. | 2505.18071v1 | null |
2025-05-23 | Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective | Jintian Shao et.al. | 2505.17997v1 | null |
2025-05-23 | Outcome-based Reinforcement Learning to Predict the Future | Benjamin Turtel et.al. | 2505.17989v1 | null |
2025-05-23 | Towards Revealing the Effectiveness of Small-Scale Fine-tuning in R1-style Reinforcement Learning | Yutong Chen et.al. | 2505.17988v1 | null |
2025-05-23 | Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL | Che Liu et.al. | 2505.17952v1 | null |
2025-05-23 | Semantic segmentation with reward | Xie Ting et.al. | 2505.17905v1 | null |
2025-05-23 | T2I-Eval-R1: Reinforcement Learning-Driven Reasoning for Interpretable Text-to-Image Evaluation | Zi-Ao Ma et.al. | 2505.17897v1 | null |
2025-05-23 | Formalizing Embeddedness Failures in Universal Artificial Intelligence | Cole Wyeth et.al. | 2505.17882v1 | null |
2025-05-23 | DesignX: Human-Competitive Algorithm Designer for Black-Box Optimization | Hongshu Guo et.al. | 2505.17866v1 | null |
2025-05-23 | Imagine Beyond! Distributionally Robust Auto-Encoding for State Space Coverage in Online Reinforcement Learning | Nicolas Castanet et.al. | 2505.17830v1 | null |
2025-05-23 | Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models | Xuchen Pan et.al. | 2505.17826v1 | null |
2025-05-23 | Temporal Consistency Constrained Transferable Adversarial Attacks with Background Mixup for Action Recognition | Ping Li et.al. | 2505.17807v1 | null |
2025-05-23 | Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour | Bálint Gyevnár et.al. | 2505.17801v1 | null |
2025-05-23 | DialogXpert: Driving Intelligent and Emotion-Aware Conversations through Online Value-Based Reinforcement Learning with LLM Priors | Tazeek Bin Abdur Rakib et.al. | 2505.17795v1 | null |
2025-05-23 | Mind the GAP! The Challenges of Scale in Pixel-based Deep Reinforcement Learning | Ghada Sokar et.al. | 2505.17749v1 | null |
2025-05-23 | Fast Quiet-STaR: Thinking Without Thought Tokens | Wei Huang et.al. | 2505.17746v1 | null |
2025-05-23 | MetaBox-v2: A Unified Benchmark Platform for Meta-Black-Box Optimization | Zeyuan Ma et.al. | 2505.17745v1 | null |
2025-05-23 | URB -- Urban Routing Benchmark for RL-equipped Connected Autonomous Vehicles | Ahmet Onur Akman et.al. | 2505.17734v1 | null |
2025-05-23 | PPO-BR: Dual-Signal Entropy-Reward Adaptation for Trust Region Policy Optimization | Ben Rahman et.al. | 2505.17714v1 | null |
2025-05-23 | Activation Control for Efficiently Eliciting Long Chain-of-thought Ability of Language Models | Zekai Zhao et.al. | 2505.17697v1 | null |
2025-05-23 | QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning | Fanqi Wan et.al. | 2505.17667v1 | null |
2025-05-23 | Plan-R1: Safe and Feasible Trajectory Planning as Language Modeling | Xiaolong Tang et.al. | 2505.17659v1 | null |
2025-05-23 | Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective | Deyang Kong et.al. | 2505.17652v1 | null |
2025-05-23 | H2-COMPACT: Human-Humanoid Co-Manipulation via Adaptive Contact Trajectory Policies | Geeta Chandra Raju Bethala et.al. | 2505.17627v1 | null |
Transformer
Vision Transformer
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | REN: Fast and Efficient Region Encodings from Patch-Based Image Encoders | Savya Khosla et.al. | 2505.18153v1 | null |
2025-05-23 | WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions | Zizhang Li et.al. | 2505.18151v1 | null |
2025-05-23 | TokBench: Evaluating Your Visual Tokenizer before Visual Generation | Junfeng Wu et.al. | 2505.18142v1 | null |
2025-05-23 | Boosting Open Set Recognition Performance through Modulated Representation Learning | Amit Kumar Kundu et.al. | 2505.18137v1 | null |
2025-05-23 | VideoGameBench: Can Vision-Language Models complete popular video games? | Alex L. Zhang et.al. | 2505.18134v1 | null |
2025-05-23 | BiggerGait: Unlocking Gait Recognition with Layer-wise Representations from Large Vision Models | Dingqing Ye et.al. | 2505.18132v1 | null |
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et.al. | 2505.18129v1 | null |
2025-05-23 | Multi-Modal Spectral Parametrization Method (MMSPM) for analyzing EEG activity with distinct scaling regimes | Frigyes Samuel Racz et.al. | 2505.18117v1 | null |
2025-05-23 | Instructify: Demystifying Metadata to Visual Instruction Tuning Data Conversion | Jacob Hansen et.al. | 2505.18115v1 | null |
2025-05-23 | From Temporal to Spatial: Designing Spatialized Interactions with Segmented-audios in Immersive Environments for Active Engagement with Performing Arts Intangible Cultural Heritage | Yuqi Wang et.al. | 2505.18112v1 | null |
2025-05-23 | Adapting SAM 2 for Visual Object Tracking: 1st Place Solution for MMVPR Challenge Multi-Modal Tracking | Cheng-Yen Yang et.al. | 2505.18111v1 | null |
2025-05-23 | Accelerating Learned Image Compression Through Modeling Neural Training Dynamics | Yichi Zhang et.al. | 2505.18107v1 | null |
2025-05-23 | F-ANcGAN: An Attention-Enhanced Cycle Consistent Generative Adversarial Architecture for Synthetic Image Generation of Nanoparticles | Varun Ajith et.al. | 2505.18106v1 | null |
2025-05-23 | Towards more transferable adversarial attack in black-box manner | Chun Tong Lei et.al. | 2505.18097v1 | null |
2025-05-23 | DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations | Ziqiao Peng et.al. | 2505.18096v1 | null |
2025-05-23 | CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays | Hyungyung Lee et.al. | 2505.18087v1 | null |
2025-05-23 | The Noether formalism for constructing conserved quantities in teleparallel equivalents of general relativity | E. D. Emtsova et.al. | 2505.18084v1 | null |
2025-05-23 | Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding | Xiaoyi Zhang et.al. | 2505.18079v1 | null |
2025-05-23 | DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation | Junhao Chen et.al. | 2505.18078v1 | null |
2025-05-23 | Semantic Correspondence: Unified Benchmarking and a Strong Baseline | Kaiyan Zhang et.al. | 2505.18060v1 | link |
2025-05-23 | A Foundation Model Framework for Multi-View MRI Classification of Extramural Vascular Invasion and Mesorectal Fascia Invasion in Rectal Cancer | Yumeng Zhang et.al. | 2505.18058v1 | null |
2025-05-23 | FDBPL: Faster Distillation-Based Prompt Learning for Region-Aware Vision-Language Models Adaptation | Zherui Zhang et.al. | 2505.18053v1 | null |
2025-05-23 | BOTM: Echocardiography Segmentation via Bi-directional Optimal Token Matching | Zhihua Liu et.al. | 2505.18052v1 | null |
2025-05-23 | LookWhere? Efficient Visual Recognition by Learning Where to Look and What to See from Self-Supervision | Anthony Fuller et.al. | 2505.18051v1 | null |
2025-05-23 | SpikeGen: Generative Framework for Visual Spike Stream Processing | Gaole Dai et.al. | 2505.18049v1 | null |
2025-05-23 | SHARDeg: A Benchmark for Skeletal Human Action Recognition in Degraded Scenarios | Simon Malzard et.al. | 2505.18048v1 | null |
2025-05-23 | RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration | Sudarshan Rajagopalan et.al. | 2505.18047v1 | null |
2025-05-23 | Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation | Li Zhong et.al. | 2505.18039v1 | null |
2025-05-23 | CAMME: Adaptive Deepfake Image Detection with Multi-Modal Cross-Attention | Naseem Khan et.al. | 2505.18035v1 | null |
2025-05-23 | Mahalanobis++: Improving OOD Detection via Feature Normalization | Maximilian Mueller et.al. | 2505.18032v1 | null |
Transformer
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Multi-Modal Spectral Parametrization Method (MMSPM) for analyzing EEG activity with distinct scaling regimes | Frigyes Samuel Racz et.al. | 2505.18117v1 | null |
2025-05-23 | From Temporal to Spatial: Designing Spatialized Interactions with Segmented-audios in Immersive Environments for Active Engagement with Performing Arts Intangible Cultural Heritage | Yuqi Wang et.al. | 2505.18112v1 | null |
2025-05-23 | Accelerating Learned Image Compression Through Modeling Neural Training Dynamics | Yichi Zhang et.al. | 2505.18107v1 | null |
2025-05-23 | The Noether formalism for constructing conserved quantities in teleparallel equivalents of general relativity | E. D. Emtsova et.al. | 2505.18084v1 | null |
2025-05-23 | A Foundation Model Framework for Multi-View MRI Classification of Extramural Vascular Invasion and Mesorectal Fascia Invasion in Rectal Cancer | Yumeng Zhang et.al. | 2505.18058v1 | null |
2025-05-23 | LookWhere? Efficient Visual Recognition by Learning Where to Look and What to See from Self-Supervision | Anthony Fuller et.al. | 2505.18051v1 | null |
2025-05-23 | RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration | Sudarshan Rajagopalan et.al. | 2505.18047v1 | null |
2025-05-23 | A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency | Xiaobao Wei et.al. | 2505.18024v1 | null |
2025-05-23 | Classification of assembly tasks combining multiple primitive actions using Transformers and xLSTMs | Miguel Neves et.al. | 2505.18012v1 | null |
2025-05-23 | TRACE for Tracking the Emergence of Semantic Representations in Transformers | Nura Aljaafari et.al. | 2505.17998v1 | null |
2025-05-23 | Canonical Pose Reconstruction from Single Depth Image for 3D Non-rigid Pose Recovery on Limited Datasets | Fahd Alhamazani et.al. | 2505.17992v1 | null |
2025-05-23 | ADLGen: Synthesizing Symbolic, Event-Triggered Sensor Sequences for Human Activity Modeling | Weihang You et.al. | 2505.17987v1 | null |
2025-05-23 | Explainable Anatomy-Guided AI for Prostate MRI: Foundation Models and In Silico Clinical Trials for Virtual Biopsy-based Risk Assessment | Danial Khan et.al. | 2505.17971v1 | null |
2025-05-23 | SVD-Free Low-Rank Adaptive Gradient Optimization for Large Language Models | Ionut-Vlad Modoranu et.al. | 2505.17967v1 | null |
2025-05-23 | Understanding Gated Neurons in Transformers from Their Input-Output Functionality | Sebastian Gerstner et.al. | 2505.17936v1 | null |
2025-05-23 | Selection Mechanisms for Sequence Modeling using Linear State Space Models | Umberto Casti et.al. | 2505.17932v1 | null |
2025-05-23 | Predicting Length of Stay in Neurological ICU Patients Using Classical Machine Learning and Neural Network Models: A Benchmark Study on MIMIC-IV | Alexander Gabitashvili et.al. | 2505.17929v1 | null |
2025-05-23 | Language models can learn implicit multi-hop reasoning, but only if they have lots of training data | Yuekun Yao et.al. | 2505.17923v1 | null |
2025-05-23 | Isospectrality and non-locality of generalized Dirac combs | Giuliano Angelone et.al. | 2505.17920v1 | null |
2025-05-23 | NeuroTrails: Training with Dynamic Sparse Heads as the Key to Effective Ensembling | Bram Grooten et.al. | 2505.17909v1 | null |
2025-05-23 | SpectraLDS: Provable Distillation for Linear Dynamical Systems | Devan Shah et.al. | 2505.17868v1 | null |
2025-05-23 | The emergence of sparse attention: impact of data distribution and benefits of repetition | Nicolas Zucchet et.al. | 2505.17863v1 | null |
2025-05-23 | Stochastic Weight Sharing for Bayesian Neural Networks | Moule Lin et.al. | 2505.17856v1 | null |
2025-05-23 | Scaling Recurrent Neural Networks to a Billion Parameters with Zero-Order Optimization | Francois Chaubard et.al. | 2505.17852v1 | null |
2025-05-23 | TransDF: Time-Series Forecasting Needs Transformed Label Alignment | Hao Wang et.al. | 2505.17847v1 | null |
2025-05-23 | Continuum Transformers Perform In-Context Learning by Operator Gradient Descent | Abhiti Mishra et.al. | 2505.17838v1 | null |
2025-05-23 | Hybrid Mamba-Transformer Decoder for Error-Correcting Codes | Shy-el Cohen et.al. | 2505.17834v1 | null |
2025-05-23 | Low-Resource NMT: A Case Study on the Written and Spoken Languages in Hong Kong | Hei Yi Mak et.al. | 2505.17816v1 | null |
2025-05-23 | An Attention Infused Deep Learning System with Grad-CAM Visualization for Early Screening of Glaucoma | Ramanathan Swaminathan et.al. | 2505.17808v1 | null |
2025-05-23 | Temporal Consistency Constrained Transferable Adversarial Attacks with Background Mixup for Action Recognition | Ping Li et.al. | 2505.17807v1 | null |
Contrastive Learning
Contrastive Learning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-05-23 | Generative Distribution Embeddings | Nic Fishman et.al. | 2505.18150v1 | null |
2025-05-23 | Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find | Owen Bianchi et.al. | 2505.18148v1 | null |
2025-05-23 | Nonadiabatic reactive scattering of hydrogen on different surface facets of copper | Wojciech G. Stark et.al. | 2505.18147v1 | null |
2025-05-23 | Stochastic agent-based Monte Carlo simulations for reaction-diffusion models, population dynamics, and epidemic spreading | Mohamed Swailem et.al. | 2505.18145v1 | null |
2025-05-23 | INN-FF: A Scalable and Efficient Machine Learning Potential for Molecular Dynamics | Taskin Mehereen et.al. | 2505.18141v1 | null |
2025-05-23 | Embracing Contradiction: Theoretical Inconsistency Will Not Impede the Road of Building Responsible AI Systems | Gordon Dai et.al. | 2505.18139v1 | null |
2025-05-23 | Boosting Open Set Recognition Performance through Modulated Representation Learning | Amit Kumar Kundu et.al. | 2505.18137v1 | null |
2025-05-23 | Gaming Tool Preferences in Agentic LLMs | Kazem Faghih et.al. | 2505.18135v1 | null |
2025-05-23 | VideoGameBench: Can Vision-Language Models complete popular video games? | Alex L. Zhang et.al. | 2505.18134v1 | null |
2025-05-23 | BiggerGait: Unlocking Gait Recognition with Layer-wise Representations from Large Vision Models | Dingqing Ye et.al. | 2505.18132v1 | null |
2025-05-23 | Leveraging KANs for Expedient Training of Multichannel MLPs via Preconditioning and Geometric Refinement | Jonas A. Actor et.al. | 2505.18131v1 | null |
2025-05-23 | Loss Functions for Measuring the Accuracy of Nonnegative Cross-Sectional Predictions | Charles D. Coleman et.al. | 2505.18130v1 | null |
2025-05-23 | One RL to See Them All: Visual Triple Unified Reinforcement Learning | Yan Ma et.al. | 2505.18129v1 | null |
2025-05-23 | Tuning Thermal Conductivity and Electron-Phonon Interactions in Carbon and Boron Nitride Moiré Diamanes via Twist Angle Manipulation | Rustam Arabov et.al. | 2505.18127v1 | null |
2025-05-23 | Reward Model Overoptimisation in Iterated RLHF | Lorenz Wolf et.al. | 2505.18126v1 | null |
2025-05-23 | TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations | Alan Arazi et.al. | 2505.18125v1 | null |
2025-05-23 | Multiparty entanglement loops in quantum spin liquids | Liuke Lyu et.al. | 2505.18124v1 | null |
2025-05-23 | ProgRM: Build Better GUI Agents with Progress Rewards | Danyang Zhang et.al. | 2505.18121v1 | null |
2025-05-23 | Scalable Policy Maximization Under Network Interference | Aidan Gleich et.al. | 2505.18118v1 | null |
2025-05-23 | Bridging Supervised Learning and Reinforcement Learning in Math Reasoning | Huayu Chen et.al. | 2505.18116v1 | null |
2025-05-23 | Beyond Discreteness: Finite-Sample Analysis of Straight-Through Estimator for Quantization | Halyun Jeong et.al. | 2505.18113v1 | null |
2025-05-23 | Accelerating Learned Image Compression Through Modeling Neural Training Dynamics | Yichi Zhang et.al. | 2505.18107v1 | null |
2025-05-23 | F-ANcGAN: An Attention-Enhanced Cycle Consistent Generative Adversarial Architecture for Synthetic Image Generation of Nanoparticles | Varun Ajith et.al. | 2505.18106v1 | null |
2025-05-23 | How Can I Publish My LLM Benchmark Without Giving the True Answers Away? | Takashi Ishida et.al. | 2505.18102v1 | null |
2025-05-23 | Dynamic Dual Buffer with Divide-and-Conquer Strategy for Online Continual Learning | Congren Dai et.al. | 2505.18101v1 | null |
2025-05-23 | Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL | Joey Hong et.al. | 2505.18098v1 | null |
2025-05-23 | Towards more transferable adversarial attack in black-box manner | Chun Tong Lei et.al. | 2505.18097v1 | null |
2025-05-23 | Data Mixing Can Induce Phase Transitions in Knowledge Acquisition | Xinran Gu et.al. | 2505.18091v1 | null |
2025-05-23 | Evaluation of derivatives using approximate generalized parameter shift rule | Vytautas Abramavicius et.al. | 2505.18090v1 | null |
2025-05-23 | Early-Exit Graph Neural Networks | Andrea Giuseppe Di Francesco et.al. | 2505.18088v1 | null |