arxiv-daily
Automated deployment @ 2025-06-23 11:51:17 Asia/Shanghai
Welcome to contribute! Add your topics and keywords in
topic.yml
. You can also view historical data through the storage.
3D Vision
Point Cloud
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Emergent Temporal Correspondences from Video Diffusion Transformers | Jisu Nam et.al. | 2506.17220v1 | link |
2025-06-20 | The full automorphism groups of the five symmetric $(15,8,4)$-designs | Mark Pankov et.al. | 2506.17216v1 | null |
2025-06-20 | Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation | Xiuyu Yang et.al. | 2506.17213v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | On Energy-Efficient Passive Beamforming Design of RIS-Assisted CoMP-NOMA Networks | Muhammad Umer et.al. | 2506.17189v1 | null |
2025-06-20 | Optimal Implicit Bias in Linear Regression | Kanumuri Nithin Varma et.al. | 2506.17187v1 | null |
2025-06-20 | Partition function for several Ising model interface structures | Alessio Squarcini et.al. | 2506.17170v1 | null |
2025-06-20 | Higher dimensional Sacks-Uhlenbeck-type functionals and applications | Gianmichele Di Matteo et.al. | 2506.17166v1 | null |
2025-06-20 | Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity | Samin Yeasar Arnob et.al. | 2506.17155v1 | null |
2025-06-20 | Reassessing Code Authorship Attribution in the Era of Language Models | Atish Kumar Dipongkor et.al. | 2506.17120v1 | null |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110v1 | null |
2025-06-20 | GW200208_222617 as an eccentric black-hole binary merger: properties and astrophysical implications | Isobel Romero-Shaw et.al. | 2506.17105v1 | null |
2025-06-20 | Can an Extra Degree of Freedom in Scalar-Tensor Non-Metricity Gravity Account for the Evolution of the Universe? | Ghulam Murtaza et.al. | 2506.17099v1 | null |
2025-06-20 | Instabilities in Colloidal Crystals on Fluid Membranes | Sanjay Dharmavaram et.al. | 2506.17098v1 | null |
2025-06-20 | The existence of quasi-periodic invariant tori and double Hopf bifurcation of van der Pol's oscillator with delayed feedback | Xuemei Li et.al. | 2506.17097v1 | null |
2025-06-20 | Super-Earth formation in systems with cold giants | Claudia Danti et.al. | 2506.17091v1 | null |
2025-06-20 | Better Language Model Inversion by Compactly Representing Next-Token Distributions | Murtaza Nazir et.al. | 2506.17090v1 | null |
2025-06-20 | Simultaneous Translation with Offline Speech and LLM Models in CUNI Submission to IWSLT 2025 | Dominik Macháček et.al. | 2506.17077v1 | null |
2025-06-20 | Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion | Wang Zhao et.al. | 2506.17074v1 | null |
2025-06-20 | Homological stability and Manin's conjecture for rational curves on quartic del Pezzo surfaces | Ronno Das et.al. | 2506.17071v1 | null |
2025-06-20 | Regular homomorphisms, with a twist | Jeff Achter et.al. | 2506.17033v1 | null |
2025-06-20 | Multistability and Noise-Induced Transitions in Dispersively-Coupled Nonlinear Nanomechanical Modes | David Allemeier et.al. | 2506.17026v1 | null |
2025-06-20 | Accumulation of Device-Independent Quantum Randomness against Time-Ordered No-Signalling Adversaries | Ravishankar Ramanathan et.al. | 2506.17020v1 | null |
2025-06-20 | Large-amplitude periodic solutions to the steady Euler equations with piecewise constant vorticity | Alex Doak et.al. | 2506.17002v1 | null |
2025-06-20 | ForestFormer3D: A Unified Framework for End-to-End Segmentation of Forest LiDAR 3D Point Clouds | Binbin Xiang et.al. | 2506.16991v1 | null |
2025-06-20 | SmartGuard: Leveraging Large Language Models for Network Attack Detection through Audit Log Analysis and Summarization | Hao Zhang et.al. | 2506.16981v1 | null |
2025-06-20 | Minimum-Weight Half-Plane Hitting Set | Gang Liu et.al. | 2506.16979v1 | null |
2025-06-20 | Autoregressive Hypergraph | Xianghe Zhu et.al. | 2506.16966v1 | null |
2025-06-20 | Wi-Fi Sensing Tool Release: Gathering 802.11ax Channel State Information from a Commercial Wi-Fi Access Point | Zisheng Wang et.al. | 2506.16957v1 | null |
Point Cloud Registration
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Emergent Temporal Correspondences from Video Diffusion Transformers | Jisu Nam et.al. | 2506.17220v1 | link |
2025-06-20 | The full automorphism groups of the five symmetric $(15,8,4)$-designs | Mark Pankov et.al. | 2506.17216v1 | null |
2025-06-20 | Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation | Xiuyu Yang et.al. | 2506.17213v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | On Energy-Efficient Passive Beamforming Design of RIS-Assisted CoMP-NOMA Networks | Muhammad Umer et.al. | 2506.17189v1 | null |
2025-06-20 | Optimal Implicit Bias in Linear Regression | Kanumuri Nithin Varma et.al. | 2506.17187v1 | null |
2025-06-20 | Partition function for several Ising model interface structures | Alessio Squarcini et.al. | 2506.17170v1 | null |
2025-06-20 | Higher dimensional Sacks-Uhlenbeck-type functionals and applications | Gianmichele Di Matteo et.al. | 2506.17166v1 | null |
2025-06-20 | Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity | Samin Yeasar Arnob et.al. | 2506.17155v1 | null |
2025-06-20 | Reassessing Code Authorship Attribution in the Era of Language Models | Atish Kumar Dipongkor et.al. | 2506.17120v1 | null |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110v1 | null |
2025-06-20 | GW200208_222617 as an eccentric black-hole binary merger: properties and astrophysical implications | Isobel Romero-Shaw et.al. | 2506.17105v1 | null |
2025-06-20 | Can an Extra Degree of Freedom in Scalar-Tensor Non-Metricity Gravity Account for the Evolution of the Universe? | Ghulam Murtaza et.al. | 2506.17099v1 | null |
2025-06-20 | Instabilities in Colloidal Crystals on Fluid Membranes | Sanjay Dharmavaram et.al. | 2506.17098v1 | null |
2025-06-20 | The existence of quasi-periodic invariant tori and double Hopf bifurcation of van der Pol's oscillator with delayed feedback | Xuemei Li et.al. | 2506.17097v1 | null |
2025-06-20 | Super-Earth formation in systems with cold giants | Claudia Danti et.al. | 2506.17091v1 | null |
2025-06-20 | Better Language Model Inversion by Compactly Representing Next-Token Distributions | Murtaza Nazir et.al. | 2506.17090v1 | null |
2025-06-20 | Simultaneous Translation with Offline Speech and LLM Models in CUNI Submission to IWSLT 2025 | Dominik Macháček et.al. | 2506.17077v1 | null |
2025-06-20 | Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion | Wang Zhao et.al. | 2506.17074v1 | null |
2025-06-20 | Homological stability and Manin's conjecture for rational curves on quartic del Pezzo surfaces | Ronno Das et.al. | 2506.17071v1 | null |
2025-06-20 | Regular homomorphisms, with a twist | Jeff Achter et.al. | 2506.17033v1 | null |
2025-06-20 | Multistability and Noise-Induced Transitions in Dispersively-Coupled Nonlinear Nanomechanical Modes | David Allemeier et.al. | 2506.17026v1 | null |
2025-06-20 | Accumulation of Device-Independent Quantum Randomness against Time-Ordered No-Signalling Adversaries | Ravishankar Ramanathan et.al. | 2506.17020v1 | null |
2025-06-20 | Large-amplitude periodic solutions to the steady Euler equations with piecewise constant vorticity | Alex Doak et.al. | 2506.17002v1 | null |
2025-06-20 | ForestFormer3D: A Unified Framework for End-to-End Segmentation of Forest LiDAR 3D Point Clouds | Binbin Xiang et.al. | 2506.16991v1 | null |
2025-06-20 | SmartGuard: Leveraging Large Language Models for Network Attack Detection through Audit Log Analysis and Summarization | Hao Zhang et.al. | 2506.16981v1 | null |
2025-06-20 | Minimum-Weight Half-Plane Hitting Set | Gang Liu et.al. | 2506.16979v1 | null |
2025-06-20 | Autoregressive Hypergraph | Xianghe Zhu et.al. | 2506.16966v1 | null |
2025-06-20 | Wi-Fi Sensing Tool Release: Gathering 802.11ax Channel State Information from a Commercial Wi-Fi Access Point | Zisheng Wang et.al. | 2506.16957v1 | null |
Visual Localization
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Confidence Scoring for LLM-Generated SQL in Supply Chain Data Extraction | Jiekai Ma et.al. | 2506.17203v1 | null |
2025-06-20 | Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition | Jiaqi Li et.al. | 2506.17201v1 | null |
2025-06-20 | Towards AI Search Paradigm | Yuchen Li et.al. | 2506.17188v1 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186v1 | null |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110v1 | null |
2025-06-20 | Open-Path Methane Sensing via Backscattered Light in a Nonlinear Interferometer | Jinghan Dong et.al. | 2506.17107v1 | null |
2025-06-20 | Universal Music Representations? Evaluating Foundation Models on World Music Corpora | Charilaos Papaioannou et.al. | 2506.17055v1 | null |
2025-06-20 | PersonalAI: Towards digital twins in the graph form | Mikhail Menschikov et.al. | 2506.17001v1 | null |
2025-06-20 | Directional Dark Field for Nanoscale Full-Field Transmission X-Ray Microscopy | Sami Wirtensohn et.al. | 2506.16998v1 | null |
2025-06-20 | RAGentA: Multi-Agent Retrieval-Augmented Generation for Attributed Question Answering | Ines Besrour et.al. | 2506.16988v1 | null |
2025-06-20 | Learning Accurate Whole-body Throwing with High-frequency Residual Policy and Pullback Tube Acceleration | Yuntao Ma et.al. | 2506.16986v1 | null |
2025-06-20 | LAION-C: An Out-of-Distribution Benchmark for Web-Scale Vision Models | Fanfei Li et.al. | 2506.16950v1 | null |
2025-06-20 | Pyramid Mixer: Multi-dimensional Multi-period Interest Modeling for Sequential Recommendation | Zhen Gong et.al. | 2506.16942v1 | null |
2025-06-20 | Multimodal Fused Learning for Solving the Generalized Traveling Salesman Problem in Robotic Task Planning | Jiaqi Chen et.al. | 2506.16931v1 | null |
2025-06-20 | Low-Energy Supernova Constraints on Lepton Flavor Violating Axions | Zi-Miao Huang et.al. | 2506.16922v1 | null |
2025-06-20 | COSMIC-L: A Photometric Catalog of Observed Stars in the Large MagellanIc Cloud | A. Franco et.al. | 2506.16896v1 | null |
2025-06-20 | With Limited Data for Multimodal Alignment, Let the STRUCTURE Guide You | Fabian Gröger et.al. | 2506.16895v1 | null |
2025-06-20 | Multi-Objective Recommendation in the Era of Generative AI: A Survey of Recent Progress and Future Prospects | Zihan Hong et.al. | 2506.16893v1 | null |
2025-06-20 | From Lab to Factory: Pitfalls and Guidelines for Self-/Unsupervised Defect Detection on Low-Quality Industrial Images | Sebastian Hönel et.al. | 2506.16890v1 | null |
2025-06-20 | Vision-Based Multirotor Control for Spherical Target Tracking: A Bearing-Angle Approach | Marcelo Jacinto et.al. | 2506.16870v1 | null |
2025-06-20 | ParkFormer: A Transformer-Based Parking Policy with Goal Embedding and Pedestrian-Aware Control | Jun Fu et.al. | 2506.16856v1 | null |
2025-06-20 | Camera Calibration via Circular Patterns: A Comprehensive Framework with Measurement Uncertainty and Unbiased Projection Model | Chaehyeon Song et.al. | 2506.16842v1 | null |
2025-06-20 | Loupe: A Generalizable and Adaptive Framework for Image Forgery Detection | Yuchu Jiang et.al. | 2506.16819v1 | null |
2025-06-20 | Theoretical novel medical isotope production with deuterium-tritium fusion technology | Lee J. Evitts et.al. | 2506.16817v1 | null |
2025-06-20 | Integrating Traditional Technical Analysis with AI: A Multi-Agent LLM-Based Approach to Stock Market Forecasting | Michał Wawer et.al. | 2506.16813v1 | null |
2025-06-20 | What Is the Point of Equality in Machine Learning Fairness? Beyond Equality of Opportunity | Youjin Kong et.al. | 2506.16782v1 | null |
2025-06-20 | eSapiens: A Real-World NLP Framework for Multimodal Document Understanding and Enterprise Knowledge Processing | Isaac Shi et.al. | 2506.16768v1 | null |
2025-06-20 | H-QuEST: Accelerating Query-by-Example Spoken Term Detection with Hierarchical Indexing | Akanksha Singh et.al. | 2506.16751v1 | null |
2025-06-20 | Class Agnostic Instance-level Descriptor for Visual Instance Search | Qi-Ying Sun et.al. | 2506.16745v1 | null |
2025-06-20 | Few-Shot Generalized Category Discovery With Retrieval-Guided Decision Boundary Enhancement | Yunhan Ren et.al. | 2506.16728v1 | null |
Point Cloud Matching
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Emergent Temporal Correspondences from Video Diffusion Transformers | Jisu Nam et.al. | 2506.17220v1 | link |
2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et.al. | 2506.17219v1 | null |
2025-06-20 | The full automorphism groups of the five symmetric $(15,8,4)$-designs | Mark Pankov et.al. | 2506.17216v1 | null |
2025-06-20 | Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation | Xiuyu Yang et.al. | 2506.17213v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | On Energy-Efficient Passive Beamforming Design of RIS-Assisted CoMP-NOMA Networks | Muhammad Umer et.al. | 2506.17189v1 | null |
2025-06-20 | Optimal Implicit Bias in Linear Regression | Kanumuri Nithin Varma et.al. | 2506.17187v1 | null |
2025-06-20 | Deep generative models as the probability transformation functions | Vitalii Bondar et.al. | 2506.17171v1 | null |
2025-06-20 | Partition function for several Ising model interface structures | Alessio Squarcini et.al. | 2506.17170v1 | null |
2025-06-20 | Higher dimensional Sacks-Uhlenbeck-type functionals and applications | Gianmichele Di Matteo et.al. | 2506.17166v1 | null |
2025-06-20 | Shock formation in 1D conservation laws II: Vanishing viscosity | John Anderson et.al. | 2506.17156v1 | null |
2025-06-20 | Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity | Samin Yeasar Arnob et.al. | 2506.17155v1 | null |
2025-06-20 | Do We Need Large VLMs for Spotting Soccer Actions? | Ritabrata Chakraborty et.al. | 2506.17144v1 | null |
2025-06-20 | Reassessing Code Authorship Attribution in the Era of Language Models | Atish Kumar Dipongkor et.al. | 2506.17120v1 | null |
2025-06-20 | Large Average Subtensor Problem: Ground-State, Algorithms, and Algorithmic Barriers | Abhishek Hegade K. R. et.al. | 2506.17118v1 | null |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110v1 | null |
2025-06-20 | Open-Path Methane Sensing via Backscattered Light in a Nonlinear Interferometer | Jinghan Dong et.al. | 2506.17107v1 | null |
2025-06-20 | GW200208_222617 as an eccentric black-hole binary merger: properties and astrophysical implications | Isobel Romero-Shaw et.al. | 2506.17105v1 | null |
2025-06-20 | Can an Extra Degree of Freedom in Scalar-Tensor Non-Metricity Gravity Account for the Evolution of the Universe? | Ghulam Murtaza et.al. | 2506.17099v1 | null |
2025-06-20 | Instabilities in Colloidal Crystals on Fluid Membranes | Sanjay Dharmavaram et.al. | 2506.17098v1 | null |
2025-06-20 | The existence of quasi-periodic invariant tori and double Hopf bifurcation of van der Pol's oscillator with delayed feedback | Xuemei Li et.al. | 2506.17097v1 | null |
2025-06-20 | Super-Earth formation in systems with cold giants | Claudia Danti et.al. | 2506.17091v1 | null |
2025-06-20 | Better Language Model Inversion by Compactly Representing Next-Token Distributions | Murtaza Nazir et.al. | 2506.17090v1 | null |
2025-06-20 | Simultaneous Translation with Offline Speech and LLM Models in CUNI Submission to IWSLT 2025 | Dominik Macháček et.al. | 2506.17077v1 | null |
2025-06-20 | Neural Polar Decoders for DNA Data Storage | Ziv Aharoni et.al. | 2506.17076v1 | null |
2025-06-20 | Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion | Wang Zhao et.al. | 2506.17074v1 | null |
2025-06-20 | Homological stability and Manin's conjecture for rational curves on quartic del Pezzo surfaces | Ronno Das et.al. | 2506.17071v1 | null |
2025-06-20 | Navigating the Deep: Signature Extraction on Deep Neural Networks | Haolin Liu et.al. | 2506.17047v1 | null |
2025-06-20 | Regular homomorphisms, with a twist | Jeff Achter et.al. | 2506.17033v1 | null |
3D Object Detection
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning | Zhangyang Qi et.al. | 2506.17221v1 | null |
2025-06-20 | Emergent Temporal Correspondences from Video Diffusion Transformers | Jisu Nam et.al. | 2506.17220v1 | link |
2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et.al. | 2506.17219v1 | null |
2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et.al. | 2506.17218v1 | null |
2025-06-20 | Bias hardened estimators of patchy screening profiles | Noah Sailer et.al. | 2506.17217v1 | null |
2025-06-20 | Hierarchical constraints on gravitational waves from horizonless compact objects | Rajrupa Mondal et.al. | 2506.17215v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | DreamCube: 3D Panorama Generation via Multi-plane Synchronization | Yukun Huang et.al. | 2506.17206v1 | null |
2025-06-20 | Efficient Implementation of Multi-sensor Adaptive Birth Samplers for Labeled Random Finite Set Tracking | Jennifer Bondarchuk et.al. | 2506.17205v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | Detecting LLM-Generated Short Answers and Effects on Learner Performance | Shambhavi Bhushan et.al. | 2506.17196v1 | null |
2025-06-20 | Gravitational lensing observables in stationary and axisymmetric solutions in general relativity | Matteo Luca Ruggiero et.al. | 2506.17192v1 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186v1 | null |
2025-06-20 | Variational Learning of Disentangled Representations | Yuli Slavutsky et.al. | 2506.17182v1 | null |
2025-06-20 | Feedback cooling scheme for an optically levitated oscillator with controlled cross-talk | J. M. H. Gosling et.al. | 2506.17172v1 | null |
2025-06-20 | Partition function for several Ising model interface structures | Alessio Squarcini et.al. | 2506.17170v1 | null |
2025-06-20 | Scaling limits for sample autocovariance operators of Hilbert space-valued linear processes | Marie-Christine Düker et.al. | 2506.17168v1 | null |
2025-06-20 | Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model | Side Liu et.al. | 2506.17162v1 | null |
2025-06-20 | Profile monitoring of random functions with Gaussian process basis expansions | Takayuki Iguchi et.al. | 2506.17153v1 | null |
2025-06-20 | Fully Self-Consistent Semiclassical Gravity | R. Muciño et.al. | 2506.17149v1 | null |
2025-06-20 | Do We Need Large VLMs for Spotting Soccer Actions? | Ritabrata Chakraborty et.al. | 2506.17144v1 | null |
2025-06-20 | On the Theory of Conditional Feature Alignment for Unsupervised Domain-Adaptive Counting | Zhuonan Liang et.al. | 2506.17137v1 | null |
2025-06-20 | Dynamic Watermark Generation for Digital Images using Perimeter Gated SPAD Imager PUFs | Md Sakibur Sajal et.al. | 2506.17134v1 | null |
2025-06-20 | Robust Training with Data Augmentation for Medical Imaging Classification | Josué Martínez-Martínez et.al. | 2506.17133v1 | null |
2025-06-20 | Rapid and Continuous Trust Evaluation for Effective Task Collaboration Through Siamese Model | Botao Zhu et.al. | 2506.17128v1 | null |
2025-06-20 | Reassessing Code Authorship Attribution in the Era of Language Models | Atish Kumar Dipongkor et.al. | 2506.17120v1 | null |
2025-06-20 | RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking | Teng Guo et.al. | 2506.17119v1 | null |
2025-06-20 | MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation | Shoubin Yu et.al. | 2506.17113v1 | null |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110v1 | null |
2025-06-20 | Searching for a Hidden Markov Anomaly over Multiple Processes | Levli Citron et.al. | 2506.17108v1 | null |
Point Cloud Segmentation
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Emergent Temporal Correspondences from Video Diffusion Transformers | Jisu Nam et.al. | 2506.17220v1 | link |
2025-06-20 | The full automorphism groups of the five symmetric $(15,8,4)$-designs | Mark Pankov et.al. | 2506.17216v1 | null |
2025-06-20 | Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation | Xiuyu Yang et.al. | 2506.17213v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | On Energy-Efficient Passive Beamforming Design of RIS-Assisted CoMP-NOMA Networks | Muhammad Umer et.al. | 2506.17189v1 | null |
2025-06-20 | Optimal Implicit Bias in Linear Regression | Kanumuri Nithin Varma et.al. | 2506.17187v1 | null |
2025-06-20 | Partition function for several Ising model interface structures | Alessio Squarcini et.al. | 2506.17170v1 | null |
2025-06-20 | Higher dimensional Sacks-Uhlenbeck-type functionals and applications | Gianmichele Di Matteo et.al. | 2506.17166v1 | null |
2025-06-20 | Codeword-Segmentation Rate-Splitting Multiple Access and Evaluation under Suboptimal Decoding | Sibo Zhang et.al. | 2506.17164v1 | null |
2025-06-20 | Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation | Qing Xu et.al. | 2506.17159v1 | null |
2025-06-20 | Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity | Samin Yeasar Arnob et.al. | 2506.17155v1 | null |
2025-06-20 | Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations | Dongdong Meng et.al. | 2506.17136v1 | null |
2025-06-20 | Reassessing Code Authorship Attribution in the Era of Language Models | Atish Kumar Dipongkor et.al. | 2506.17120v1 | null |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110v1 | null |
2025-06-20 | GW200208_222617 as an eccentric black-hole binary merger: properties and astrophysical implications | Isobel Romero-Shaw et.al. | 2506.17105v1 | null |
2025-06-20 | Can an Extra Degree of Freedom in Scalar-Tensor Non-Metricity Gravity Account for the Evolution of the Universe? | Ghulam Murtaza et.al. | 2506.17099v1 | null |
2025-06-20 | Instabilities in Colloidal Crystals on Fluid Membranes | Sanjay Dharmavaram et.al. | 2506.17098v1 | null |
2025-06-20 | The existence of quasi-periodic invariant tori and double Hopf bifurcation of van der Pol's oscillator with delayed feedback | Xuemei Li et.al. | 2506.17097v1 | null |
2025-06-20 | Super-Earth formation in systems with cold giants | Claudia Danti et.al. | 2506.17091v1 | null |
2025-06-20 | Better Language Model Inversion by Compactly Representing Next-Token Distributions | Murtaza Nazir et.al. | 2506.17090v1 | null |
2025-06-20 | Simultaneous Translation with Offline Speech and LLM Models in CUNI Submission to IWSLT 2025 | Dominik Macháček et.al. | 2506.17077v1 | null |
2025-06-20 | Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion | Wang Zhao et.al. | 2506.17074v1 | null |
2025-06-20 | Homological stability and Manin's conjecture for rational curves on quartic del Pezzo surfaces | Ronno Das et.al. | 2506.17071v1 | null |
2025-06-20 | Flow-Based Non-stationary Temporal Regime Causal Structure Learning | Abdellah Rahmani et.al. | 2506.17065v1 | null |
2025-06-20 | Regular homomorphisms, with a twist | Jeff Achter et.al. | 2506.17033v1 | null |
2025-06-20 | Multistability and Noise-Induced Transitions in Dispersively-Coupled Nonlinear Nanomechanical Modes | David Allemeier et.al. | 2506.17026v1 | null |
2025-06-20 | Accumulation of Device-Independent Quantum Randomness against Time-Ordered No-Signalling Adversaries | Ravishankar Ramanathan et.al. | 2506.17020v1 | null |
2025-06-20 | Large-amplitude periodic solutions to the steady Euler equations with piecewise constant vorticity | Alex Doak et.al. | 2506.17002v1 | null |
2025-06-20 | ForestFormer3D: A Unified Framework for End-to-End Segmentation of Forest LiDAR 3D Point Clouds | Binbin Xiang et.al. | 2506.16991v1 | null |
Point Cloud Completion
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning | Zhangyang Qi et.al. | 2506.17221v1 | null |
2025-06-20 | Emergent Temporal Correspondences from Video Diffusion Transformers | Jisu Nam et.al. | 2506.17220v1 | link |
2025-06-20 | The full automorphism groups of the five symmetric $(15,8,4)$-designs | Mark Pankov et.al. | 2506.17216v1 | null |
2025-06-20 | Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation | Xiuyu Yang et.al. | 2506.17213v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | BREAD: Branched Rollouts from Expert Anchors Bridge SFT & RL for Reasoning | Xuechen Zhang et.al. | 2506.17211v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | On Energy-Efficient Passive Beamforming Design of RIS-Assisted CoMP-NOMA Networks | Muhammad Umer et.al. | 2506.17189v1 | null |
2025-06-20 | Optimal Implicit Bias in Linear Regression | Kanumuri Nithin Varma et.al. | 2506.17187v1 | null |
2025-06-20 | Partition function for several Ising model interface structures | Alessio Squarcini et.al. | 2506.17170v1 | null |
2025-06-20 | Higher dimensional Sacks-Uhlenbeck-type functionals and applications | Gianmichele Di Matteo et.al. | 2506.17166v1 | null |
2025-06-20 | Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity | Samin Yeasar Arnob et.al. | 2506.17155v1 | null |
2025-06-20 | A Note on Proper Relational Structures | Adam Bjorndahl et.al. | 2506.17142v1 | null |
2025-06-20 | An Elementary Characterization of Bargmann Invariants | Sagar Silva Pratapsi et.al. | 2506.17132v1 | null |
2025-06-20 | Chain-of-Trust: A Progressive Trust Evaluation Framework Enabled by Generative AI | Botao Zhu et.al. | 2506.17130v1 | null |
2025-06-20 | Rapid and Continuous Trust Evaluation for Effective Task Collaboration Through Siamese Model | Botao Zhu et.al. | 2506.17128v1 | null |
2025-06-20 | Reassessing Code Authorship Attribution in the Era of Language Models | Atish Kumar Dipongkor et.al. | 2506.17120v1 | null |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110v1 | null |
2025-06-20 | Limit theorems under nonlinear expectations dominated by sublinear expectations | Xiaojuan Li et.al. | 2506.17109v1 | null |
2025-06-20 | GW200208_222617 as an eccentric black-hole binary merger: properties and astrophysical implications | Isobel Romero-Shaw et.al. | 2506.17105v1 | null |
2025-06-20 | Can an Extra Degree of Freedom in Scalar-Tensor Non-Metricity Gravity Account for the Evolution of the Universe? | Ghulam Murtaza et.al. | 2506.17099v1 | null |
2025-06-20 | Instabilities in Colloidal Crystals on Fluid Membranes | Sanjay Dharmavaram et.al. | 2506.17098v1 | null |
2025-06-20 | The existence of quasi-periodic invariant tori and double Hopf bifurcation of van der Pol's oscillator with delayed feedback | Xuemei Li et.al. | 2506.17097v1 | null |
2025-06-20 | A Spectral Gap for Spinors on Hyperbolic Surfaces | Anshul Adve et.al. | 2506.17092v1 | null |
2025-06-20 | Super-Earth formation in systems with cold giants | Claudia Danti et.al. | 2506.17091v1 | null |
2025-06-20 | Better Language Model Inversion by Compactly Representing Next-Token Distributions | Murtaza Nazir et.al. | 2506.17090v1 | null |
2025-06-20 | Simultaneous Translation with Offline Speech and LLM Models in CUNI Submission to IWSLT 2025 | Dominik Macháček et.al. | 2506.17077v1 | null |
2025-06-20 | Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion | Wang Zhao et.al. | 2506.17074v1 | null |
2025-06-20 | Homological stability and Manin's conjecture for rational curves on quartic del Pezzo surfaces | Ronno Das et.al. | 2506.17071v1 | null |
2025-06-20 | Quantum k-SAT Related Hypergraph Problems | Simon-Luca Kremer et.al. | 2506.17066v1 | null |
3D Object Tracking
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning | Zhangyang Qi et.al. | 2506.17221v1 | null |
2025-06-20 | Emergent Temporal Correspondences from Video Diffusion Transformers | Jisu Nam et.al. | 2506.17220v1 | link |
2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et.al. | 2506.17219v1 | null |
2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et.al. | 2506.17218v1 | null |
2025-06-20 | Hierarchical constraints on gravitational waves from horizonless compact objects | Rajrupa Mondal et.al. | 2506.17215v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems | Matias Martinez et.al. | 2506.17208v1 | null |
2025-06-20 | DreamCube: 3D Panorama Generation via Multi-plane Synchronization | Yukun Huang et.al. | 2506.17206v1 | null |
2025-06-20 | Efficient Implementation of Multi-sensor Adaptive Birth Samplers for Labeled Random Finite Set Tracking | Jennifer Bondarchuk et.al. | 2506.17205v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | On Energy-Efficient Passive Beamforming Design of RIS-Assisted CoMP-NOMA Networks | Muhammad Umer et.al. | 2506.17189v1 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186v1 | null |
2025-06-20 | Variational Learning of Disentangled Representations | Yuli Slavutsky et.al. | 2506.17182v1 | null |
2025-06-20 | Feedback cooling scheme for an optically levitated oscillator with controlled cross-talk | J. M. H. Gosling et.al. | 2506.17172v1 | null |
2025-06-20 | Scaling limits for sample autocovariance operators of Hilbert space-valued linear processes | Marie-Christine Düker et.al. | 2506.17168v1 | null |
2025-06-20 | Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model | Side Liu et.al. | 2506.17162v1 | null |
2025-06-20 | Fully Self-Consistent Semiclassical Gravity | R. Muciño et.al. | 2506.17149v1 | null |
2025-06-20 | On the Theory of Conditional Feature Alignment for Unsupervised Domain-Adaptive Counting | Zhuonan Liang et.al. | 2506.17137v1 | null |
2025-06-20 | RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking | Teng Guo et.al. | 2506.17119v1 | null |
2025-06-20 | MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation | Shoubin Yu et.al. | 2506.17113v1 | null |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110v1 | null |
2025-06-20 | Searching for a Hidden Markov Anomaly over Multiple Processes | Levli Citron et.al. | 2506.17108v1 | null |
2025-06-20 | PCG-Informed Neural Solvers for High-Resolution Homogenization of Periodic Microstructures | Yu Xing et.al. | 2506.17087v1 | null |
2025-06-20 | Matroids, intersecting bases, and Borsuk property | Gyivan López-Campos et.al. | 2506.17082v1 | null |
2025-06-20 | Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion | Wang Zhao et.al. | 2506.17074v1 | null |
2025-06-20 | LLM-Based Bot Broadens the Range of Arguments in Online Discussions, Even When Transparently Disclosed as AI | Valeria Vuk et.al. | 2506.17073v1 | null |
2025-06-20 | Empowering Near-Field Communications in Low-Altitude Economy with LLM: Fundamentals, Potentials, Solutions, and Future Directions | Zhuo Xu et.al. | 2506.17067v1 | null |
2025-06-20 | Behavior Driven Development for 3D Games | Fernando Pastor Ricós et.al. | 2506.17057v1 | null |
2025-06-20 | Phase Transition of the Ising Model on a 3-Dimensional Fractal Lattice | Jozef Genzor et.al. | 2506.17053v1 | null |
2025-06-20 | Stretching Beyond the Obvious: A Gradient-Free Framework to Unveil the Hidden Landscape of Visual Invariance | Lorenzo Tausani et.al. | 2506.17040v1 | null |
Keypoint Detection
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Class Agnostic Instance-level Descriptor for Visual Instance Search | Qi-Ying Sun et.al. | 2506.16745v1 | null |
2025-06-20 | The simplest chaos indicator derived from Lagrangian descriptors | Javier Jiménez-López et.al. | 2506.16660v1 | null |
2025-06-19 | How Far Can Off-the-Shelf Multimodal Large Language Models Go in Online Episodic Memory Question Answering? | Giuseppe Lando et.al. | 2506.16450v1 | null |
2025-06-19 | STAR-Pose: Efficient Low-Resolution Video Human Pose Estimation via Spatial-Temporal Adaptive Super-Resolution | Yucheng Jin et.al. | 2506.16061v1 | null |
2025-06-19 | Unveiling defect motifs in amorphous GeSe using machine learning interatomic potentials | Minseok Moon et.al. | 2506.15934v1 | null |
2025-06-18 | A new Surrogate Microstructure Generator for Porous Materials with Applications to the Buffer Layer of TRISO Nuclear Fuel Particles | Philipp Eisenhardt et.al. | 2506.15874v1 | null |
2025-06-18 | Descriptor-based Foundation Models for Molecular Property Prediction | Jackson Burns et.al. | 2506.15792v1 | null |
2025-06-18 | Maximizing solubility in rock salt high-entropy oxides | Matthew Furst et.al. | 2506.15604v1 | null |
2025-06-18 | MCOO-SLAM: A Multi-Camera Omnidirectional Object SLAM System | Miaoxin Pan et.al. | 2506.15402v1 | null |
2025-06-18 | High-Entropy Skutterudites as Thermoelectrics: Synthesizability and Band Convergence via the Cocktail Effect | Jose J. Plata et.al. | 2506.15324v1 | null |
2025-06-18 | SHeRLoc: Synchronized Heterogeneous Radar Place Recognition for Cross-Modal Localization | Hanjun Kim et.al. | 2506.15175v1 | null |
2025-06-18 | Enhancing point cloud analysis via neighbor aggregation correction based on cross-stage structure correlation | Jiaqi Shi et.al. | 2506.15160v1 | link |
2025-06-18 | VIMS: A Visual-Inertial-Magnetic-Sonar SLAM System in Underwater Environments | Bingbing Zhang et.al. | 2506.15126v1 | null |
2025-06-17 | Q2SAR: A Quantum Multiple Kernel Learning Approach for Drug Discovery | Alejandro Giraldo et.al. | 2506.14920v1 | null |
2025-06-17 | Cross-Modal Geometric Hierarchy Fusion: An Implicit-Submap Driven Framework for Resilient 3D Place Recognition | Xiaohui Jiang et.al. | 2506.14243v2 | link |
2025-06-17 | AMPLIFY: Actionless Motion Priors for Robot Learning from Videos | Jeremy A. Collins et.al. | 2506.14198v1 | null |
2025-06-17 | Compositional fluctuations and polymorph selection in crystallization of model soft colloids | Abhilasha Kumari et.al. | 2506.14109v1 | null |
2025-06-16 | AutoSAS: a new human-aside-the-loop paradigm for automated SAS fitting for high throughput and autonomous experimentation | Duncan R. Sutherland et.al. | 2506.13918v1 | null |
2025-06-16 | ATK: Automatic Task-driven Keypoint Selection for Robust Policy Learning | Yunchu Zhang et.al. | 2506.13867v1 | null |
2025-06-16 | Audio-Visual Driven Compression for Low-Bitrate Talking Head Videos | Riku Takahashi et.al. | 2506.13419v1 | null |
2025-06-16 | Quantitative Comparison of Fine-Tuning Techniques for Pretrained Latent Diffusion Models in the Generation of Unseen SAR Image Concepts | Solène Debuysère et.al. | 2506.13307v1 | null |
2025-06-16 | SuperPoint-SLAM3: Augmenting ORB-SLAM3 with Deep Features, Adaptive NMS, and Learning-Based Loop Closure | Shahram Najam Syed et.al. | 2506.13089v1 | link |
2025-06-16 | MAMMA: Markerless & Automatic Multi-Person Motion Action Capture | Hanz Cuevas-Velasquez et.al. | 2506.13040v1 | null |
2025-06-16 | DETRPose: Real-time end-to-end transformer model for multi-person pose estimation | Sebastian Janampa et.al. | 2506.13027v1 | link |
2025-06-15 | A large-scale, physically-based synthetic dataset for satellite pose estimation | Szabolcs Velkei et.al. | 2506.12782v1 | null |
2025-06-14 | Tailored ordering enables high-capacity cathode materials | Tzu-chen Liu et.al. | 2506.12545v2 | null |
2025-06-14 | Information fusion strategy integrating pre-trained language model and contrastive learning for materials knowledge mining | Yongqian Peng et.al. | 2506.12516v1 | null |
2025-06-13 | Interpretable representation learning of quantum data enabled by probabilistic variational autoencoders | Paulin de Schoulepnikoff et.al. | 2506.11982v2 | null |
2025-06-13 | Spectra-to-Structure and Structure-to-Spectra Inference Across the Periodic Table | Yufeng Wang et.al. | 2506.11908v1 | null |
2025-06-12 | A detailed and comprehensive account of fractional Physics-Informed Neural Networks: From implementation to efficiency | Donya Dabiri et.al. | 2506.11241v1 | null |
Image Matching
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Emergent Temporal Correspondences from Video Diffusion Transformers | Jisu Nam et.al. | 2506.17220v1 | link |
2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et.al. | 2506.17219v1 | null |
2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et.al. | 2506.17218v1 | null |
2025-06-20 | DreamCube: 3D Panorama Generation via Multi-plane Synchronization | Yukun Huang et.al. | 2506.17206v1 | null |
2025-06-20 | UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation | Teng Li et.al. | 2506.17202v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | Gravitational lensing observables in stationary and axisymmetric solutions in general relativity | Matteo Luca Ruggiero et.al. | 2506.17192v1 | null |
2025-06-20 | Facial Landmark Visualization and Emotion Recognition Through Neural Networks | Israel Juárez-Jiménez et.al. | 2506.17191v1 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186v1 | null |
2025-06-20 | Variational Learning of Disentangled Representations | Yuli Slavutsky et.al. | 2506.17182v1 | null |
2025-06-20 | High-accuracy inference using HfO$_x$S$_y$/HfS$_2$ Memristors | Aferdita Xhameni et.al. | 2506.17174v1 | null |
2025-06-20 | Deep generative models as the probability transformation functions | Vitalii Bondar et.al. | 2506.17171v1 | null |
2025-06-20 | Proportional Sensitivity in Generative Adversarial Network (GAN)-Augmented Brain Tumor Classification Using Convolutional Neural Network | Mahin Montasir Afif et.al. | 2506.17165v1 | null |
2025-06-20 | Walking Fingerprinting Using Wrist Accelerometry During Activities of Daily Living in NHANES | Lily Koffman et.al. | 2506.17160v1 | null |
2025-06-20 | Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation | Qing Xu et.al. | 2506.17159v1 | null |
2025-06-20 | Shock formation in 1D conservation laws II: Vanishing viscosity | John Anderson et.al. | 2506.17156v1 | null |
2025-06-20 | Do We Need Large VLMs for Spotting Soccer Actions? | Ritabrata Chakraborty et.al. | 2506.17144v1 | null |
2025-06-20 | MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification | David Jacob Drexlin et.al. | 2506.17140v1 | null |
2025-06-20 | Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations | Dongdong Meng et.al. | 2506.17136v1 | null |
2025-06-20 | Dynamic Watermark Generation for Digital Images using Perimeter Gated SPAD Imager PUFs | Md Sakibur Sajal et.al. | 2506.17134v1 | null |
2025-06-20 | Robust Training with Data Augmentation for Medical Imaging Classification | Josué Martínez-Martínez et.al. | 2506.17133v1 | null |
2025-06-20 | Real-time Broadband RFI Excision for the Upgraded GMRT | Ruta Kale et.al. | 2506.17131v1 | null |
2025-06-20 | Large Average Subtensor Problem: Ground-State, Algorithms, and Algorithmic Barriers | Abhishek Hegade K. R. et.al. | 2506.17118v1 | null |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110v1 | null |
2025-06-20 | Open-Path Methane Sensing via Backscattered Light in a Nonlinear Interferometer | Jinghan Dong et.al. | 2506.17107v1 | null |
2025-06-20 | Neural Polar Decoders for DNA Data Storage | Ziv Aharoni et.al. | 2506.17076v1 | null |
2025-06-20 | Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion | Wang Zhao et.al. | 2506.17074v1 | null |
2025-06-20 | Client Selection Strategies for Federated Semantic Communications in Heterogeneous IoT Networks | Samer Lahoud et.al. | 2506.17063v1 | null |
2025-06-20 | From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers | Jingtong Su et.al. | 2506.17052v1 | null |
2025-06-20 | Navigating the Deep: Signature Extraction on Deep Neural Networks | Haolin Liu et.al. | 2506.17047v1 | null |
Computer Vision
Image Classification
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et.al. | 2506.17218v1 | null |
2025-06-20 | DreamCube: 3D Panorama Generation via Multi-plane Synchronization | Yukun Huang et.al. | 2506.17206v1 | null |
2025-06-20 | UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation | Teng Li et.al. | 2506.17202v1 | null |
2025-06-20 | Gravitational lensing observables in stationary and axisymmetric solutions in general relativity | Matteo Luca Ruggiero et.al. | 2506.17192v1 | null |
2025-06-20 | Facial Landmark Visualization and Emotion Recognition Through Neural Networks | Israel Juárez-Jiménez et.al. | 2506.17191v1 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186v1 | null |
2025-06-20 | Variational Learning of Disentangled Representations | Yuli Slavutsky et.al. | 2506.17182v1 | null |
2025-06-20 | High-accuracy inference using HfO$_x$S$_y$/HfS$_2$ Memristors | Aferdita Xhameni et.al. | 2506.17174v1 | null |
2025-06-20 | Proportional Sensitivity in Generative Adversarial Network (GAN)-Augmented Brain Tumor Classification Using Convolutional Neural Network | Mahin Montasir Afif et.al. | 2506.17165v1 | null |
2025-06-20 | Walking Fingerprinting Using Wrist Accelerometry During Activities of Daily Living in NHANES | Lily Koffman et.al. | 2506.17160v1 | null |
2025-06-20 | Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation | Qing Xu et.al. | 2506.17159v1 | null |
2025-06-20 | Affine semigroups without consecutive small elements | J. C. Rosales et.al. | 2506.17152v1 | null |
2025-06-20 | MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification | David Jacob Drexlin et.al. | 2506.17140v1 | null |
2025-06-20 | Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations | Dongdong Meng et.al. | 2506.17136v1 | null |
2025-06-20 | Dynamic Watermark Generation for Digital Images using Perimeter Gated SPAD Imager PUFs | Md Sakibur Sajal et.al. | 2506.17134v1 | null |
2025-06-20 | Robust Training with Data Augmentation for Medical Imaging Classification | Josué Martínez-Martínez et.al. | 2506.17133v1 | null |
2025-06-20 | Real-time Broadband RFI Excision for the Upgraded GMRT | Ruta Kale et.al. | 2506.17131v1 | null |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110v1 | null |
2025-06-20 | Open-Path Methane Sensing via Backscattered Light in a Nonlinear Interferometer | Jinghan Dong et.al. | 2506.17107v1 | null |
2025-06-20 | Acquiring and Accumulating Knowledge from Diverse Datasets for Multi-label Driving Scene Classification | Ke Li et.al. | 2506.17101v1 | null |
2025-06-20 | Brain-inspired interpretable reservoir computing with resonant recurrent neural networks | Mark A. Kramer et.al. | 2506.17083v1 | null |
2025-06-20 | Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion | Wang Zhao et.al. | 2506.17074v1 | null |
2025-06-20 | Cross-Modal Epileptic Signal Harmonization: Frequency Domain Mapping Quantization for Pre-training a Unified Neurophysiological Transformer | Runkai Zhang et.al. | 2506.17068v1 | null |
2025-06-20 | Client Selection Strategies for Federated Semantic Communications in Heterogeneous IoT Networks | Samer Lahoud et.al. | 2506.17063v1 | null |
2025-06-20 | From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers | Jingtong Su et.al. | 2506.17052v1 | null |
2025-06-20 | MUCAR: Benchmarking Multilingual Cross-Modal Ambiguity Resolution for Multimodal Large Language Models | Xiaolong Wang et.al. | 2506.17046v1 | null |
2025-06-20 | Stretching Beyond the Obvious: A Gradient-Free Framework to Unveil the Hidden Landscape of Visual Invariance | Lorenzo Tausani et.al. | 2506.17040v1 | null |
2025-06-20 | Unsupervised Image Super-Resolution Reconstruction Based on Real-World Degradation Patterns | Yiyang Tie et.al. | 2506.17027v1 | null |
2025-06-20 | The Hidden Cost of an Image: Quantifying the Energy Consumption of AI Image Generation | Giulia Bertazzini et.al. | 2506.17016v1 | null |
2025-06-20 | Directional Dark Field for Nanoscale Full-Field Transmission X-Ray Microscopy | Sami Wirtensohn et.al. | 2506.16998v1 | null |
Multi-Object Tracking
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Emergent Temporal Correspondences from Video Diffusion Transformers | Jisu Nam et.al. | 2506.17220v1 | link |
2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et.al. | 2506.17219v1 | null |
2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et.al. | 2506.17218v1 | null |
2025-06-20 | Hierarchical constraints on gravitational waves from horizonless compact objects | Rajrupa Mondal et.al. | 2506.17215v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems | Matias Martinez et.al. | 2506.17208v1 | null |
2025-06-20 | DreamCube: 3D Panorama Generation via Multi-plane Synchronization | Yukun Huang et.al. | 2506.17206v1 | null |
2025-06-20 | Efficient Implementation of Multi-sensor Adaptive Birth Samplers for Labeled Random Finite Set Tracking | Jennifer Bondarchuk et.al. | 2506.17205v1 | null |
2025-06-20 | Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning | Guozheng Ma et.al. | 2506.17204v1 | null |
2025-06-20 | Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition | Jiaqi Li et.al. | 2506.17201v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | Facial Landmark Visualization and Emotion Recognition Through Neural Networks | Israel Juárez-Jiménez et.al. | 2506.17191v1 | null |
2025-06-20 | On Energy-Efficient Passive Beamforming Design of RIS-Assisted CoMP-NOMA Networks | Muhammad Umer et.al. | 2506.17189v1 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186v1 | null |
2025-06-20 | Variational Learning of Disentangled Representations | Yuli Slavutsky et.al. | 2506.17182v1 | null |
2025-06-20 | Scaling limits for sample autocovariance operators of Hilbert space-valued linear processes | Marie-Christine Düker et.al. | 2506.17168v1 | null |
2025-06-20 | Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model | Side Liu et.al. | 2506.17162v1 | null |
2025-06-20 | Fully Self-Consistent Semiclassical Gravity | R. Muciño et.al. | 2506.17149v1 | null |
2025-06-20 | Do We Need Large VLMs for Spotting Soccer Actions? | Ritabrata Chakraborty et.al. | 2506.17144v1 | null |
2025-06-20 | On the Theory of Conditional Feature Alignment for Unsupervised Domain-Adaptive Counting | Zhuonan Liang et.al. | 2506.17137v1 | null |
2025-06-20 | RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking | Teng Guo et.al. | 2506.17119v1 | null |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110v1 | null |
2025-06-20 | Searching for a Hidden Markov Anomaly over Multiple Processes | Levli Citron et.al. | 2506.17108v1 | null |
2025-06-20 | Brain-inspired interpretable reservoir computing with resonant recurrent neural networks | Mark A. Kramer et.al. | 2506.17083v1 | null |
2025-06-20 | Matroids, intersecting bases, and Borsuk property | Gyivan López-Campos et.al. | 2506.17082v1 | null |
2025-06-20 | Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion | Wang Zhao et.al. | 2506.17074v1 | null |
2025-06-20 | LLM-Based Bot Broadens the Range of Arguments in Online Discussions, Even When Transparently Disclosed as AI | Valeria Vuk et.al. | 2506.17073v1 | null |
2025-06-20 | Empowering Near-Field Communications in Low-Altitude Economy with LLM: Fundamentals, Potentials, Solutions, and Future Directions | Zhuo Xu et.al. | 2506.17067v1 | null |
2025-06-20 | From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers | Jingtong Su et.al. | 2506.17052v1 | null |
2025-06-20 | MUCAR: Benchmarking Multilingual Cross-Modal Ambiguity Resolution for Multimodal Large Language Models | Xiaolong Wang et.al. | 2506.17046v1 | null |
Image Matching
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Emergent Temporal Correspondences from Video Diffusion Transformers | Jisu Nam et.al. | 2506.17220v1 | link |
2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et.al. | 2506.17219v1 | null |
2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et.al. | 2506.17218v1 | null |
2025-06-20 | DreamCube: 3D Panorama Generation via Multi-plane Synchronization | Yukun Huang et.al. | 2506.17206v1 | null |
2025-06-20 | UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation | Teng Li et.al. | 2506.17202v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | Gravitational lensing observables in stationary and axisymmetric solutions in general relativity | Matteo Luca Ruggiero et.al. | 2506.17192v1 | null |
2025-06-20 | Facial Landmark Visualization and Emotion Recognition Through Neural Networks | Israel Juárez-Jiménez et.al. | 2506.17191v1 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186v1 | null |
2025-06-20 | Variational Learning of Disentangled Representations | Yuli Slavutsky et.al. | 2506.17182v1 | null |
2025-06-20 | High-accuracy inference using HfO$_x$S$_y$/HfS$_2$ Memristors | Aferdita Xhameni et.al. | 2506.17174v1 | null |
2025-06-20 | Deep generative models as the probability transformation functions | Vitalii Bondar et.al. | 2506.17171v1 | null |
2025-06-20 | Proportional Sensitivity in Generative Adversarial Network (GAN)-Augmented Brain Tumor Classification Using Convolutional Neural Network | Mahin Montasir Afif et.al. | 2506.17165v1 | null |
2025-06-20 | Walking Fingerprinting Using Wrist Accelerometry During Activities of Daily Living in NHANES | Lily Koffman et.al. | 2506.17160v1 | null |
2025-06-20 | Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation | Qing Xu et.al. | 2506.17159v1 | null |
2025-06-20 | Shock formation in 1D conservation laws II: Vanishing viscosity | John Anderson et.al. | 2506.17156v1 | null |
2025-06-20 | Do We Need Large VLMs for Spotting Soccer Actions? | Ritabrata Chakraborty et.al. | 2506.17144v1 | null |
2025-06-20 | MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification | David Jacob Drexlin et.al. | 2506.17140v1 | null |
2025-06-20 | Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations | Dongdong Meng et.al. | 2506.17136v1 | null |
2025-06-20 | Dynamic Watermark Generation for Digital Images using Perimeter Gated SPAD Imager PUFs | Md Sakibur Sajal et.al. | 2506.17134v1 | null |
2025-06-20 | Robust Training with Data Augmentation for Medical Imaging Classification | Josué Martínez-Martínez et.al. | 2506.17133v1 | null |
2025-06-20 | Real-time Broadband RFI Excision for the Upgraded GMRT | Ruta Kale et.al. | 2506.17131v1 | null |
2025-06-20 | Large Average Subtensor Problem: Ground-State, Algorithms, and Algorithmic Barriers | Abhishek Hegade K. R. et.al. | 2506.17118v1 | null |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110v1 | null |
2025-06-20 | Open-Path Methane Sensing via Backscattered Light in a Nonlinear Interferometer | Jinghan Dong et.al. | 2506.17107v1 | null |
2025-06-20 | Neural Polar Decoders for DNA Data Storage | Ziv Aharoni et.al. | 2506.17076v1 | null |
2025-06-20 | Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion | Wang Zhao et.al. | 2506.17074v1 | null |
2025-06-20 | Client Selection Strategies for Federated Semantic Communications in Heterogeneous IoT Networks | Samer Lahoud et.al. | 2506.17063v1 | null |
2025-06-20 | From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers | Jingtong Su et.al. | 2506.17052v1 | null |
2025-06-20 | Navigating the Deep: Signature Extraction on Deep Neural Networks | Haolin Liu et.al. | 2506.17047v1 | null |
Instance Segmentation
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Lower Bounds against the Ideal Proof System in Finite Fields | Tal Elbaz et.al. | 2506.17210v1 | null |
2025-06-20 | Any nonincreasing convergence curves are simultaneously possible for GMRES and weighted GMRES, as well as for left and right preconditioned GMRES | Pierre Matalon et.al. | 2506.17193v1 | null |
2025-06-20 | Codeword-Segmentation Rate-Splitting Multiple Access and Evaluation under Suboptimal Decoding | Sibo Zhang et.al. | 2506.17164v1 | null |
2025-06-20 | Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation | Qing Xu et.al. | 2506.17159v1 | null |
2025-06-20 | The odd spectral localiser via asymptotic morphisms and quasi-projections | Yuezhao Li et.al. | 2506.17143v1 | null |
2025-06-20 | Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations | Dongdong Meng et.al. | 2506.17136v1 | null |
2025-06-20 | MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation | Shoubin Yu et.al. | 2506.17113v1 | null |
2025-06-20 | Better Language Model Inversion by Compactly Representing Next-Token Distributions | Murtaza Nazir et.al. | 2506.17090v1 | null |
2025-06-20 | Quantum k-SAT Related Hypergraph Problems | Simon-Luca Kremer et.al. | 2506.17066v1 | null |
2025-06-20 | Flow-Based Non-stationary Temporal Regime Causal Structure Learning | Abdellah Rahmani et.al. | 2506.17065v1 | null |
2025-06-20 | Navigating the Deep: Signature Extraction on Deep Neural Networks | Haolin Liu et.al. | 2506.17047v1 | null |
2025-06-20 | Prmpt2Adpt: Prompt-Based Zero-Shot Domain Adaptation for Resource-Constrained Environments | Yasir Ali Farrukh et.al. | 2506.16994v1 | null |
2025-06-20 | ForestFormer3D: A Unified Framework for End-to-End Segmentation of Forest LiDAR 3D Point Clouds | Binbin Xiang et.al. | 2506.16991v1 | null |
2025-06-20 | Modeling and Visualization Reasoning for Stakeholders in Education and Industry Integration Systems: Research on Structured Synthetic Dialogue Data Generation Based on NIST Standards | Wei Meng et.al. | 2506.16952v1 | null |
2025-06-20 | LunarLoc: Segment-Based Global Localization on the Moon | Annika Thomas et.al. | 2506.16940v1 | null |
2025-06-20 | Multimodal Fused Learning for Solving the Generalized Traveling Salesman Problem in Robotic Task Planning | Jiaqi Chen et.al. | 2506.16931v1 | null |
2025-06-20 | Advancing Fact Attribution for Query Answering: Aggregate Queries and Novel Algorithms | Omer Abramovich et.al. | 2506.16923v1 | null |
2025-06-20 | FedFitTech: A Baseline in Federated Learning for Fitness Tracking | Zeyneddin Oz et.al. | 2506.16840v1 | null |
2025-06-20 | AnyTraverse: An off-road traversability framework with VLM and human operator in the loop | Sattwik Sahu et.al. | 2506.16826v1 | null |
2025-06-20 | Loupe: A Generalizable and Adaptive Framework for Image Forgery Detection | Yuchu Jiang et.al. | 2506.16819v1 | null |
2025-06-20 | Using SRv6 to access Edge Applications in 5G Networks | Louis Royer et.al. | 2506.16808v1 | null |
2025-06-20 | FOCUS: Unified Vision-Language Modeling for Interactive Editing Driven by Referential Segmentation | Fan Yang et.al. | 2506.16806v1 | null |
2025-06-20 | Temperature calibration of surface emissivities with an improved thermal image enhancement network | Ning Chu et.al. | 2506.16803v1 | null |
2025-06-20 | Quantum prime factorization algorithms using binary carry propagation | Arim Ryou et.al. | 2506.16799v1 | null |
2025-06-20 | Robust Dynamic Material Handling via Adaptive Constrained Evolutionary Reinforcement Learning | Chengpeng Hu et.al. | 2506.16795v1 | null |
2025-06-20 | TextBraTS: Text-Guided Volumetric Brain Tumor Segmentation with Innovative Dataset Development and Fusion Module Exploration | Xiaoyu Shi et.al. | 2506.16784v1 | null |
2025-06-20 | Class Agnostic Instance-level Descriptor for Visual Instance Search | Qi-Ying Sun et.al. | 2506.16745v1 | null |
2025-06-20 | Uncertainty-Aware Variational Information Pursuit for Interpretable Medical Image Analysis | Md Nahiduzzaman et.al. | 2506.16742v1 | null |
2025-06-20 | On Training-Test (Mis)alignment in Unsupervised Combinatorial Optimization: Observation, Empirical Exploration, and Analysis | Fanchen Bu et.al. | 2506.16732v1 | null |
2025-06-20 | TeSG: Textual Semantic Guidance for Infrared and Visible Image Fusion | Mingrui Zhu et.al. | 2506.16730v1 | null |
Semantic Segmentation
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Confidence Scoring for LLM-Generated SQL in Supply Chain Data Extraction | Jiekai Ma et.al. | 2506.17203v1 | null |
2025-06-20 | UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation | Teng Li et.al. | 2506.17202v1 | null |
2025-06-20 | CLEAR-3K: Assessing Causal Explanatory Capabilities in Language Models | Naiming Liu et.al. | 2506.17180v1 | null |
2025-06-20 | Codeword-Segmentation Rate-Splitting Multiple Access and Evaluation under Suboptimal Decoding | Sibo Zhang et.al. | 2506.17164v1 | null |
2025-06-20 | Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model | Side Liu et.al. | 2506.17162v1 | null |
2025-06-20 | Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation | Qing Xu et.al. | 2506.17159v1 | null |
2025-06-20 | A Note on Proper Relational Structures | Adam Bjorndahl et.al. | 2506.17142v1 | null |
2025-06-20 | Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations | Dongdong Meng et.al. | 2506.17136v1 | null |
2025-06-20 | Rapid and Continuous Trust Evaluation for Effective Task Collaboration Through Siamese Model | Botao Zhu et.al. | 2506.17128v1 | null |
2025-06-20 | Chain-of-Thought Prompting Obscures Hallucination Cues in Large Language Models: An Empirical Evaluation | Jiahao Cheng et.al. | 2506.17088v1 | null |
2025-06-20 | Flow-Based Non-stationary Temporal Regime Causal Structure Learning | Abdellah Rahmani et.al. | 2506.17065v1 | null |
2025-06-20 | Client Selection Strategies for Federated Semantic Communications in Heterogeneous IoT Networks | Samer Lahoud et.al. | 2506.17063v1 | null |
2025-06-20 | A Synthetic Benchmark for Collaborative 3D Semantic Occupancy Prediction in V2X Autonomous Driving | Hanlin Wu et.al. | 2506.17004v1 | null |
2025-06-20 | Prmpt2Adpt: Prompt-Based Zero-Shot Domain Adaptation for Resource-Constrained Environments | Yasir Ali Farrukh et.al. | 2506.16994v1 | null |
2025-06-20 | ForestFormer3D: A Unified Framework for End-to-End Segmentation of Forest LiDAR 3D Point Clouds | Binbin Xiang et.al. | 2506.16991v1 | null |
2025-06-20 | SmartGuard: Leveraging Large Language Models for Network Attack Detection through Audit Log Analysis and Summarization | Hao Zhang et.al. | 2506.16981v1 | null |
2025-06-20 | Visual-Instructed Degradation Diffusion for All-in-One Image Restoration | Wenyang Luo et.al. | 2506.16960v1 | null |
2025-06-20 | Modeling and Visualization Reasoning for Stakeholders in Education and Industry Integration Systems: Research on Structured Synthetic Dialogue Data Generation Based on NIST Standards | Wei Meng et.al. | 2506.16952v1 | null |
2025-06-20 | LunarLoc: Segment-Based Global Localization on the Moon | Annika Thomas et.al. | 2506.16940v1 | null |
2025-06-20 | LMQ-Sketch: Lagom Multi-Query Sketch for High-Rate Online Analytics | Martin Hilgendorf et.al. | 2506.16928v1 | null |
2025-06-20 | Hybrid-Sep: Language-queried audio source separation via pre-trained Model Fusion and Adversarial Diffusion Training | Jianyuan Feng et.al. | 2506.16833v1 | null |
2025-06-20 | AnyTraverse: An off-road traversability framework with VLM and human operator in the loop | Sattwik Sahu et.al. | 2506.16826v1 | null |
2025-06-20 | Predicting New Research Directions in Materials Science using Large Language Models and Concept Graphs | Thomas Marwitz et.al. | 2506.16824v1 | null |
2025-06-20 | Loupe: A Generalizable and Adaptive Framework for Image Forgery Detection | Yuchu Jiang et.al. | 2506.16819v1 | null |
2025-06-20 | Using SRv6 to access Edge Applications in 5G Networks | Louis Royer et.al. | 2506.16808v1 | null |
2025-06-20 | FOCUS: Unified Vision-Language Modeling for Interactive Editing Driven by Referential Segmentation | Fan Yang et.al. | 2506.16806v1 | null |
2025-06-20 | Temperature calibration of surface emissivities with an improved thermal image enhancement network | Ning Chu et.al. | 2506.16803v1 | null |
2025-06-20 | Seeing What Matters: Generalizable AI-generated Video Detection with Forensic-Oriented Augmentation | Riccardo Corvi et.al. | 2506.16802v1 | null |
2025-06-20 | RealSR-R1: Reinforcement Learning for Real-World Image Super-Resolution with Vision-Language Chain-of-Thought | Junbo Qiao et.al. | 2506.16796v1 | null |
2025-06-20 | MIST: Jailbreaking Black-box Large Language Models via Iterative Semantic Tuning | Muyang Zheng et.al. | 2506.16792v1 | null |
Object Detection
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et.al. | 2506.17219v1 | null |
2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et.al. | 2506.17218v1 | null |
2025-06-20 | Hierarchical constraints on gravitational waves from horizonless compact objects | Rajrupa Mondal et.al. | 2506.17215v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | Efficient Implementation of Multi-sensor Adaptive Birth Samplers for Labeled Random Finite Set Tracking | Jennifer Bondarchuk et.al. | 2506.17205v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | Operation and performance of the CMS silicon strip tracker with proton-proton collisions at the CERN LHC | CMS Collaboration et.al. | 2506.17195v1 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186v1 | null |
2025-06-20 | Variational Learning of Disentangled Representations | Yuli Slavutsky et.al. | 2506.17182v1 | null |
2025-06-20 | Scaling limits for sample autocovariance operators of Hilbert space-valued linear processes | Marie-Christine Düker et.al. | 2506.17168v1 | null |
2025-06-20 | Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model | Side Liu et.al. | 2506.17162v1 | null |
2025-06-20 | $^{50}$Cr and $^{53}$Cr neutron capture cross sections measurement at the n_TOF facility at CERN | P. Pérez-Maroto et.al. | 2506.17161v1 | null |
2025-06-20 | Fully Self-Consistent Semiclassical Gravity | R. Muciño et.al. | 2506.17149v1 | null |
2025-06-20 | On the Theory of Conditional Feature Alignment for Unsupervised Domain-Adaptive Counting | Zhuonan Liang et.al. | 2506.17137v1 | null |
2025-06-20 | RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking | Teng Guo et.al. | 2506.17119v1 | null |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110v1 | null |
2025-06-20 | Searching for a Hidden Markov Anomaly over Multiple Processes | Levli Citron et.al. | 2506.17108v1 | null |
2025-06-20 | Matroids, intersecting bases, and Borsuk property | Gyivan López-Campos et.al. | 2506.17082v1 | null |
2025-06-20 | Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion | Wang Zhao et.al. | 2506.17074v1 | null |
2025-06-20 | LLM-Based Bot Broadens the Range of Arguments in Online Discussions, Even When Transparently Disclosed as AI | Valeria Vuk et.al. | 2506.17073v1 | null |
2025-06-20 | Empowering Near-Field Communications in Low-Altitude Economy with LLM: Fundamentals, Potentials, Solutions, and Future Directions | Zhuo Xu et.al. | 2506.17067v1 | null |
2025-06-20 | Stretching Beyond the Obvious: A Gradient-Free Framework to Unveil the Hidden Landscape of Visual Invariance | Lorenzo Tausani et.al. | 2506.17040v1 | null |
2025-06-20 | Volumetric Parameterization for 3-Dimensional Simply-Connected Manifolds | Zhiyuan Lyu et.al. | 2506.17025v1 | null |
2025-06-20 | Theoretical modeling of QCD radiation in off-shell Higgs production through gluon fusion | Rafael Coelho Lopes de Sá et.al. | 2506.17022v1 | null |
2025-06-20 | Robust Reinforcement Learning for Discrete Compositional Generation via General Soft Operators | Marco Jiralerspong et.al. | 2506.17007v1 | null |
2025-06-20 | Directional Dark Field for Nanoscale Full-Field Transmission X-Ray Microscopy | Sami Wirtensohn et.al. | 2506.16998v1 | null |
2025-06-20 | Learning Accurate Whole-body Throwing with High-frequency Residual Policy and Pullback Tube Acceleration | Yuntao Ma et.al. | 2506.16986v1 | null |
2025-06-20 | Maximal Achievable Service Rates of Codes and Connections to Combinatorial Designs | Hoang Ly et.al. | 2506.16983v1 | null |
2025-06-20 | Performance studies of thin gas gap Resistive Plate Chamber prototypes with low Global Warming Potential gases for the ANUBIS experiment | Aashaq Shah et.al. | 2506.16948v1 | null |
2025-06-20 | Axion contribution to the mass-radius relation of neutron stars | Momchil Naydenov et.al. | 2506.16932v1 | null |
Keypoint Detection
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Class Agnostic Instance-level Descriptor for Visual Instance Search | Qi-Ying Sun et.al. | 2506.16745v1 | null |
2025-06-20 | The simplest chaos indicator derived from Lagrangian descriptors | Javier Jiménez-López et.al. | 2506.16660v1 | null |
2025-06-19 | How Far Can Off-the-Shelf Multimodal Large Language Models Go in Online Episodic Memory Question Answering? | Giuseppe Lando et.al. | 2506.16450v1 | null |
2025-06-19 | STAR-Pose: Efficient Low-Resolution Video Human Pose Estimation via Spatial-Temporal Adaptive Super-Resolution | Yucheng Jin et.al. | 2506.16061v1 | null |
2025-06-19 | Unveiling defect motifs in amorphous GeSe using machine learning interatomic potentials | Minseok Moon et.al. | 2506.15934v1 | null |
2025-06-18 | A new Surrogate Microstructure Generator for Porous Materials with Applications to the Buffer Layer of TRISO Nuclear Fuel Particles | Philipp Eisenhardt et.al. | 2506.15874v1 | null |
2025-06-18 | Descriptor-based Foundation Models for Molecular Property Prediction | Jackson Burns et.al. | 2506.15792v1 | null |
2025-06-18 | Maximizing solubility in rock salt high-entropy oxides | Matthew Furst et.al. | 2506.15604v1 | null |
2025-06-18 | MCOO-SLAM: A Multi-Camera Omnidirectional Object SLAM System | Miaoxin Pan et.al. | 2506.15402v1 | null |
2025-06-18 | High-Entropy Skutterudites as Thermoelectrics: Synthesizability and Band Convergence via the Cocktail Effect | Jose J. Plata et.al. | 2506.15324v1 | null |
2025-06-18 | SHeRLoc: Synchronized Heterogeneous Radar Place Recognition for Cross-Modal Localization | Hanjun Kim et.al. | 2506.15175v1 | null |
2025-06-18 | Enhancing point cloud analysis via neighbor aggregation correction based on cross-stage structure correlation | Jiaqi Shi et.al. | 2506.15160v1 | link |
2025-06-18 | VIMS: A Visual-Inertial-Magnetic-Sonar SLAM System in Underwater Environments | Bingbing Zhang et.al. | 2506.15126v1 | null |
2025-06-17 | Q2SAR: A Quantum Multiple Kernel Learning Approach for Drug Discovery | Alejandro Giraldo et.al. | 2506.14920v1 | null |
2025-06-17 | Cross-Modal Geometric Hierarchy Fusion: An Implicit-Submap Driven Framework for Resilient 3D Place Recognition | Xiaohui Jiang et.al. | 2506.14243v2 | link |
2025-06-17 | AMPLIFY: Actionless Motion Priors for Robot Learning from Videos | Jeremy A. Collins et.al. | 2506.14198v1 | null |
2025-06-17 | Compositional fluctuations and polymorph selection in crystallization of model soft colloids | Abhilasha Kumari et.al. | 2506.14109v1 | null |
2025-06-16 | AutoSAS: a new human-aside-the-loop paradigm for automated SAS fitting for high throughput and autonomous experimentation | Duncan R. Sutherland et.al. | 2506.13918v1 | null |
2025-06-16 | ATK: Automatic Task-driven Keypoint Selection for Robust Policy Learning | Yunchu Zhang et.al. | 2506.13867v1 | null |
2025-06-16 | Audio-Visual Driven Compression for Low-Bitrate Talking Head Videos | Riku Takahashi et.al. | 2506.13419v1 | null |
2025-06-16 | Quantitative Comparison of Fine-Tuning Techniques for Pretrained Latent Diffusion Models in the Generation of Unseen SAR Image Concepts | Solène Debuysère et.al. | 2506.13307v1 | null |
2025-06-16 | SuperPoint-SLAM3: Augmenting ORB-SLAM3 with Deep Features, Adaptive NMS, and Learning-Based Loop Closure | Shahram Najam Syed et.al. | 2506.13089v1 | link |
2025-06-16 | MAMMA: Markerless & Automatic Multi-Person Motion Action Capture | Hanz Cuevas-Velasquez et.al. | 2506.13040v1 | null |
2025-06-16 | DETRPose: Real-time end-to-end transformer model for multi-person pose estimation | Sebastian Janampa et.al. | 2506.13027v1 | link |
2025-06-15 | A large-scale, physically-based synthetic dataset for satellite pose estimation | Szabolcs Velkei et.al. | 2506.12782v1 | null |
2025-06-14 | Tailored ordering enables high-capacity cathode materials | Tzu-chen Liu et.al. | 2506.12545v2 | null |
2025-06-14 | Information fusion strategy integrating pre-trained language model and contrastive learning for materials knowledge mining | Yongqian Peng et.al. | 2506.12516v1 | null |
2025-06-13 | Interpretable representation learning of quantum data enabled by probabilistic variational autoencoders | Paulin de Schoulepnikoff et.al. | 2506.11982v2 | null |
2025-06-13 | Spectra-to-Structure and Structure-to-Spectra Inference Across the Periodic Table | Yufeng Wang et.al. | 2506.11908v1 | null |
2025-06-12 | A detailed and comprehensive account of fractional Physics-Informed Neural Networks: From implementation to efficiency | Donya Dabiri et.al. | 2506.11241v1 | null |
Object Tracking
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Emergent Temporal Correspondences from Video Diffusion Transformers | Jisu Nam et.al. | 2506.17220v1 | link |
2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et.al. | 2506.17219v1 | null |
2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et.al. | 2506.17218v1 | null |
2025-06-20 | Hierarchical constraints on gravitational waves from horizonless compact objects | Rajrupa Mondal et.al. | 2506.17215v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems | Matias Martinez et.al. | 2506.17208v1 | null |
2025-06-20 | Efficient Implementation of Multi-sensor Adaptive Birth Samplers for Labeled Random Finite Set Tracking | Jennifer Bondarchuk et.al. | 2506.17205v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | On Energy-Efficient Passive Beamforming Design of RIS-Assisted CoMP-NOMA Networks | Muhammad Umer et.al. | 2506.17189v1 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186v1 | null |
2025-06-20 | Variational Learning of Disentangled Representations | Yuli Slavutsky et.al. | 2506.17182v1 | null |
2025-06-20 | Scaling limits for sample autocovariance operators of Hilbert space-valued linear processes | Marie-Christine Düker et.al. | 2506.17168v1 | null |
2025-06-20 | Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model | Side Liu et.al. | 2506.17162v1 | null |
2025-06-20 | Fully Self-Consistent Semiclassical Gravity | R. Muciño et.al. | 2506.17149v1 | null |
2025-06-20 | On the Theory of Conditional Feature Alignment for Unsupervised Domain-Adaptive Counting | Zhuonan Liang et.al. | 2506.17137v1 | null |
2025-06-20 | RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking | Teng Guo et.al. | 2506.17119v1 | null |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110v1 | null |
2025-06-20 | Searching for a Hidden Markov Anomaly over Multiple Processes | Levli Citron et.al. | 2506.17108v1 | null |
2025-06-20 | Matroids, intersecting bases, and Borsuk property | Gyivan López-Campos et.al. | 2506.17082v1 | null |
2025-06-20 | Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion | Wang Zhao et.al. | 2506.17074v1 | null |
2025-06-20 | LLM-Based Bot Broadens the Range of Arguments in Online Discussions, Even When Transparently Disclosed as AI | Valeria Vuk et.al. | 2506.17073v1 | null |
2025-06-20 | Empowering Near-Field Communications in Low-Altitude Economy with LLM: Fundamentals, Potentials, Solutions, and Future Directions | Zhuo Xu et.al. | 2506.17067v1 | null |
2025-06-20 | Stretching Beyond the Obvious: A Gradient-Free Framework to Unveil the Hidden Landscape of Visual Invariance | Lorenzo Tausani et.al. | 2506.17040v1 | null |
2025-06-20 | Volumetric Parameterization for 3-Dimensional Simply-Connected Manifolds | Zhiyuan Lyu et.al. | 2506.17025v1 | null |
2025-06-20 | Instituto de Telecomunicações at IWSLT 2025: Aligning Small-Scale Speech and Language Models for Speech-to-Text Learning | Giuseppe Attanasio et.al. | 2506.17019v1 | null |
2025-06-20 | Robust Reinforcement Learning for Discrete Compositional Generation via General Soft Operators | Marco Jiralerspong et.al. | 2506.17007v1 | null |
2025-06-20 | Trajectory tracking control of USV with actuator constraints in the presence of disturbances | Ram Milan Kumar Verma et.al. | 2506.17005v1 | null |
2025-06-20 | Directional Dark Field for Nanoscale Full-Field Transmission X-Ray Microscopy | Sami Wirtensohn et.al. | 2506.16998v1 | null |
2025-06-20 | Identifying Explanation Needs: Towards a Catalog of User-based Indicators | Hannah Deters et.al. | 2506.16997v1 | null |
2025-06-20 | Learning Accurate Whole-body Throwing with High-frequency Residual Policy and Pullback Tube Acceleration | Yuntao Ma et.al. | 2506.16986v1 | null |
Talking Faces
Talking Faces
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Feedback cooling scheme for an optically levitated oscillator with controlled cross-talk | J. M. H. Gosling et.al. | 2506.17172v1 | null |
2025-06-20 | Shadow in the Galactic Center: Theoretical Concept -- Prediction -- Realization | Alexander F. Zakharov et.al. | 2506.16927v1 | null |
2025-06-19 | Do We Talk to Robots Like Therapists, and Do They Respond Accordingly? Language Alignment in AI Emotional Support | Sophie Chiang et.al. | 2506.16473v1 | null |
2025-06-19 | Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models | Biao Yi et.al. | 2506.16447v1 | null |
2025-06-19 | Optimizing Multilingual Text-To-Speech with Accents & Emotions | Pranav Pawar et.al. | 2506.16310v1 | null |
2025-06-17 | Thinking in Directivity: Speech Large Language Model for Multi-Talker Directional Speech Recognition | Jiamin Xie et.al. | 2506.14973v1 | null |
2025-06-17 | SyncTalk++: High-Fidelity and Efficient Synchronized Talking Heads Synthesis Using Gaussian Splatting | Ziqiao Peng et.al. | 2506.14742v1 | null |
2025-06-17 | Anomalous diffusion for mass transport phenomena II: Subdiffusion in polydimethylsiloxane (PDMS) | Nathaniel G. Hermann et.al. | 2506.14600v1 | null |
2025-06-17 | Compressed Video Super-Resolution based on Hierarchical Encoding | Yuxuan Jiang et.al. | 2506.14381v1 | null |
2025-06-16 | Audio-Visual Driven Compression for Low-Bitrate Talking Head Videos | Riku Takahashi et.al. | 2506.13419v1 | null |
2025-06-16 | Theoretical Summary: Moriond QCD and High-Energy Interactions 2025 | Peter Skands et.al. | 2506.13338v2 | null |
2025-06-16 | Testing the quantum nature of gravity through interferometry | Yubao Liu et.al. | 2506.13085v2 | null |
2025-06-13 | ICME 2025 Grand Challenge on Video Super-Resolution for Video Conferencing | Babak Naderi et.al. | 2506.12269v1 | link |
2025-06-13 | Because we have LLMs, we Can and Should Pursue Agentic Interpretability | Been Kim et.al. | 2506.12152v1 | null |
2025-06-13 | Technical Evaluation of a Disruptive Approach in Homomorphic AI | Eric Filiol et.al. | 2506.11954v1 | null |
2025-06-09 | Seeing Voices: Generating A-Roll Video from Audio with Mirage | Aditi Sundararaman et.al. | 2506.08279v1 | null |
2025-06-07 | High count rate effects in event processing for XRISM/Resolve x-ray microcalorimeter: II. Energy scale and resolution in orbit | Misaki Mizumoto et.al. | 2506.06692v1 | null |
2025-06-06 | Initial stage jet momentum broadening in tBLFQ formalism | Dana Avramescu et.al. | 2506.06206v1 | null |
2025-06-06 | Information Bargaining: Bilateral Commitment in Bayesian Persuasion | Yue Lin et.al. | 2506.05876v2 | link |
2025-06-05 | Can LLMs Talk 'Sex'? Exploring How AI Models Handle Intimate Conversations | Huiqian Lai et.al. | 2506.05514v1 | null |
2025-06-05 | Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games | Niv Eckhaus et.al. | 2506.05309v1 | link |
2025-06-05 | Galactic Science -- Rapporteur Talk of the 8th Heidelberg International Symposium on High Energy Gamma Ray Astronomy | Sandro Mereghetti et.al. | 2506.04729v1 | null |
2025-06-04 | High Accuracy, Less Talk (HALT): Reliable LLMs through Capability-Aligned Finetuning | Tim Franzmeyer et.al. | 2506.04051v1 | null |
2025-06-04 | VChatter: Exploring Generative Conversational Agents for Simulating Exposure Therapy to Reduce Social Anxiety | Han Zhang et.al. | 2506.03520v1 | null |
2025-06-03 | Recent results from MicroBooNE | Holly B. Parkinson et.al. | 2506.03376v1 | null |
2025-06-03 | TDCOSMO 2025: Cosmological constraints from strong lensing time delays | TDCOSMO Collaboration et.al. | 2506.03023v2 | null |
2025-06-03 | NTIRE 2025 XGC Quality Assessment Challenge: Methods and Results | Xiaohong Liu et.al. | 2506.02875v1 | null |
2025-06-02 | Cocktail-Party Audio-Visual Speech Recognition | Thai-Binh Nguyen et.al. | 2506.02178v1 | null |
2025-06-02 | Low-Rank Head Avatar Personalization with Registers | Sai Tanmay Reddy Chakkera et.al. | 2506.01935v1 | null |
2025-06-02 | Greening AI-enabled Systems with Software Engineering: A Research Agenda for Environmentally Sustainable AI Practices | Luís Cruz et.al. | 2506.01774v2 | null |
Face Reenactment
Face Reenactment
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition | Jiaqi Li et.al. | 2506.17201v1 | null |
2025-06-20 | Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations | Dongdong Meng et.al. | 2506.17136v1 | null |
2025-06-20 | Empowering Near-Field Communications in Low-Altitude Economy with LLM: Fundamentals, Potentials, Solutions, and Future Directions | Zhuo Xu et.al. | 2506.17067v1 | null |
2025-06-20 | Great Restraining Wall in Multidimentional Collective Variable Space | Zhijun Pan et.al. | 2506.17043v1 | null |
2025-06-20 | Scalable and Reliable Multi-agent Reinforcement Learning for Traffic Assignment | Leizhen Wang et.al. | 2506.17029v1 | null |
2025-06-20 | RS-Coded Adaptive Dynamic Network for Reliable Long-Term Information Transmission in Disturbed Multimode Fiber | Yang Hu et.al. | 2506.16859v1 | null |
2025-06-20 | Controllable and Expressive One-Shot Video Head Swapping | Chaonan Ji et.al. | 2506.16852v1 | null |
2025-06-20 | Hybrid-Sep: Language-queried audio source separation via pre-trained Model Fusion and Adversarial Diffusion Training | Jianyuan Feng et.al. | 2506.16833v1 | null |
2025-06-20 | Integrating Traditional Technical Analysis with AI: A Multi-Agent LLM-Based Approach to Stock Market Forecasting | Michał Wawer et.al. | 2506.16813v1 | null |
2025-06-20 | Temperature calibration of surface emissivities with an improved thermal image enhancement network | Ning Chu et.al. | 2506.16803v1 | null |
2025-06-20 | Quantum prime factorization algorithms using binary carry propagation | Arim Ryou et.al. | 2506.16799v1 | null |
2025-06-20 | Quadratic estimates for the $H^\infty$-functional calculus of bisectorial Clifford operators | Fabrizio Colombo et.al. | 2506.16783v1 | null |
2025-06-20 | Reinforcement learning for hybrid charging stations planning and operation considering fixed and mobile chargers | Yanchen Zhu et.al. | 2506.16764v1 | null |
2025-06-19 | Data marketplaces can increase the willingness to share social media data at low prices | Meysam Alizadeh et.al. | 2506.16618v1 | null |
2025-06-19 | SparseDPD: A Sparse Neural Network-based Digital Predistortion FPGA Accelerator for RF Power Amplifier Linearization | Manno Versluis et.al. | 2506.16591v1 | null |
2025-06-19 | AI-Driven Tools in Modern Software Quality Assurance: An Assessment of Benefits, Challenges, and Future Directions | Ihor Pysmennyi et.al. | 2506.16586v1 | null |
2025-06-19 | SafeTriage: Facial Video De-identification for Privacy-Preserving Stroke Triage | Tongan Cai et.al. | 2506.16578v1 | null |
2025-06-19 | Crystal Nucleation Kinetics and Mechanism: Influence of Interaction Potential | Porhouy Minh et.al. | 2506.16541v1 | null |
2025-06-19 | Spotting tell-tale visual artifacts in face swapping videos: strengths and pitfalls of CNN detectors | Riccardo Ziglio et.al. | 2506.16497v1 | null |
2025-06-19 | Grounding Language Models with Semantic Digital Twins for Robotic Planning | Mehreen Naeem et.al. | 2506.16493v1 | null |
2025-06-19 | Do We Talk to Robots Like Therapists, and Do They Respond Accordingly? Language Alignment in AI Emotional Support | Sophie Chiang et.al. | 2506.16473v1 | null |
2025-06-19 | Scientific Applications Leveraging Randomized Linear Algebra | Vivak Patel et.al. | 2506.16457v1 | null |
2025-06-19 | REIS: A High-Performance and Energy-Efficient Retrieval System with In-Storage Processing | Kangqi Chen et.al. | 2506.16444v1 | null |
2025-06-19 | OJBench: A Competition Level Code Benchmark For Large Language Models | Zhexu Wang et.al. | 2506.16395v1 | null |
2025-06-19 | RiOT: Efficient Prompt Refinement with Residual Optimization Tree | Chenyi Zhou et.al. | 2506.16389v1 | null |
2025-06-19 | HausaNLP at SemEval-2025 Task 11: Advancing Hausa Text-based Emotion Detection | Sani Abdullahi Sani et.al. | 2506.16388v1 | null |
2025-06-19 | AGC-Drive: A Large-Scale Dataset for Real-World Aerial-Ground Collaboration in Driving Scenarios | Yunhao Hou et.al. | 2506.16371v1 | null |
2025-06-19 | Can structural correspondences ground real world representational content in Large Language Models? | Iwan Williams et.al. | 2506.16370v1 | null |
2025-06-19 | The many faces of rotating quantum turbulence | Julian Amette Estrada et.al. | 2506.16358v1 | null |
2025-06-19 | Bayesian Optimization over Bounded Domains with the Beta Product Kernel | Huy Hoang Nguyen et.al. | 2506.16316v1 | null |
Federated Learning
Asynchronous
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Client Selection Strategies for Federated Semantic Communications in Heterogeneous IoT Networks | Samer Lahoud et.al. | 2506.17063v1 | null |
2025-06-20 | FedFitTech: A Baseline in Federated Learning for Fitness Tracking | Zeyneddin Oz et.al. | 2506.16840v1 | null |
2025-06-20 | Incentivizing High-quality Participation From Federated Learning Agents | Jinlong Pang et.al. | 2506.16731v1 | null |
2025-06-20 | TriCon-SF: A Triple-Shuffle and Contribution-Aware Serial Federated Learning Framework for Heterogeneous Healthcare Data | Yuping Yan et.al. | 2506.16723v1 | null |
2025-06-19 | FLAME: Towards Federated Fine-Tuning Large Language Models Through Adaptive SMoE | Khiem Le et.al. | 2506.16600v1 | null |
2025-06-19 | Teaching Complex Systems based on Microservices | Renato Cordeiro Ferreira et.al. | 2506.16492v1 | null |
2025-06-19 | SecureFed: A Two-Phase Framework for Detecting Malicious Clients in Federated Learning | Likhitha Annapurna Kavuri et.al. | 2506.16458v1 | null |
2025-06-19 | FOCoOp: Enhancing Out-of-Distribution Robustness in Federated Prompt Learning for Vision-Language Models | Xinting Liao et.al. | 2506.16218v1 | null |
2025-06-19 | Test beam measurements and computer simulations of the ATLAS ITk R2 silicon strip detector | Jan-Hendrik Arling et.al. | 2506.16053v1 | null |
2025-06-19 | Leveraging Optimal Transport for Distributed Two-Sample Testing: An Integrated Transportation Distance-based Framework | Zhengqi Lin et.al. | 2506.16047v1 | null |
2025-06-18 | PNCS:Power-Norm Cosine Similarity for Diverse Client Selection in Federated Learning | Liangyan Li et.al. | 2506.15923v1 | null |
2025-06-18 | Heterogeneous Federated Reinforcement Learning Using Wasserstein Barycenters | Luiz Pereira et.al. | 2506.15825v1 | null |
2025-06-18 | Federated Learning for MRI-based BrainAGE: a multicenter study on post-stroke functional outcome prediction | Vincent Roca et.al. | 2506.15626v2 | null |
2025-06-18 | FedWSIDD: Federated Whole Slide Image Classification via Dataset Distillation | Haolong Jin et.al. | 2506.15365v1 | link |
2025-06-18 | Centroid Approximation for Byzantine-Tolerant Federated Learning | Mélanie Cambus et.al. | 2506.15264v1 | null |
2025-06-18 | Accessible Gesture-Driven Augmented Reality Interaction System | Yikan Wang et.al. | 2506.15189v1 | null |
2025-06-17 | FedOne: Query-Efficient Federated Learning for Black-box Discrete Prompt Learning | Ganyu Wang et.al. | 2506.14929v1 | link |
2025-06-17 | How many federal employees are not satisfied? Using response times to estimate population proportions under the survey variable cause model | Jonathan Auerbach et.al. | 2506.14915v1 | null |
2025-06-17 | Event-Driven Online Vertical Federated Learning | Ganyu Wang et.al. | 2506.14911v1 | null |
2025-06-17 | Now More Than Ever, Foundational AI Research and Infrastructure Depends on the Federal Government | Michela Taufer et.al. | 2506.14679v1 | null |
2025-06-17 | Knowledge Adaptation as Posterior Correction | Mohammad Emtiyaz Khan et.al. | 2506.14262v1 | null |
2025-06-17 | Convergence-Privacy-Fairness Trade-Off in Personalized Federated Learning | Xiyu Zhao et.al. | 2506.14251v1 | null |
2025-06-17 | A Comprehensive Survey on Underwater Acoustic Target Positioning and Tracking: Progress, Challenges, and Perspectives | Zhong Yang et.al. | 2506.14165v1 | null |
2025-06-16 | PeakWeather: MeteoSwiss Weather Station Measurements for Spatiotemporal Deep Learning | Daniele Zambon et.al. | 2506.13652v1 | link |
2025-06-16 | EBS-CFL: Efficient and Byzantine-robust Secure Clustered Federated Learning | Zhiqiang Li et.al. | 2506.13612v1 | link |
2025-06-16 | Perfect Privacy for Discriminator-Based Byzantine-Resilient Federated Learning | Yue Xia et.al. | 2506.13561v1 | null |
2025-06-16 | A Two-stage Optimization Method for Wide-range Single-electron Quantum Magnetic Sensing | Shiqian Guo et.al. | 2506.13469v1 | null |
2025-06-16 | Federated ADMM from Bayesian Duality | Thomas Möllenhoff et.al. | 2506.13150v1 | link |
2025-06-15 | Does the Expansion of Medicaid Lead to Income Adjustment -- Evidence from SIPP | Mingjian Li et.al. | 2506.12976v1 | null |
2025-06-15 | Privacy-Preserving Federated Learning against Malicious Clients Based on Verifiable Functional Encryption | Nina Cai et.al. | 2506.12846v1 | null |
Benchmark
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Client Selection Strategies for Federated Semantic Communications in Heterogeneous IoT Networks | Samer Lahoud et.al. | 2506.17063v1 | null |
2025-06-20 | FedFitTech: A Baseline in Federated Learning for Fitness Tracking | Zeyneddin Oz et.al. | 2506.16840v1 | null |
2025-06-20 | Incentivizing High-quality Participation From Federated Learning Agents | Jinlong Pang et.al. | 2506.16731v1 | null |
2025-06-20 | TriCon-SF: A Triple-Shuffle and Contribution-Aware Serial Federated Learning Framework for Heterogeneous Healthcare Data | Yuping Yan et.al. | 2506.16723v1 | null |
2025-06-19 | FLAME: Towards Federated Fine-Tuning Large Language Models Through Adaptive SMoE | Khiem Le et.al. | 2506.16600v1 | null |
2025-06-19 | Teaching Complex Systems based on Microservices | Renato Cordeiro Ferreira et.al. | 2506.16492v1 | null |
2025-06-19 | SecureFed: A Two-Phase Framework for Detecting Malicious Clients in Federated Learning | Likhitha Annapurna Kavuri et.al. | 2506.16458v1 | null |
2025-06-19 | FOCoOp: Enhancing Out-of-Distribution Robustness in Federated Prompt Learning for Vision-Language Models | Xinting Liao et.al. | 2506.16218v1 | null |
2025-06-19 | Test beam measurements and computer simulations of the ATLAS ITk R2 silicon strip detector | Jan-Hendrik Arling et.al. | 2506.16053v1 | null |
2025-06-19 | Leveraging Optimal Transport for Distributed Two-Sample Testing: An Integrated Transportation Distance-based Framework | Zhengqi Lin et.al. | 2506.16047v1 | null |
2025-06-18 | PNCS:Power-Norm Cosine Similarity for Diverse Client Selection in Federated Learning | Liangyan Li et.al. | 2506.15923v1 | null |
2025-06-18 | Heterogeneous Federated Reinforcement Learning Using Wasserstein Barycenters | Luiz Pereira et.al. | 2506.15825v1 | null |
2025-06-18 | Federated Learning for MRI-based BrainAGE: a multicenter study on post-stroke functional outcome prediction | Vincent Roca et.al. | 2506.15626v2 | null |
2025-06-18 | FedWSIDD: Federated Whole Slide Image Classification via Dataset Distillation | Haolong Jin et.al. | 2506.15365v1 | link |
2025-06-18 | Centroid Approximation for Byzantine-Tolerant Federated Learning | Mélanie Cambus et.al. | 2506.15264v1 | null |
2025-06-18 | Accessible Gesture-Driven Augmented Reality Interaction System | Yikan Wang et.al. | 2506.15189v1 | null |
2025-06-17 | FedOne: Query-Efficient Federated Learning for Black-box Discrete Prompt Learning | Ganyu Wang et.al. | 2506.14929v1 | link |
2025-06-17 | How many federal employees are not satisfied? Using response times to estimate population proportions under the survey variable cause model | Jonathan Auerbach et.al. | 2506.14915v1 | null |
2025-06-17 | Event-Driven Online Vertical Federated Learning | Ganyu Wang et.al. | 2506.14911v1 | null |
2025-06-17 | Now More Than Ever, Foundational AI Research and Infrastructure Depends on the Federal Government | Michela Taufer et.al. | 2506.14679v1 | null |
2025-06-17 | Knowledge Adaptation as Posterior Correction | Mohammad Emtiyaz Khan et.al. | 2506.14262v1 | null |
2025-06-17 | Convergence-Privacy-Fairness Trade-Off in Personalized Federated Learning | Xiyu Zhao et.al. | 2506.14251v1 | null |
2025-06-17 | A Comprehensive Survey on Underwater Acoustic Target Positioning and Tracking: Progress, Challenges, and Perspectives | Zhong Yang et.al. | 2506.14165v1 | null |
2025-06-16 | PeakWeather: MeteoSwiss Weather Station Measurements for Spatiotemporal Deep Learning | Daniele Zambon et.al. | 2506.13652v1 | link |
2025-06-16 | EBS-CFL: Efficient and Byzantine-robust Secure Clustered Federated Learning | Zhiqiang Li et.al. | 2506.13612v1 | link |
2025-06-16 | Perfect Privacy for Discriminator-Based Byzantine-Resilient Federated Learning | Yue Xia et.al. | 2506.13561v1 | null |
2025-06-16 | A Two-stage Optimization Method for Wide-range Single-electron Quantum Magnetic Sensing | Shiqian Guo et.al. | 2506.13469v1 | null |
2025-06-16 | Federated ADMM from Bayesian Duality | Thomas Möllenhoff et.al. | 2506.13150v1 | link |
2025-06-15 | Does the Expansion of Medicaid Lead to Income Adjustment -- Evidence from SIPP | Mingjian Li et.al. | 2506.12976v1 | null |
2025-06-15 | Privacy-Preserving Federated Learning against Malicious Clients Based on Verifiable Functional Encryption | Nina Cai et.al. | 2506.12846v1 | null |
Optimization
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Client Selection Strategies for Federated Semantic Communications in Heterogeneous IoT Networks | Samer Lahoud et.al. | 2506.17063v1 | null |
2025-06-20 | FedFitTech: A Baseline in Federated Learning for Fitness Tracking | Zeyneddin Oz et.al. | 2506.16840v1 | null |
2025-06-20 | Incentivizing High-quality Participation From Federated Learning Agents | Jinlong Pang et.al. | 2506.16731v1 | null |
2025-06-20 | TriCon-SF: A Triple-Shuffle and Contribution-Aware Serial Federated Learning Framework for Heterogeneous Healthcare Data | Yuping Yan et.al. | 2506.16723v1 | null |
2025-06-19 | FLAME: Towards Federated Fine-Tuning Large Language Models Through Adaptive SMoE | Khiem Le et.al. | 2506.16600v1 | null |
2025-06-19 | Teaching Complex Systems based on Microservices | Renato Cordeiro Ferreira et.al. | 2506.16492v1 | null |
2025-06-19 | SecureFed: A Two-Phase Framework for Detecting Malicious Clients in Federated Learning | Likhitha Annapurna Kavuri et.al. | 2506.16458v1 | null |
2025-06-19 | FOCoOp: Enhancing Out-of-Distribution Robustness in Federated Prompt Learning for Vision-Language Models | Xinting Liao et.al. | 2506.16218v1 | null |
2025-06-19 | Test beam measurements and computer simulations of the ATLAS ITk R2 silicon strip detector | Jan-Hendrik Arling et.al. | 2506.16053v1 | null |
2025-06-19 | Leveraging Optimal Transport for Distributed Two-Sample Testing: An Integrated Transportation Distance-based Framework | Zhengqi Lin et.al. | 2506.16047v1 | null |
2025-06-18 | PNCS:Power-Norm Cosine Similarity for Diverse Client Selection in Federated Learning | Liangyan Li et.al. | 2506.15923v1 | null |
2025-06-18 | Heterogeneous Federated Reinforcement Learning Using Wasserstein Barycenters | Luiz Pereira et.al. | 2506.15825v1 | null |
2025-06-18 | Federated Learning for MRI-based BrainAGE: a multicenter study on post-stroke functional outcome prediction | Vincent Roca et.al. | 2506.15626v2 | null |
2025-06-18 | FedWSIDD: Federated Whole Slide Image Classification via Dataset Distillation | Haolong Jin et.al. | 2506.15365v1 | link |
2025-06-18 | Centroid Approximation for Byzantine-Tolerant Federated Learning | Mélanie Cambus et.al. | 2506.15264v1 | null |
2025-06-18 | Accessible Gesture-Driven Augmented Reality Interaction System | Yikan Wang et.al. | 2506.15189v1 | null |
2025-06-17 | FedOne: Query-Efficient Federated Learning for Black-box Discrete Prompt Learning | Ganyu Wang et.al. | 2506.14929v1 | link |
2025-06-17 | How many federal employees are not satisfied? Using response times to estimate population proportions under the survey variable cause model | Jonathan Auerbach et.al. | 2506.14915v1 | null |
2025-06-17 | Event-Driven Online Vertical Federated Learning | Ganyu Wang et.al. | 2506.14911v1 | null |
2025-06-17 | Now More Than Ever, Foundational AI Research and Infrastructure Depends on the Federal Government | Michela Taufer et.al. | 2506.14679v1 | null |
2025-06-17 | Knowledge Adaptation as Posterior Correction | Mohammad Emtiyaz Khan et.al. | 2506.14262v1 | null |
2025-06-17 | Convergence-Privacy-Fairness Trade-Off in Personalized Federated Learning | Xiyu Zhao et.al. | 2506.14251v1 | null |
2025-06-17 | A Comprehensive Survey on Underwater Acoustic Target Positioning and Tracking: Progress, Challenges, and Perspectives | Zhong Yang et.al. | 2506.14165v1 | null |
2025-06-16 | PeakWeather: MeteoSwiss Weather Station Measurements for Spatiotemporal Deep Learning | Daniele Zambon et.al. | 2506.13652v1 | link |
2025-06-16 | EBS-CFL: Efficient and Byzantine-robust Secure Clustered Federated Learning | Zhiqiang Li et.al. | 2506.13612v1 | link |
2025-06-16 | Perfect Privacy for Discriminator-Based Byzantine-Resilient Federated Learning | Yue Xia et.al. | 2506.13561v1 | null |
2025-06-16 | A Two-stage Optimization Method for Wide-range Single-electron Quantum Magnetic Sensing | Shiqian Guo et.al. | 2506.13469v1 | null |
2025-06-16 | Federated ADMM from Bayesian Duality | Thomas Möllenhoff et.al. | 2506.13150v1 | link |
2025-06-15 | Does the Expansion of Medicaid Lead to Income Adjustment -- Evidence from SIPP | Mingjian Li et.al. | 2506.12976v1 | null |
2025-06-15 | Privacy-Preserving Federated Learning against Malicious Clients Based on Verifiable Functional Encryption | Nina Cai et.al. | 2506.12846v1 | null |
Personalized
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | A Common Pool of Privacy Problems: Legal and Technical Lessons from a Large-Scale Web-Scraped Machine Learning Dataset | Rachel Hong et.al. | 2506.17185v1 | null |
2025-06-20 | Walking Fingerprinting Using Wrist Accelerometry During Activities of Daily Living in NHANES | Lily Koffman et.al. | 2506.17160v1 | null |
2025-06-20 | The fundamental problem of risk prediction for individuals: health AI, uncertainty, and personalized medicine | Lasai Barreñada et.al. | 2506.17141v1 | null |
2025-06-20 | Client Selection Strategies for Federated Semantic Communications in Heterogeneous IoT Networks | Samer Lahoud et.al. | 2506.17063v1 | null |
2025-06-20 | PersonalAI: Towards digital twins in the graph form | Mikhail Menschikov et.al. | 2506.17001v1 | null |
2025-06-20 | Multi-Objective Recommendation in the Era of Generative AI: A Survey of Recent Progress and Future Prospects | Zihan Hong et.al. | 2506.16893v1 | null |
2025-06-20 | Tracker Installations Are Not Created Equal: Understanding Tracker Configuration of Form Data Collection | Julia B. Kieserman et.al. | 2506.16891v1 | null |
2025-06-20 | Exploring the Usage of Generative AI for Group Project-Based Offline Art Courses in Elementary Schools | Zhiqing Wang et.al. | 2506.16874v1 | null |
2025-06-20 | FedFitTech: A Baseline in Federated Learning for Fitness Tracking | Zeyneddin Oz et.al. | 2506.16840v1 | null |
2025-06-20 | Incentivizing High-quality Participation From Federated Learning Agents | Jinlong Pang et.al. | 2506.16731v1 | null |
2025-06-20 | TriCon-SF: A Triple-Shuffle and Contribution-Aware Serial Federated Learning Framework for Heterogeneous Healthcare Data | Yuping Yan et.al. | 2506.16723v1 | null |
2025-06-19 | FLAME: Towards Federated Fine-Tuning Large Language Models Through Adaptive SMoE | Khiem Le et.al. | 2506.16600v1 | null |
2025-06-19 | Mr. Snuffleupagus at SemEval-2025 Task 4: Unlearning Factual Knowledge from LLMs Using Adaptive RMU | Arjun Dosajh et.al. | 2506.16548v1 | null |
2025-06-19 | Manifold Learning for Personalized and Label-Free Detection of Cardiac Arrhythmias | Amir Reza Vazifeh et.al. | 2506.16494v1 | null |
2025-06-19 | Teaching Complex Systems based on Microservices | Renato Cordeiro Ferreira et.al. | 2506.16492v1 | null |
2025-06-19 | Two-Person Cooperative Games with delta-Rationality | Fang-Fang Tang et.al. | 2506.16465v1 | null |
2025-06-19 | SecureFed: A Two-Phase Framework for Detecting Malicious Clients in Federated Learning | Likhitha Annapurna Kavuri et.al. | 2506.16458v1 | null |
2025-06-19 | Leave No One Undermined: Policy Targeting with Regret Aversion | Toru Kitagawa et.al. | 2506.16430v1 | null |
2025-06-19 | Unpacking Generative AI in Education: Computational Modeling of Teacher and Student Perspectives in Social Media Discourse | Paulina DeVito et.al. | 2506.16412v1 | null |
2025-06-19 | Probabilistic Collision Risk Estimation for Pedestrian Navigation | Amine Tourki et.al. | 2506.16219v1 | null |
2025-06-19 | FOCoOp: Enhancing Out-of-Distribution Robustness in Federated Prompt Learning for Vision-Language Models | Xinting Liao et.al. | 2506.16218v1 | null |
2025-06-19 | Align the GAP: Prior-based Unified Multi-Task Remote Physiological Measurement Framework For Domain Generalization and Personalization | Jiyao Wang et.al. | 2506.16160v1 | null |
2025-06-19 | Enhanced Dermatology Image Quality Assessment via Cross-Domain Training | Ignacio Hernández Montilla et.al. | 2506.16116v1 | null |
2025-06-19 | A Brain-to-Population Graph Learning Framework for Diagnosing Brain Disorders | Qianqian Liao et.al. | 2506.16096v1 | null |
2025-06-19 | Test beam measurements and computer simulations of the ATLAS ITk R2 silicon strip detector | Jan-Hendrik Arling et.al. | 2506.16053v1 | null |
2025-06-19 | Leveraging Optimal Transport for Distributed Two-Sample Testing: An Integrated Transportation Distance-based Framework | Zhengqi Lin et.al. | 2506.16047v1 | null |
2025-06-19 | SEP-GCN: Leveraging Similar Edge Pairs with Temporal and Spatial Contexts for Location-Based Recommender Systems | Tan Loc Nguyen et.al. | 2506.16003v1 | null |
2025-06-19 | On the optimal regret of collaborative personalized linear bandits | Bruce Huang et.al. | 2506.15943v1 | null |
2025-06-19 | Exploring Big Five Personality and AI Capability Effects in LLM-Simulated Negotiation Dialogues | Myke C. Cohen et.al. | 2506.15928v1 | null |
2025-06-18 | PNCS:Power-Norm Cosine Similarity for Diverse Client Selection in Federated Learning | Liangyan Li et.al. | 2506.15923v1 | null |
Heterogeneous
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Client Selection Strategies for Federated Semantic Communications in Heterogeneous IoT Networks | Samer Lahoud et.al. | 2506.17063v1 | null |
2025-06-20 | FedFitTech: A Baseline in Federated Learning for Fitness Tracking | Zeyneddin Oz et.al. | 2506.16840v1 | null |
2025-06-20 | Incentivizing High-quality Participation From Federated Learning Agents | Jinlong Pang et.al. | 2506.16731v1 | null |
2025-06-20 | TriCon-SF: A Triple-Shuffle and Contribution-Aware Serial Federated Learning Framework for Heterogeneous Healthcare Data | Yuping Yan et.al. | 2506.16723v1 | null |
2025-06-19 | FLAME: Towards Federated Fine-Tuning Large Language Models Through Adaptive SMoE | Khiem Le et.al. | 2506.16600v1 | null |
2025-06-19 | Teaching Complex Systems based on Microservices | Renato Cordeiro Ferreira et.al. | 2506.16492v1 | null |
2025-06-19 | SecureFed: A Two-Phase Framework for Detecting Malicious Clients in Federated Learning | Likhitha Annapurna Kavuri et.al. | 2506.16458v1 | null |
2025-06-19 | FOCoOp: Enhancing Out-of-Distribution Robustness in Federated Prompt Learning for Vision-Language Models | Xinting Liao et.al. | 2506.16218v1 | null |
2025-06-19 | Test beam measurements and computer simulations of the ATLAS ITk R2 silicon strip detector | Jan-Hendrik Arling et.al. | 2506.16053v1 | null |
2025-06-19 | Leveraging Optimal Transport for Distributed Two-Sample Testing: An Integrated Transportation Distance-based Framework | Zhengqi Lin et.al. | 2506.16047v1 | null |
2025-06-18 | PNCS:Power-Norm Cosine Similarity for Diverse Client Selection in Federated Learning | Liangyan Li et.al. | 2506.15923v1 | null |
2025-06-18 | Heterogeneous Federated Reinforcement Learning Using Wasserstein Barycenters | Luiz Pereira et.al. | 2506.15825v1 | null |
2025-06-18 | Federated Learning for MRI-based BrainAGE: a multicenter study on post-stroke functional outcome prediction | Vincent Roca et.al. | 2506.15626v2 | null |
2025-06-18 | FedWSIDD: Federated Whole Slide Image Classification via Dataset Distillation | Haolong Jin et.al. | 2506.15365v1 | link |
2025-06-18 | Centroid Approximation for Byzantine-Tolerant Federated Learning | Mélanie Cambus et.al. | 2506.15264v1 | null |
2025-06-18 | Accessible Gesture-Driven Augmented Reality Interaction System | Yikan Wang et.al. | 2506.15189v1 | null |
2025-06-17 | FedOne: Query-Efficient Federated Learning for Black-box Discrete Prompt Learning | Ganyu Wang et.al. | 2506.14929v1 | link |
2025-06-17 | How many federal employees are not satisfied? Using response times to estimate population proportions under the survey variable cause model | Jonathan Auerbach et.al. | 2506.14915v1 | null |
2025-06-17 | Event-Driven Online Vertical Federated Learning | Ganyu Wang et.al. | 2506.14911v1 | null |
2025-06-17 | Now More Than Ever, Foundational AI Research and Infrastructure Depends on the Federal Government | Michela Taufer et.al. | 2506.14679v1 | null |
2025-06-17 | Knowledge Adaptation as Posterior Correction | Mohammad Emtiyaz Khan et.al. | 2506.14262v1 | null |
2025-06-17 | Convergence-Privacy-Fairness Trade-Off in Personalized Federated Learning | Xiyu Zhao et.al. | 2506.14251v1 | null |
2025-06-17 | A Comprehensive Survey on Underwater Acoustic Target Positioning and Tracking: Progress, Challenges, and Perspectives | Zhong Yang et.al. | 2506.14165v1 | null |
2025-06-16 | PeakWeather: MeteoSwiss Weather Station Measurements for Spatiotemporal Deep Learning | Daniele Zambon et.al. | 2506.13652v1 | link |
2025-06-16 | EBS-CFL: Efficient and Byzantine-robust Secure Clustered Federated Learning | Zhiqiang Li et.al. | 2506.13612v1 | link |
2025-06-16 | Perfect Privacy for Discriminator-Based Byzantine-Resilient Federated Learning | Yue Xia et.al. | 2506.13561v1 | null |
2025-06-16 | A Two-stage Optimization Method for Wide-range Single-electron Quantum Magnetic Sensing | Shiqian Guo et.al. | 2506.13469v1 | null |
2025-06-16 | Federated ADMM from Bayesian Duality | Thomas Möllenhoff et.al. | 2506.13150v1 | link |
2025-06-15 | Does the Expansion of Medicaid Lead to Income Adjustment -- Evidence from SIPP | Mingjian Li et.al. | 2506.12976v1 | null |
2025-06-15 | Privacy-Preserving Federated Learning against Malicious Clients Based on Verifiable Functional Encryption | Nina Cai et.al. | 2506.12846v1 | null |
Framework
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Client Selection Strategies for Federated Semantic Communications in Heterogeneous IoT Networks | Samer Lahoud et.al. | 2506.17063v1 | null |
2025-06-20 | FedFitTech: A Baseline in Federated Learning for Fitness Tracking | Zeyneddin Oz et.al. | 2506.16840v1 | null |
2025-06-20 | Incentivizing High-quality Participation From Federated Learning Agents | Jinlong Pang et.al. | 2506.16731v1 | null |
2025-06-20 | TriCon-SF: A Triple-Shuffle and Contribution-Aware Serial Federated Learning Framework for Heterogeneous Healthcare Data | Yuping Yan et.al. | 2506.16723v1 | null |
2025-06-19 | FLAME: Towards Federated Fine-Tuning Large Language Models Through Adaptive SMoE | Khiem Le et.al. | 2506.16600v1 | null |
2025-06-19 | Teaching Complex Systems based on Microservices | Renato Cordeiro Ferreira et.al. | 2506.16492v1 | null |
2025-06-19 | SecureFed: A Two-Phase Framework for Detecting Malicious Clients in Federated Learning | Likhitha Annapurna Kavuri et.al. | 2506.16458v1 | null |
2025-06-19 | FOCoOp: Enhancing Out-of-Distribution Robustness in Federated Prompt Learning for Vision-Language Models | Xinting Liao et.al. | 2506.16218v1 | null |
2025-06-19 | Test beam measurements and computer simulations of the ATLAS ITk R2 silicon strip detector | Jan-Hendrik Arling et.al. | 2506.16053v1 | null |
2025-06-19 | Leveraging Optimal Transport for Distributed Two-Sample Testing: An Integrated Transportation Distance-based Framework | Zhengqi Lin et.al. | 2506.16047v1 | null |
2025-06-18 | PNCS:Power-Norm Cosine Similarity for Diverse Client Selection in Federated Learning | Liangyan Li et.al. | 2506.15923v1 | null |
2025-06-18 | Heterogeneous Federated Reinforcement Learning Using Wasserstein Barycenters | Luiz Pereira et.al. | 2506.15825v1 | null |
2025-06-18 | Federated Learning for MRI-based BrainAGE: a multicenter study on post-stroke functional outcome prediction | Vincent Roca et.al. | 2506.15626v2 | null |
2025-06-18 | FedWSIDD: Federated Whole Slide Image Classification via Dataset Distillation | Haolong Jin et.al. | 2506.15365v1 | link |
2025-06-18 | Centroid Approximation for Byzantine-Tolerant Federated Learning | Mélanie Cambus et.al. | 2506.15264v1 | null |
2025-06-18 | Accessible Gesture-Driven Augmented Reality Interaction System | Yikan Wang et.al. | 2506.15189v1 | null |
2025-06-17 | FedOne: Query-Efficient Federated Learning for Black-box Discrete Prompt Learning | Ganyu Wang et.al. | 2506.14929v1 | link |
2025-06-17 | How many federal employees are not satisfied? Using response times to estimate population proportions under the survey variable cause model | Jonathan Auerbach et.al. | 2506.14915v1 | null |
2025-06-17 | Event-Driven Online Vertical Federated Learning | Ganyu Wang et.al. | 2506.14911v1 | null |
2025-06-17 | Now More Than Ever, Foundational AI Research and Infrastructure Depends on the Federal Government | Michela Taufer et.al. | 2506.14679v1 | null |
2025-06-17 | Knowledge Adaptation as Posterior Correction | Mohammad Emtiyaz Khan et.al. | 2506.14262v1 | null |
2025-06-17 | Convergence-Privacy-Fairness Trade-Off in Personalized Federated Learning | Xiyu Zhao et.al. | 2506.14251v1 | null |
2025-06-17 | A Comprehensive Survey on Underwater Acoustic Target Positioning and Tracking: Progress, Challenges, and Perspectives | Zhong Yang et.al. | 2506.14165v1 | null |
2025-06-16 | PeakWeather: MeteoSwiss Weather Station Measurements for Spatiotemporal Deep Learning | Daniele Zambon et.al. | 2506.13652v1 | link |
2025-06-16 | EBS-CFL: Efficient and Byzantine-robust Secure Clustered Federated Learning | Zhiqiang Li et.al. | 2506.13612v1 | link |
2025-06-16 | Perfect Privacy for Discriminator-Based Byzantine-Resilient Federated Learning | Yue Xia et.al. | 2506.13561v1 | null |
2025-06-16 | A Two-stage Optimization Method for Wide-range Single-electron Quantum Magnetic Sensing | Shiqian Guo et.al. | 2506.13469v1 | null |
2025-06-16 | Federated ADMM from Bayesian Duality | Thomas Möllenhoff et.al. | 2506.13150v1 | link |
2025-06-15 | Does the Expansion of Medicaid Lead to Income Adjustment -- Evidence from SIPP | Mingjian Li et.al. | 2506.12976v1 | null |
2025-06-15 | Privacy-Preserving Federated Learning against Malicious Clients Based on Verifiable Functional Encryption | Nina Cai et.al. | 2506.12846v1 | null |
Federated Learning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et.al. | 2506.17219v1 | null |
2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et.al. | 2506.17218v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | BREAD: Branched Rollouts from Expert Anchors Bridge SFT & RL for Reasoning | Xuechen Zhang et.al. | 2506.17211v1 | null |
2025-06-20 | DreamCube: 3D Panorama Generation via Multi-plane Synchronization | Yukun Huang et.al. | 2506.17206v1 | null |
2025-06-20 | Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning | Guozheng Ma et.al. | 2506.17204v1 | null |
2025-06-20 | UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation | Teng Li et.al. | 2506.17202v1 | null |
2025-06-20 | Tighter Error Bounds for the qDRIFT Algorithm | I. J. David et.al. | 2506.17199v1 | null |
2025-06-20 | Dex1B: Learning with 1B Demonstrations for Dexterous Manipulation | Jianglong Ye et.al. | 2506.17198v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | Detecting LLM-Generated Short Answers and Effects on Learner Performance | Shambhavi Bhushan et.al. | 2506.17196v1 | null |
2025-06-20 | Facial Landmark Visualization and Emotion Recognition Through Neural Networks | Israel Juárez-Jiménez et.al. | 2506.17191v1 | null |
2025-06-20 | Optimal Implicit Bias in Linear Regression | Kanumuri Nithin Varma et.al. | 2506.17187v1 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186v1 | null |
2025-06-20 | A Common Pool of Privacy Problems: Legal and Technical Lessons from a Large-Scale Web-Scraped Machine Learning Dataset | Rachel Hong et.al. | 2506.17185v1 | null |
2025-06-20 | Variational Learning of Disentangled Representations | Yuli Slavutsky et.al. | 2506.17182v1 | null |
2025-06-20 | Deep generative models as the probability transformation functions | Vitalii Bondar et.al. | 2506.17171v1 | null |
2025-06-20 | Continual Learning with Columnar Spiking Neural Networks | Denis Larionov et.al. | 2506.17169v1 | null |
2025-06-20 | Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model | Side Liu et.al. | 2506.17162v1 | null |
2025-06-20 | Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation | Qing Xu et.al. | 2506.17159v1 | null |
2025-06-20 | Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity | Samin Yeasar Arnob et.al. | 2506.17155v1 | null |
2025-06-20 | Profile monitoring of random functions with Gaussian process basis expansions | Takayuki Iguchi et.al. | 2506.17153v1 | null |
2025-06-20 | Do We Need Large VLMs for Spotting Soccer Actions? | Ritabrata Chakraborty et.al. | 2506.17144v1 | null |
2025-06-20 | MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification | David Jacob Drexlin et.al. | 2506.17140v1 | null |
2025-06-20 | Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion Models | Michael Plainer et.al. | 2506.17139v1 | null |
2025-06-20 | Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations | Dongdong Meng et.al. | 2506.17136v1 | null |
2025-06-20 | Robust Training with Data Augmentation for Medical Imaging Classification | Josué Martínez-Martínez et.al. | 2506.17133v1 | null |
2025-06-20 | Chain-of-Trust: A Progressive Trust Evaluation Framework Enabled by Generative AI | Botao Zhu et.al. | 2506.17130v1 | null |
2025-06-20 | Rapid and Continuous Trust Evaluation for Effective Task Collaboration Through Siamese Model | Botao Zhu et.al. | 2506.17128v1 | null |
2025-06-20 | Large Language Model Unlearning for Source Code | Xue Jiang et.al. | 2506.17125v1 | null |
Dataset
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Client Selection Strategies for Federated Semantic Communications in Heterogeneous IoT Networks | Samer Lahoud et.al. | 2506.17063v1 | null |
2025-06-20 | FedFitTech: A Baseline in Federated Learning for Fitness Tracking | Zeyneddin Oz et.al. | 2506.16840v1 | null |
2025-06-20 | Incentivizing High-quality Participation From Federated Learning Agents | Jinlong Pang et.al. | 2506.16731v1 | null |
2025-06-20 | TriCon-SF: A Triple-Shuffle and Contribution-Aware Serial Federated Learning Framework for Heterogeneous Healthcare Data | Yuping Yan et.al. | 2506.16723v1 | null |
2025-06-19 | FLAME: Towards Federated Fine-Tuning Large Language Models Through Adaptive SMoE | Khiem Le et.al. | 2506.16600v1 | null |
2025-06-19 | Teaching Complex Systems based on Microservices | Renato Cordeiro Ferreira et.al. | 2506.16492v1 | null |
2025-06-19 | SecureFed: A Two-Phase Framework for Detecting Malicious Clients in Federated Learning | Likhitha Annapurna Kavuri et.al. | 2506.16458v1 | null |
2025-06-19 | FOCoOp: Enhancing Out-of-Distribution Robustness in Federated Prompt Learning for Vision-Language Models | Xinting Liao et.al. | 2506.16218v1 | null |
2025-06-19 | Test beam measurements and computer simulations of the ATLAS ITk R2 silicon strip detector | Jan-Hendrik Arling et.al. | 2506.16053v1 | null |
2025-06-19 | Leveraging Optimal Transport for Distributed Two-Sample Testing: An Integrated Transportation Distance-based Framework | Zhengqi Lin et.al. | 2506.16047v1 | null |
2025-06-18 | PNCS:Power-Norm Cosine Similarity for Diverse Client Selection in Federated Learning | Liangyan Li et.al. | 2506.15923v1 | null |
2025-06-18 | Heterogeneous Federated Reinforcement Learning Using Wasserstein Barycenters | Luiz Pereira et.al. | 2506.15825v1 | null |
2025-06-18 | Federated Learning for MRI-based BrainAGE: a multicenter study on post-stroke functional outcome prediction | Vincent Roca et.al. | 2506.15626v2 | null |
2025-06-18 | FedWSIDD: Federated Whole Slide Image Classification via Dataset Distillation | Haolong Jin et.al. | 2506.15365v1 | link |
2025-06-18 | Centroid Approximation for Byzantine-Tolerant Federated Learning | Mélanie Cambus et.al. | 2506.15264v1 | null |
2025-06-18 | Accessible Gesture-Driven Augmented Reality Interaction System | Yikan Wang et.al. | 2506.15189v1 | null |
2025-06-17 | FedOne: Query-Efficient Federated Learning for Black-box Discrete Prompt Learning | Ganyu Wang et.al. | 2506.14929v1 | link |
2025-06-17 | How many federal employees are not satisfied? Using response times to estimate population proportions under the survey variable cause model | Jonathan Auerbach et.al. | 2506.14915v1 | null |
2025-06-17 | Event-Driven Online Vertical Federated Learning | Ganyu Wang et.al. | 2506.14911v1 | null |
2025-06-17 | Now More Than Ever, Foundational AI Research and Infrastructure Depends on the Federal Government | Michela Taufer et.al. | 2506.14679v1 | null |
2025-06-17 | Knowledge Adaptation as Posterior Correction | Mohammad Emtiyaz Khan et.al. | 2506.14262v1 | null |
2025-06-17 | Convergence-Privacy-Fairness Trade-Off in Personalized Federated Learning | Xiyu Zhao et.al. | 2506.14251v1 | null |
2025-06-17 | A Comprehensive Survey on Underwater Acoustic Target Positioning and Tracking: Progress, Challenges, and Perspectives | Zhong Yang et.al. | 2506.14165v1 | null |
2025-06-16 | PeakWeather: MeteoSwiss Weather Station Measurements for Spatiotemporal Deep Learning | Daniele Zambon et.al. | 2506.13652v1 | link |
2025-06-16 | EBS-CFL: Efficient and Byzantine-robust Secure Clustered Federated Learning | Zhiqiang Li et.al. | 2506.13612v1 | link |
2025-06-16 | Perfect Privacy for Discriminator-Based Byzantine-Resilient Federated Learning | Yue Xia et.al. | 2506.13561v1 | null |
2025-06-16 | A Two-stage Optimization Method for Wide-range Single-electron Quantum Magnetic Sensing | Shiqian Guo et.al. | 2506.13469v1 | null |
2025-06-16 | Federated ADMM from Bayesian Duality | Thomas Möllenhoff et.al. | 2506.13150v1 | link |
2025-06-15 | Does the Expansion of Medicaid Lead to Income Adjustment -- Evidence from SIPP | Mingjian Li et.al. | 2506.12976v1 | null |
2025-06-15 | Privacy-Preserving Federated Learning against Malicious Clients Based on Verifiable Functional Encryption | Nina Cai et.al. | 2506.12846v1 | null |
Few-shot Learning
Meta Learning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | RocketStack: A level-aware deep recursive ensemble learning framework with exploratory feature fusion and model pruning dynamics | Çağatay Demirel et.al. | 2506.16965v1 | null |
2025-06-20 | Tracker Installations Are Not Created Equal: Understanding Tracker Configuration of Form Data Collection | Julia B. Kieserman et.al. | 2506.16891v1 | null |
2025-06-20 | Self-supervised Feature Extraction for Enhanced Ball Detection on Soccer Robots | Can Lin et.al. | 2506.16821v1 | null |
2025-06-19 | Latent Noise Injection for Private and Statistically Aligned Synthetic Data Generation | Rex Shen et.al. | 2506.16636v1 | null |
2025-06-19 | MetaQAP -- A Meta-Learning Approach for Quality-Aware Pretraining in Image Quality Assessment | Muhammad Azeem Aslam et.al. | 2506.16601v1 | null |
2025-06-19 | External Evaluation of Discrimination Mitigation Efforts in Meta's Ad Delivery | Basileal Imana et.al. | 2506.16560v1 | null |
2025-06-19 | From LLM-anation to LLM-orchestrator: Coordinating Small Models for Data Labeling | Yao Lu et.al. | 2506.16393v1 | null |
2025-06-19 | Next-Token Prediction Should be Ambiguity-Sensitive: A Meta-Learning Perspective | Leo Gagnon et.al. | 2506.16288v1 | null |
2025-06-19 | A Quantum-Control Lambda-Calculus with Multiple Measurement Bases | Alejandro Díaz-Caro et.al. | 2506.16244v1 | null |
2025-06-19 | Holographic Baryons as Quantum Hall Droplets | Francesco Bigazzi et.al. | 2506.16205v1 | null |
2025-06-19 | A Brain-to-Population Graph Learning Framework for Diagnosing Brain Disorders | Qianqian Liao et.al. | 2506.16096v1 | null |
2025-06-19 | Self-Critique-Guided Curiosity Refinement: Enhancing Honesty and Helpfulness in Large Language Models via In-Context Learning | Duc Hieu Ho et.al. | 2506.16064v1 | null |
2025-06-18 | Unifying VXAI: A Systematic Review and Framework for the Evaluation of Explainable AI | David Dembinsky et.al. | 2506.15408v1 | null |
2025-06-18 | Contribution of expert aggregation to temperature prediction part II: Second order bounds with sleeping experts | Léo Pfitzner et.al. | 2506.15216v1 | null |
2025-06-18 | In-Context Learning for Gradient-Free Receiver Adaptation: Principles, Applications, and Theory | Matteo Zecchin et.al. | 2506.15176v1 | null |
2025-06-17 | Context Matters: Learning Generalizable Rewards via Calibrated Features | Alexandra Forsey-Smerek et.al. | 2506.15012v2 | null |
2025-06-17 | FocalClick-XL: Towards Unified and High-quality Interactive Segmentation | Xi Chen et.al. | 2506.14686v1 | null |
2025-06-17 | GAMORA: A Gesture Articulated Meta Operative Robotic Arm for Hazardous Material Handling in Containment-Level Environments | Farha Abdul Wasay et.al. | 2506.14513v1 | null |
2025-06-17 | HiLight: A Hierarchical Reinforcement Learning Framework with Global Adversarial Guidance for Large-Scale Traffic Signal Control | Yaqiao Zhu et.al. | 2506.14391v1 | null |
2025-06-17 | Vulnerability Disclosure or Notification? Best Practices for Reaching Stakeholders at Scale | Ting-Han Chen et.al. | 2506.14323v1 | null |
2025-06-17 | Meta-SurDiff: Classification Diffusion Model Optimized by Meta Learning is Reliable for Online Surgical Phase Recognition | Yufei Li et.al. | 2506.14181v1 | null |
2025-06-16 | Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences | Stas Bekman et.al. | 2506.13996v1 | link |
2025-06-16 | Meta Optimality for Demographic Parity Constrained Regression via Post-Processing | Kazuto Fukuchi et.al. | 2506.13947v1 | null |
2025-06-16 | Few-Shot Learning for Industrial Time Series: A Comparative Analysis Using the Example of Screw-Fastening Process Monitoring | Xinyuan Tu et.al. | 2506.13909v1 | null |
2025-06-16 | Scaling Algorithm Distillation for Continuous Control with Mamba | Samuel Beaussant et.al. | 2506.13892v1 | null |
2025-06-16 | Meta-learning how to Share Credit among Macro-Actions | Ionel-Alexandru Hosu et.al. | 2506.13690v1 | null |
2025-06-16 | Hybrid Meta-learners for Estimating Heterogeneous Treatment Effects | Zhongyuan Liang et.al. | 2506.13680v1 | null |
2025-06-16 | Bayesian Quantification of Observability and Equation of State of Twin Stars | Xavier Grundler et.al. | 2506.13677v1 | null |
2025-06-16 | Assignment of collision-induced four-level double-resonance transitions in the 3$ν$${_3}$ ${\Leftarrow}$ $ν$${_3}$ spectral region of methane | Kevin K. Lehmann et.al. | 2506.13644v1 | null |
2025-06-16 | Socratic RL: A Novel Framework for Efficient Knowledge Acquisition through Iterative Reflection and Viewpoint Distillation | Xiangfan Wu et.al. | 2506.13358v1 | null |
One-shot Learning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et.al. | 2506.17219v1 | null |
2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et.al. | 2506.17218v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | BREAD: Branched Rollouts from Expert Anchors Bridge SFT & RL for Reasoning | Xuechen Zhang et.al. | 2506.17211v1 | null |
2025-06-20 | DreamCube: 3D Panorama Generation via Multi-plane Synchronization | Yukun Huang et.al. | 2506.17206v1 | null |
2025-06-20 | Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning | Guozheng Ma et.al. | 2506.17204v1 | null |
2025-06-20 | UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation | Teng Li et.al. | 2506.17202v1 | null |
2025-06-20 | Tighter Error Bounds for the qDRIFT Algorithm | I. J. David et.al. | 2506.17199v1 | null |
2025-06-20 | Dex1B: Learning with 1B Demonstrations for Dexterous Manipulation | Jianglong Ye et.al. | 2506.17198v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | Detecting LLM-Generated Short Answers and Effects on Learner Performance | Shambhavi Bhushan et.al. | 2506.17196v1 | null |
2025-06-20 | Facial Landmark Visualization and Emotion Recognition Through Neural Networks | Israel Juárez-Jiménez et.al. | 2506.17191v1 | null |
2025-06-20 | Optimal Implicit Bias in Linear Regression | Kanumuri Nithin Varma et.al. | 2506.17187v1 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186v1 | null |
2025-06-20 | A Common Pool of Privacy Problems: Legal and Technical Lessons from a Large-Scale Web-Scraped Machine Learning Dataset | Rachel Hong et.al. | 2506.17185v1 | null |
2025-06-20 | Variational Learning of Disentangled Representations | Yuli Slavutsky et.al. | 2506.17182v1 | null |
2025-06-20 | Deep generative models as the probability transformation functions | Vitalii Bondar et.al. | 2506.17171v1 | null |
2025-06-20 | Continual Learning with Columnar Spiking Neural Networks | Denis Larionov et.al. | 2506.17169v1 | null |
2025-06-20 | Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model | Side Liu et.al. | 2506.17162v1 | null |
2025-06-20 | Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation | Qing Xu et.al. | 2506.17159v1 | null |
2025-06-20 | Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity | Samin Yeasar Arnob et.al. | 2506.17155v1 | null |
2025-06-20 | Profile monitoring of random functions with Gaussian process basis expansions | Takayuki Iguchi et.al. | 2506.17153v1 | null |
2025-06-20 | Do We Need Large VLMs for Spotting Soccer Actions? | Ritabrata Chakraborty et.al. | 2506.17144v1 | null |
2025-06-20 | MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification | David Jacob Drexlin et.al. | 2506.17140v1 | null |
2025-06-20 | Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion Models | Michael Plainer et.al. | 2506.17139v1 | null |
2025-06-20 | Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations | Dongdong Meng et.al. | 2506.17136v1 | null |
2025-06-20 | Robust Training with Data Augmentation for Medical Imaging Classification | Josué Martínez-Martínez et.al. | 2506.17133v1 | null |
2025-06-20 | Chain-of-Trust: A Progressive Trust Evaluation Framework Enabled by Generative AI | Botao Zhu et.al. | 2506.17130v1 | null |
2025-06-20 | Rapid and Continuous Trust Evaluation for Effective Task Collaboration Through Siamese Model | Botao Zhu et.al. | 2506.17128v1 | null |
2025-06-20 | Large Language Model Unlearning for Source Code | Xue Jiang et.al. | 2506.17125v1 | null |
Few-shot Learning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et.al. | 2506.17219v1 | null |
2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et.al. | 2506.17218v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | BREAD: Branched Rollouts from Expert Anchors Bridge SFT & RL for Reasoning | Xuechen Zhang et.al. | 2506.17211v1 | null |
2025-06-20 | DreamCube: 3D Panorama Generation via Multi-plane Synchronization | Yukun Huang et.al. | 2506.17206v1 | null |
2025-06-20 | Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning | Guozheng Ma et.al. | 2506.17204v1 | null |
2025-06-20 | UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation | Teng Li et.al. | 2506.17202v1 | null |
2025-06-20 | Tighter Error Bounds for the qDRIFT Algorithm | I. J. David et.al. | 2506.17199v1 | null |
2025-06-20 | Dex1B: Learning with 1B Demonstrations for Dexterous Manipulation | Jianglong Ye et.al. | 2506.17198v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | Detecting LLM-Generated Short Answers and Effects on Learner Performance | Shambhavi Bhushan et.al. | 2506.17196v1 | null |
2025-06-20 | Facial Landmark Visualization and Emotion Recognition Through Neural Networks | Israel Juárez-Jiménez et.al. | 2506.17191v1 | null |
2025-06-20 | Optimal Implicit Bias in Linear Regression | Kanumuri Nithin Varma et.al. | 2506.17187v1 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186v1 | null |
2025-06-20 | A Common Pool of Privacy Problems: Legal and Technical Lessons from a Large-Scale Web-Scraped Machine Learning Dataset | Rachel Hong et.al. | 2506.17185v1 | null |
2025-06-20 | Variational Learning of Disentangled Representations | Yuli Slavutsky et.al. | 2506.17182v1 | null |
2025-06-20 | Deep generative models as the probability transformation functions | Vitalii Bondar et.al. | 2506.17171v1 | null |
2025-06-20 | Continual Learning with Columnar Spiking Neural Networks | Denis Larionov et.al. | 2506.17169v1 | null |
2025-06-20 | Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model | Side Liu et.al. | 2506.17162v1 | null |
2025-06-20 | Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation | Qing Xu et.al. | 2506.17159v1 | null |
2025-06-20 | Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity | Samin Yeasar Arnob et.al. | 2506.17155v1 | null |
2025-06-20 | Profile monitoring of random functions with Gaussian process basis expansions | Takayuki Iguchi et.al. | 2506.17153v1 | null |
2025-06-20 | Do We Need Large VLMs for Spotting Soccer Actions? | Ritabrata Chakraborty et.al. | 2506.17144v1 | null |
2025-06-20 | MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification | David Jacob Drexlin et.al. | 2506.17140v1 | null |
2025-06-20 | Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion Models | Michael Plainer et.al. | 2506.17139v1 | null |
2025-06-20 | Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations | Dongdong Meng et.al. | 2506.17136v1 | null |
2025-06-20 | Robust Training with Data Augmentation for Medical Imaging Classification | Josué Martínez-Martínez et.al. | 2506.17133v1 | null |
2025-06-20 | Chain-of-Trust: A Progressive Trust Evaluation Framework Enabled by Generative AI | Botao Zhu et.al. | 2506.17130v1 | null |
2025-06-20 | Rapid and Continuous Trust Evaluation for Effective Task Collaboration Through Siamese Model | Botao Zhu et.al. | 2506.17128v1 | null |
2025-06-20 | Large Language Model Unlearning for Source Code | Xue Jiang et.al. | 2506.17125v1 | null |
Unsupervised Learning
Unsupervised Learning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et.al. | 2506.17219v1 | null |
2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et.al. | 2506.17218v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | BREAD: Branched Rollouts from Expert Anchors Bridge SFT & RL for Reasoning | Xuechen Zhang et.al. | 2506.17211v1 | null |
2025-06-20 | DreamCube: 3D Panorama Generation via Multi-plane Synchronization | Yukun Huang et.al. | 2506.17206v1 | null |
2025-06-20 | Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning | Guozheng Ma et.al. | 2506.17204v1 | null |
2025-06-20 | UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation | Teng Li et.al. | 2506.17202v1 | null |
2025-06-20 | Tighter Error Bounds for the qDRIFT Algorithm | I. J. David et.al. | 2506.17199v1 | null |
2025-06-20 | Dex1B: Learning with 1B Demonstrations for Dexterous Manipulation | Jianglong Ye et.al. | 2506.17198v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | Detecting LLM-Generated Short Answers and Effects on Learner Performance | Shambhavi Bhushan et.al. | 2506.17196v1 | null |
2025-06-20 | Facial Landmark Visualization and Emotion Recognition Through Neural Networks | Israel Juárez-Jiménez et.al. | 2506.17191v1 | null |
2025-06-20 | Optimal Implicit Bias in Linear Regression | Kanumuri Nithin Varma et.al. | 2506.17187v1 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186v1 | null |
2025-06-20 | A Common Pool of Privacy Problems: Legal and Technical Lessons from a Large-Scale Web-Scraped Machine Learning Dataset | Rachel Hong et.al. | 2506.17185v1 | null |
2025-06-20 | Variational Learning of Disentangled Representations | Yuli Slavutsky et.al. | 2506.17182v1 | null |
2025-06-20 | Deep generative models as the probability transformation functions | Vitalii Bondar et.al. | 2506.17171v1 | null |
2025-06-20 | Continual Learning with Columnar Spiking Neural Networks | Denis Larionov et.al. | 2506.17169v1 | null |
2025-06-20 | Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model | Side Liu et.al. | 2506.17162v1 | null |
2025-06-20 | Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation | Qing Xu et.al. | 2506.17159v1 | null |
2025-06-20 | Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity | Samin Yeasar Arnob et.al. | 2506.17155v1 | null |
2025-06-20 | Profile monitoring of random functions with Gaussian process basis expansions | Takayuki Iguchi et.al. | 2506.17153v1 | null |
2025-06-20 | Do We Need Large VLMs for Spotting Soccer Actions? | Ritabrata Chakraborty et.al. | 2506.17144v1 | null |
2025-06-20 | MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification | David Jacob Drexlin et.al. | 2506.17140v1 | null |
2025-06-20 | Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion Models | Michael Plainer et.al. | 2506.17139v1 | null |
2025-06-20 | On the Theory of Conditional Feature Alignment for Unsupervised Domain-Adaptive Counting | Zhuonan Liang et.al. | 2506.17137v1 | null |
2025-06-20 | Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations | Dongdong Meng et.al. | 2506.17136v1 | null |
2025-06-20 | Robust Training with Data Augmentation for Medical Imaging Classification | Josué Martínez-Martínez et.al. | 2506.17133v1 | null |
2025-06-20 | Chain-of-Trust: A Progressive Trust Evaluation Framework Enabled by Generative AI | Botao Zhu et.al. | 2506.17130v1 | null |
2025-06-20 | Rapid and Continuous Trust Evaluation for Effective Task Collaboration Through Siamese Model | Botao Zhu et.al. | 2506.17128v1 | null |
Transfer Learning
Transfer Learning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et.al. | 2506.17219v1 | null |
2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et.al. | 2506.17218v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | BREAD: Branched Rollouts from Expert Anchors Bridge SFT & RL for Reasoning | Xuechen Zhang et.al. | 2506.17211v1 | null |
2025-06-20 | DreamCube: 3D Panorama Generation via Multi-plane Synchronization | Yukun Huang et.al. | 2506.17206v1 | null |
2025-06-20 | Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning | Guozheng Ma et.al. | 2506.17204v1 | null |
2025-06-20 | UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation | Teng Li et.al. | 2506.17202v1 | null |
2025-06-20 | Tighter Error Bounds for the qDRIFT Algorithm | I. J. David et.al. | 2506.17199v1 | null |
2025-06-20 | Dex1B: Learning with 1B Demonstrations for Dexterous Manipulation | Jianglong Ye et.al. | 2506.17198v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | Detecting LLM-Generated Short Answers and Effects on Learner Performance | Shambhavi Bhushan et.al. | 2506.17196v1 | null |
2025-06-20 | Tensor network calculation of boundary and corner magnetization | Roman Krcmar et.al. | 2506.17194v1 | null |
2025-06-20 | Facial Landmark Visualization and Emotion Recognition Through Neural Networks | Israel Juárez-Jiménez et.al. | 2506.17191v1 | null |
2025-06-20 | Optimal Implicit Bias in Linear Regression | Kanumuri Nithin Varma et.al. | 2506.17187v1 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186v1 | null |
2025-06-20 | A Common Pool of Privacy Problems: Legal and Technical Lessons from a Large-Scale Web-Scraped Machine Learning Dataset | Rachel Hong et.al. | 2506.17185v1 | null |
2025-06-20 | Judo: A User-Friendly Open-Source Package for Sampling-Based Model Predictive Control | Albert H. Li et.al. | 2506.17184v1 | null |
2025-06-20 | Variational Learning of Disentangled Representations | Yuli Slavutsky et.al. | 2506.17182v1 | null |
2025-06-20 | Deep generative models as the probability transformation functions | Vitalii Bondar et.al. | 2506.17171v1 | null |
2025-06-20 | Continual Learning with Columnar Spiking Neural Networks | Denis Larionov et.al. | 2506.17169v1 | null |
2025-06-20 | Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model | Side Liu et.al. | 2506.17162v1 | null |
2025-06-20 | Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation | Qing Xu et.al. | 2506.17159v1 | null |
2025-06-20 | Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity | Samin Yeasar Arnob et.al. | 2506.17155v1 | null |
2025-06-20 | Profile monitoring of random functions with Gaussian process basis expansions | Takayuki Iguchi et.al. | 2506.17153v1 | null |
2025-06-20 | Do We Need Large VLMs for Spotting Soccer Actions? | Ritabrata Chakraborty et.al. | 2506.17144v1 | null |
2025-06-20 | MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification | David Jacob Drexlin et.al. | 2506.17140v1 | null |
2025-06-20 | Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion Models | Michael Plainer et.al. | 2506.17139v1 | null |
2025-06-20 | Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations | Dongdong Meng et.al. | 2506.17136v1 | null |
2025-06-20 | Robust Training with Data Augmentation for Medical Imaging Classification | Josué Martínez-Martínez et.al. | 2506.17133v1 | null |
2025-06-20 | Chain-of-Trust: A Progressive Trust Evaluation Framework Enabled by Generative AI | Botao Zhu et.al. | 2506.17130v1 | null |
Reinforcement Learning
Reinforcement Learning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning | Zhangyang Qi et.al. | 2506.17221v1 | null |
2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et.al. | 2506.17219v1 | null |
2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et.al. | 2506.17218v1 | null |
2025-06-20 | BREAD: Branched Rollouts from Expert Anchors Bridge SFT & RL for Reasoning | Xuechen Zhang et.al. | 2506.17211v1 | null |
2025-06-20 | Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning | Guozheng Ma et.al. | 2506.17204v1 | null |
2025-06-20 | Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity | Samin Yeasar Arnob et.al. | 2506.17155v1 | null |
2025-06-20 | When Can Model-Free Reinforcement Learning be Enough for Thinking? | Josiah P. Hanna et.al. | 2506.17124v1 | null |
2025-06-20 | TransDreamerV3: Implanting Transformer In DreamerV3 | Shruti Sadanand Dongare et.al. | 2506.17103v1 | null |
2025-06-20 | Tower+: Bridging Generality and Translation Specialization in Multilingual LLMs | Ricardo Rei et.al. | 2506.17080v1 | null |
2025-06-20 | Scalable and Reliable Multi-agent Reinforcement Learning for Traffic Assignment | Leizhen Wang et.al. | 2506.17029v1 | null |
2025-06-20 | Robust Reinforcement Learning for Discrete Compositional Generation via General Soft Operators | Marco Jiralerspong et.al. | 2506.17007v1 | null |
2025-06-20 | "Whoever needs to see it, will see it": Motivations and Labor of Creating Algorithmic Conspirituality Content on TikTok | Ankolika De et.al. | 2506.16851v1 | null |
2025-06-20 | Learning Dexterous Object Handover | Daniel Frau-Alfaro et.al. | 2506.16822v1 | null |
2025-06-20 | Integrating Traditional Technical Analysis with AI: A Multi-Agent LLM-Based Approach to Stock Market Forecasting | Michał Wawer et.al. | 2506.16813v1 | null |
2025-06-20 | RealSR-R1: Reinforcement Learning for Real-World Image Super-Resolution with Vision-Language Chain-of-Thought | Junbo Qiao et.al. | 2506.16796v1 | null |
2025-06-20 | Robust Dynamic Material Handling via Adaptive Constrained Evolutionary Reinforcement Learning | Chengpeng Hu et.al. | 2506.16795v1 | null |
2025-06-20 | What Is the Point of Equality in Machine Learning Fairness? Beyond Equality of Opportunity | Youjin Kong et.al. | 2506.16782v1 | null |
2025-06-20 | Reinforcement learning for hybrid charging stations planning and operation considering fixed and mobile chargers | Yanchen Zhu et.al. | 2506.16764v1 | null |
2025-06-20 | Off-Policy Actor-Critic for Adversarial Observation Robustness: Virtual Alternative Training via Symmetric Policy Evaluation | Kosuke Nakanishi et.al. | 2506.16753v1 | null |
2025-06-20 | DRARL: Disengagement-Reason-Augmented Reinforcement Learning for Efficient Improvement of Autonomous Driving Policy | Weitao Zhou et.al. | 2506.16720v1 | null |
2025-06-20 | Generalizable Agent Modeling for Agent Collaboration-Competition Adaptation with Multi-Retrieval and Dynamic Generation | Chenxu Wang et.al. | 2506.16718v1 | null |
2025-06-20 | ReasonGRM: Enhancing Generative Reward Models through Large Reasoning Models | Bin Chen et.al. | 2506.16712v1 | null |
2025-06-20 | Interpretable Low-Dimensional Modeling of Spatiotemporal Agent States for Decision Making in Football Tactics | Kenjiro Ide et.al. | 2506.16696v1 | null |
2025-06-20 | Comparison of Lumerical FDTD and Tidy3D for three-dimensional FDTD simulations of passive silicon photonic components | Zuyang Liu et.al. | 2506.16665v1 | null |
2025-06-19 | Distribution Parameter Actor-Critic: Shifting the Agent-Environment Boundary for Diverse Action Spaces | Jiamin He et.al. | 2506.16608v1 | null |
2025-06-19 | Energy-Based Transfer for Reinforcement Learning | Zeyun Deng et.al. | 2506.16590v1 | null |
2025-06-19 | BIDA: A Bi-level Interaction Decision-making Algorithm for Autonomous Vehicles in Dynamic Traffic Scenarios | Liyang Yu et.al. | 2506.16546v1 | null |
2025-06-19 | EFormer: An Effective Edge-based Transformer for Vehicle Routing Problems | Dian Meng et.al. | 2506.16428v1 | null |
2025-06-19 | GoalLadder: Incremental Goal Discovery with Vision-Language Models | Alexey Zakharov et.al. | 2506.16396v1 | null |
2025-06-19 | From LLM-anation to LLM-orchestrator: Coordinating Small Models for Data Labeling | Yao Lu et.al. | 2506.16393v1 | null |
Transformer
Vision Transformer
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning | Zhangyang Qi et.al. | 2506.17221v1 | null |
2025-06-20 | Emergent Temporal Correspondences from Video Diffusion Transformers | Jisu Nam et.al. | 2506.17220v1 | link |
2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et.al. | 2506.17218v1 | null |
2025-06-20 | Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation | Xiuyu Yang et.al. | 2506.17213v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | DreamCube: 3D Panorama Generation via Multi-plane Synchronization | Yukun Huang et.al. | 2506.17206v1 | null |
2025-06-20 | UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation | Teng Li et.al. | 2506.17202v1 | null |
2025-06-20 | Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition | Jiaqi Li et.al. | 2506.17201v1 | null |
2025-06-20 | Dex1B: Learning with 1B Demonstrations for Dexterous Manipulation | Jianglong Ye et.al. | 2506.17198v1 | null |
2025-06-20 | Facial Landmark Visualization and Emotion Recognition Through Neural Networks | Israel Juárez-Jiménez et.al. | 2506.17191v1 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186v1 | null |
2025-06-20 | Fault Tolerance by Construction | Benjamin Rodatz et.al. | 2506.17181v1 | null |
2025-06-20 | Deep generative models as the probability transformation functions | Vitalii Bondar et.al. | 2506.17171v1 | null |
2025-06-20 | Scaling limits for sample autocovariance operators of Hilbert space-valued linear processes | Marie-Christine Düker et.al. | 2506.17168v1 | null |
2025-06-20 | Proportional Sensitivity in Generative Adversarial Network (GAN)-Augmented Brain Tumor Classification Using Convolutional Neural Network | Mahin Montasir Afif et.al. | 2506.17165v1 | null |
2025-06-20 | The MedPerturb Dataset: What Non-Content Perturbations Reveal About Human and Clinical LLM Decision Making | Abinitha Gourabathina et.al. | 2506.17163v1 | null |
2025-06-20 | Walking Fingerprinting Using Wrist Accelerometry During Activities of Daily Living in NHANES | Lily Koffman et.al. | 2506.17160v1 | null |
2025-06-20 | Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation | Qing Xu et.al. | 2506.17159v1 | null |
2025-06-20 | Do We Need Large VLMs for Spotting Soccer Actions? | Ritabrata Chakraborty et.al. | 2506.17144v1 | null |
2025-06-20 | MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification | David Jacob Drexlin et.al. | 2506.17140v1 | null |
2025-06-20 | On the Theory of Conditional Feature Alignment for Unsupervised Domain-Adaptive Counting | Zhuonan Liang et.al. | 2506.17137v1 | null |
2025-06-20 | Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations | Dongdong Meng et.al. | 2506.17136v1 | null |
2025-06-20 | Dynamic Watermark Generation for Digital Images using Perimeter Gated SPAD Imager PUFs | Md Sakibur Sajal et.al. | 2506.17134v1 | null |
2025-06-20 | Robust Training with Data Augmentation for Medical Imaging Classification | Josué Martínez-Martínez et.al. | 2506.17133v1 | null |
2025-06-20 | Reassessing Code Authorship Attribution in the Era of Language Models | Atish Kumar Dipongkor et.al. | 2506.17120v1 | null |
2025-06-20 | RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking | Teng Guo et.al. | 2506.17119v1 | null |
2025-06-20 | A Vision for Trustworthy, Fair, and Efficient Socio-Technical Control using Karma Economies | Ezzat Elokda et.al. | 2506.17115v1 | null |
2025-06-20 | MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation | Shoubin Yu et.al. | 2506.17113v1 | null |
2025-06-20 | Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping | Teng Guo et.al. | 2506.17110v1 | null |
2025-06-20 | TransDreamerV3: Implanting Transformer In DreamerV3 | Shruti Sadanand Dongare et.al. | 2506.17103v1 | null |
Transformer
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | Emergent Temporal Correspondences from Video Diffusion Transformers | Jisu Nam et.al. | 2506.17220v1 | link |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation | Teng Li et.al. | 2506.17202v1 | null |
2025-06-20 | Fault Tolerance by Construction | Benjamin Rodatz et.al. | 2506.17181v1 | null |
2025-06-20 | Deep generative models as the probability transformation functions | Vitalii Bondar et.al. | 2506.17171v1 | null |
2025-06-20 | Scaling limits for sample autocovariance operators of Hilbert space-valued linear processes | Marie-Christine Düker et.al. | 2506.17168v1 | null |
2025-06-20 | The MedPerturb Dataset: What Non-Content Perturbations Reveal About Human and Clinical LLM Decision Making | Abinitha Gourabathina et.al. | 2506.17163v1 | null |
2025-06-20 | Walking Fingerprinting Using Wrist Accelerometry During Activities of Daily Living in NHANES | Lily Koffman et.al. | 2506.17160v1 | null |
2025-06-20 | Reassessing Code Authorship Attribution in the Era of Language Models | Atish Kumar Dipongkor et.al. | 2506.17120v1 | null |
2025-06-20 | TransDreamerV3: Implanting Transformer In DreamerV3 | Shruti Sadanand Dongare et.al. | 2506.17103v1 | null |
2025-06-20 | Cross-Modal Epileptic Signal Harmonization: Frequency Domain Mapping Quantization for Pre-training a Unified Neurophysiological Transformer | Runkai Zhang et.al. | 2506.17068v1 | null |
2025-06-20 | From Concepts to Components: Concept-Agnostic Attention Module Discovery in Transformers | Jingtong Su et.al. | 2506.17052v1 | null |
2025-06-20 | Relaxed syntax modeling in Transformers for future-proof license plate recognition | Florent Meyer et.al. | 2506.17051v1 | null |
2025-06-20 | MAWIFlow Benchmark: Realistic Flow-Based Evaluation for Network Intrusion Detection | Joshua Schraven et.al. | 2506.17041v1 | null |
2025-06-20 | Stretching Beyond the Obvious: A Gradient-Free Framework to Unveil the Hidden Landscape of Visual Invariance | Lorenzo Tausani et.al. | 2506.17040v1 | null |
2025-06-20 | LSCD: Lomb-Scargle Conditioned Diffusion for Time series Imputation | Elizabeth Fons et.al. | 2506.17039v1 | null |
2025-06-20 | $N=1$ Supersymmetry invariance of the Abelian Stueckelberg model | M. A. L. Capri et.al. | 2506.17021v1 | null |
2025-06-20 | A Quantile Regression Approach for Remaining Useful Life Estimation with State Space Models | Davide Frizzo et.al. | 2506.17018v1 | null |
2025-06-20 | The Hidden Cost of an Image: Quantifying the Energy Consumption of AI Image Generation | Giulia Bertazzini et.al. | 2506.17016v1 | null |
2025-06-20 | A Semi-Parametric Torus-to-Torus Regression Model with Geometric Loss: Application to Cyclone Data | Surojit Biswas et.al. | 2506.17014v1 | null |
2025-06-20 | Estimating Deprivation Cost Functions for Power Outages During Disasters: A Discrete Choice Modeling Approach | Xiangpeng Li et.al. | 2506.16993v1 | null |
2025-06-20 | Latent Concept Disentanglement in Transformer-based Language Models | Guan Zhe Hong et.al. | 2506.16975v1 | null |
2025-06-20 | MM-AttacKG: A Multimodal Approach to Attack Graph Construction with Large Language Models | Yongheng Zhang et.al. | 2506.16968v1 | null |
2025-06-20 | Reversing Flow for Image Restoration | Haina Qin et.al. | 2506.16961v1 | null |
2025-06-20 | PET Tracer Separation Using Conditional Diffusion Transformer with Multi-latent Space Learning | Bin Huang et.al. | 2506.16934v1 | null |
2025-06-20 | Multimodal Fused Learning for Solving the Generalized Traveling Salesman Problem in Robotic Task Planning | Jiaqi Chen et.al. | 2506.16931v1 | null |
2025-06-20 | EHCube4P: Learning Epistatic Patterns Through Hypercube Graph Convolution Neural Network for Protein Fitness Function Estimation | Muhammad Daud et.al. | 2506.16921v1 | null |
2025-06-20 | Controlling Enhancement of Transmitted Goos-Hänchen Shifts: From Symmetric to Unidirectional | Zhuolin Wu et.al. | 2506.16913v1 | null |
2025-06-20 | Winding-Control Mechanism of Non-Hermitian Systems | Yongxu Fu et.al. | 2506.16887v1 | null |
2025-06-20 | Vision-Based Multirotor Control for Spherical Target Tracking: A Bearing-Angle Approach | Marcelo Jacinto et.al. | 2506.16870v1 | null |
Contrastive Learning
Contrastive Learning
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | No Free Lunch: Rethinking Internal Feedback for LLM Reasoning | Yanzhi Zhang et.al. | 2506.17219v1 | null |
2025-06-20 | Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | Zeyuan Yang et.al. | 2506.17218v1 | null |
2025-06-20 | Part$^{2}$GS: Part-aware Modeling of Articulated Objects using 3D Gaussian Splatting | Tianjiao Yu et.al. | 2506.17212v1 | null |
2025-06-20 | BREAD: Branched Rollouts from Expert Anchors Bridge SFT & RL for Reasoning | Xuechen Zhang et.al. | 2506.17211v1 | null |
2025-06-20 | DreamCube: 3D Panorama Generation via Multi-plane Synchronization | Yukun Huang et.al. | 2506.17206v1 | null |
2025-06-20 | Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning | Guozheng Ma et.al. | 2506.17204v1 | null |
2025-06-20 | Confidence Scoring for LLM-Generated SQL in Supply Chain Data Extraction | Jiekai Ma et.al. | 2506.17203v1 | null |
2025-06-20 | UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation | Teng Li et.al. | 2506.17202v1 | null |
2025-06-20 | Tighter Error Bounds for the qDRIFT Algorithm | I. J. David et.al. | 2506.17199v1 | null |
2025-06-20 | Dex1B: Learning with 1B Demonstrations for Dexterous Manipulation | Jianglong Ye et.al. | 2506.17198v1 | null |
2025-06-20 | Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres | Samuel Howard et.al. | 2506.17197v1 | null |
2025-06-20 | Detecting LLM-Generated Short Answers and Effects on Learner Performance | Shambhavi Bhushan et.al. | 2506.17196v1 | null |
2025-06-20 | Facial Landmark Visualization and Emotion Recognition Through Neural Networks | Israel Juárez-Jiménez et.al. | 2506.17191v1 | null |
2025-06-20 | Optimal Implicit Bias in Linear Regression | Kanumuri Nithin Varma et.al. | 2506.17187v1 | null |
2025-06-20 | YASMOT: Yet another stereo image multi-object tracker | Ketil Malde et.al. | 2506.17186v1 | null |
2025-06-20 | A Common Pool of Privacy Problems: Legal and Technical Lessons from a Large-Scale Web-Scraped Machine Learning Dataset | Rachel Hong et.al. | 2506.17185v1 | null |
2025-06-20 | Variational Learning of Disentangled Representations | Yuli Slavutsky et.al. | 2506.17182v1 | null |
2025-06-20 | Deep generative models as the probability transformation functions | Vitalii Bondar et.al. | 2506.17171v1 | null |
2025-06-20 | Continual Learning with Columnar Spiking Neural Networks | Denis Larionov et.al. | 2506.17169v1 | null |
2025-06-20 | Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model | Side Liu et.al. | 2506.17162v1 | null |
2025-06-20 | Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation | Qing Xu et.al. | 2506.17159v1 | null |
2025-06-20 | Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity | Samin Yeasar Arnob et.al. | 2506.17155v1 | null |
2025-06-20 | Profile monitoring of random functions with Gaussian process basis expansions | Takayuki Iguchi et.al. | 2506.17153v1 | null |
2025-06-20 | Do We Need Large VLMs for Spotting Soccer Actions? | Ritabrata Chakraborty et.al. | 2506.17144v1 | null |
2025-06-20 | MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification | David Jacob Drexlin et.al. | 2506.17140v1 | null |
2025-06-20 | Consistent Sampling and Simulation: Molecular Dynamics with Energy-Based Diffusion Models | Michael Plainer et.al. | 2506.17139v1 | null |
2025-06-20 | Semi-Supervised Multi-Modal Medical Image Segmentation for Complex Situations | Dongdong Meng et.al. | 2506.17136v1 | null |
2025-06-20 | Robust Training with Data Augmentation for Medical Imaging Classification | Josué Martínez-Martínez et.al. | 2506.17133v1 | null |
2025-06-20 | Chain-of-Trust: A Progressive Trust Evaluation Framework Enabled by Generative AI | Botao Zhu et.al. | 2506.17130v1 | null |
2025-06-20 | Rapid and Continuous Trust Evaluation for Effective Task Collaboration Through Siamese Model | Botao Zhu et.al. | 2506.17128v1 | null |
Graph Neural Network
Graph Neural Network
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-06-20 | VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning | Zhangyang Qi et.al. | 2506.17221v1 | null |
2025-06-20 | Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning | Guozheng Ma et.al. | 2506.17204v1 | null |
2025-06-20 | UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation | Teng Li et.al. | 2506.17202v1 | null |
2025-06-20 | Tensor network calculation of boundary and corner magnetization | Roman Krcmar et.al. | 2506.17194v1 | null |
2025-06-20 | Facial Landmark Visualization and Emotion Recognition Through Neural Networks | Israel Juárez-Jiménez et.al. | 2506.17191v1 | null |
2025-06-20 | On Energy-Efficient Passive Beamforming Design of RIS-Assisted CoMP-NOMA Networks | Muhammad Umer et.al. | 2506.17189v1 | null |
2025-06-20 | How universal is the mean-field universality class for percolation in complex networks? | Lorenzo Cirigliano et.al. | 2506.17175v1 | null |
2025-06-20 | High-accuracy inference using HfO$_x$S$_y$/HfS$_2$ Memristors | Aferdita Xhameni et.al. | 2506.17174v1 | null |
2025-06-20 | Deep generative models as the probability transformation functions | Vitalii Bondar et.al. | 2506.17171v1 | null |
2025-06-20 | Continual Learning with Columnar Spiking Neural Networks | Denis Larionov et.al. | 2506.17169v1 | null |
2025-06-20 | Proportional Sensitivity in Generative Adversarial Network (GAN)-Augmented Brain Tumor Classification Using Convolutional Neural Network | Mahin Montasir Afif et.al. | 2506.17165v1 | null |
2025-06-20 | Analyzing PDFs like Binaries: Adversarially Robust PDF Malware Analysis via Intermediate Representation and Language Model | Side Liu et.al. | 2506.17162v1 | null |
2025-06-20 | JSJ splittings for all Artin groups | Oli Jones et.al. | 2506.17157v1 | null |
2025-06-20 | Robust Training with Data Augmentation for Medical Imaging Classification | Josué Martínez-Martínez et.al. | 2506.17133v1 | null |
2025-06-20 | Chain-of-Trust: A Progressive Trust Evaluation Framework Enabled by Generative AI | Botao Zhu et.al. | 2506.17130v1 | null |
2025-06-20 | Cascade at local yield strain for silica and metallic glass | Nandlal Pingua et.al. | 2506.17129v1 | null |
2025-06-20 | Rapid and Continuous Trust Evaluation for Effective Task Collaboration Through Siamese Model | Botao Zhu et.al. | 2506.17128v1 | null |
2025-06-20 | Identifiability of Deep Polynomial Neural Networks | Konstantin Usevich et.al. | 2506.17093v1 | null |
2025-06-20 | PCG-Informed Neural Solvers for High-Resolution Homogenization of Periodic Microstructures | Yu Xing et.al. | 2506.17087v1 | null |
2025-06-20 | JANUS: Resilient and Adaptive Data Transmission for Enabling Timely and Efficient Cross-Facility Scientific Workflows | Vladislav Esaulov et.al. | 2506.17084v1 | null |
2025-06-20 | Brain-inspired interpretable reservoir computing with resonant recurrent neural networks | Mark A. Kramer et.al. | 2506.17083v1 | null |
2025-06-20 | Matroids, intersecting bases, and Borsuk property | Gyivan López-Campos et.al. | 2506.17082v1 | null |
2025-06-20 | Efficient and faithful reconstruction of dynamical attractors using homogeneous differentiators | Uros Sutulovic et.al. | 2506.17079v1 | null |
2025-06-20 | Neural Polar Decoders for DNA Data Storage | Ziv Aharoni et.al. | 2506.17076v1 | null |
2025-06-20 | Quantum k-SAT Related Hypergraph Problems | Simon-Luca Kremer et.al. | 2506.17066v1 | null |
2025-06-20 | Flow-Based Non-stationary Temporal Regime Causal Structure Learning | Abdellah Rahmani et.al. | 2506.17065v1 | null |
2025-06-20 | Generative Modeling of Full-Atom Protein Conformations using Latent Diffusion on Graph Embeddings | Aditya Sengar et.al. | 2506.17064v1 | null |
2025-06-20 | Client Selection Strategies for Federated Semantic Communications in Heterogeneous IoT Networks | Samer Lahoud et.al. | 2506.17063v1 | null |
2025-06-20 | Relaxed syntax modeling in Transformers for future-proof license plate recognition | Florent Meyer et.al. | 2506.17051v1 | null |
2025-06-20 | Navigating the Deep: Signature Extraction on Deep Neural Networks | Haolin Liu et.al. | 2506.17047v1 | null |