
Transformer


| Publish Date | Title | Authors | PDF | Code |
|---|---|---|---|---|
| 2025-05-23 | Multi-Modal Spectral Parametrization Method (MMSPM) for analyzing EEG activity with distinct scaling regimes | Frigyes Samuel Racz et al. | 2505.18117v1 | null |
| 2025-05-23 | From Temporal to Spatial: Designing Spatialized Interactions with Segmented-audios in Immersive Environments for Active Engagement with Performing Arts Intangible Cultural Heritage | Yuqi Wang et al. | 2505.18112v1 | null |
| 2025-05-23 | Accelerating Learned Image Compression Through Modeling Neural Training Dynamics | Yichi Zhang et al. | 2505.18107v1 | null |
| 2025-05-23 | The Noether formalism for constructing conserved quantities in teleparallel equivalents of general relativity | E. D. Emtsova et al. | 2505.18084v1 | null |
| 2025-05-23 | A Foundation Model Framework for Multi-View MRI Classification of Extramural Vascular Invasion and Mesorectal Fascia Invasion in Rectal Cancer | Yumeng Zhang et al. | 2505.18058v1 | null |
| 2025-05-23 | LookWhere? Efficient Visual Recognition by Learning Where to Look and What to See from Self-Supervision | Anthony Fuller et al. | 2505.18051v1 | null |
| 2025-05-23 | RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration | Sudarshan Rajagopalan et al. | 2505.18047v1 | null |
| 2025-05-23 | A Wavelet-based Stereo Matching Framework for Solving Frequency Convergence Inconsistency | Xiaobao Wei et al. | 2505.18024v1 | null |
| 2025-05-23 | Classification of assembly tasks combining multiple primitive actions using Transformers and xLSTMs | Miguel Neves et al. | 2505.18012v1 | null |
| 2025-05-23 | TRACE for Tracking the Emergence of Semantic Representations in Transformers | Nura Aljaafari et al. | 2505.17998v1 | null |
| 2025-05-23 | Canonical Pose Reconstruction from Single Depth Image for 3D Non-rigid Pose Recovery on Limited Datasets | Fahd Alhamazani et al. | 2505.17992v1 | null |
| 2025-05-23 | ADLGen: Synthesizing Symbolic, Event-Triggered Sensor Sequences for Human Activity Modeling | Weihang You et al. | 2505.17987v1 | null |
| 2025-05-23 | Explainable Anatomy-Guided AI for Prostate MRI: Foundation Models and In Silico Clinical Trials for Virtual Biopsy-based Risk Assessment | Danial Khan et al. | 2505.17971v1 | null |
| 2025-05-23 | SVD-Free Low-Rank Adaptive Gradient Optimization for Large Language Models | Ionut-Vlad Modoranu et al. | 2505.17967v1 | null |
| 2025-05-23 | Understanding Gated Neurons in Transformers from Their Input-Output Functionality | Sebastian Gerstner et al. | 2505.17936v1 | null |
| 2025-05-23 | Selection Mechanisms for Sequence Modeling using Linear State Space Models | Umberto Casti et al. | 2505.17932v1 | null |
| 2025-05-23 | Predicting Length of Stay in Neurological ICU Patients Using Classical Machine Learning and Neural Network Models: A Benchmark Study on MIMIC-IV | Alexander Gabitashvili et al. | 2505.17929v1 | null |
| 2025-05-23 | Language models can learn implicit multi-hop reasoning, but only if they have lots of training data | Yuekun Yao et al. | 2505.17923v1 | null |
| 2025-05-23 | Isospectrality and non-locality of generalized Dirac combs | Giuliano Angelone et al. | 2505.17920v1 | null |
| 2025-05-23 | NeuroTrails: Training with Dynamic Sparse Heads as the Key to Effective Ensembling | Bram Grooten et al. | 2505.17909v1 | null |
| 2025-05-23 | SpectraLDS: Provable Distillation for Linear Dynamical Systems | Devan Shah et al. | 2505.17868v1 | null |
| 2025-05-23 | The emergence of sparse attention: impact of data distribution and benefits of repetition | Nicolas Zucchet et al. | 2505.17863v1 | null |
| 2025-05-23 | Stochastic Weight Sharing for Bayesian Neural Networks | Moule Lin et al. | 2505.17856v1 | null |
| 2025-05-23 | Scaling Recurrent Neural Networks to a Billion Parameters with Zero-Order Optimization | Francois Chaubard et al. | 2505.17852v1 | null |
| 2025-05-23 | TransDF: Time-Series Forecasting Needs Transformed Label Alignment | Hao Wang et al. | 2505.17847v1 | null |
| 2025-05-23 | Continuum Transformers Perform In-Context Learning by Operator Gradient Descent | Abhiti Mishra et al. | 2505.17838v1 | null |
| 2025-05-23 | Hybrid Mamba-Transformer Decoder for Error-Correcting Codes | Shy-el Cohen et al. | 2505.17834v1 | null |
| 2025-05-23 | Low-Resource NMT: A Case Study on the Written and Spoken Languages in Hong Kong | Hei Yi Mak et al. | 2505.17816v1 | null |
| 2025-05-23 | An Attention Infused Deep Learning System with Grad-CAM Visualization for Early Screening of Glaucoma | Ramanathan Swaminathan et al. | 2505.17808v1 | null |
| 2025-05-23 | Temporal Consistency Constrained Transferable Adversarial Attacks with Background Mixup for Action Recognition | Ping Li et al. | 2505.17807v1 | null |