Semantic Segmentation

Publish Date	Title	Authors	PDF	Code
2025-07-03	Subtyping in DHOL -- Extended preprint	Colin Rothgang et.al.	2507.02855v1	null
2025-07-03	Legal Requirements Translation from Law	Anmol Singhal et.al.	2507.02846v1	null
2025-07-03	Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection	Ziqi Miao et.al.	2507.02844v1	null
2025-07-03	Confidence-driven Gradient Modulation for Multimodal Human Activity Recognition: A Dynamic Contrastive Dual-Path Learning Approach	Panpan Ji et.al.	2507.02826v1	null
2025-07-03	LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion	Fangfu Liu et.al.	2507.02813v1	null
2025-07-03	No time to train! Training-Free Reference-Based Instance Segmentation	Miguel Espinosa et.al.	2507.02798v1	null
2025-07-03	From Long Videos to Engaging Clips: A Human-Inspired Video Editing Framework with Multimodal Narrative Understanding	Xiangfeng Wang et.al.	2507.02790v1	null
2025-07-03	From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images	Danrong Zhang et.al.	2507.02781v1	null
2025-07-03	A Proof-Theoretic View of Basic Intuitionistic Conditional Logic (Extended Version)	Tiziano Dalmonte et.al.	2507.02767v1	null
2025-07-03	DexVLG: Dexterous Vision-Language-Grasp Model at Scale	Jiawei He et.al.	2507.02747v1	null
2025-07-03	Prompt learning with bounding box constraints for medical image segmentation	Mélanie Gaillochet et.al.	2507.02743v1	null
2025-07-03	SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment	Qi Xu et.al.	2507.02705v1	null
2025-07-03	Integrating path-planning and control for robotic unicycles	Máté B. Vizi et.al.	2507.02700v1	null
2025-07-03	APT: Adaptive Personalized Training for Diffusion Models with Limited Data	JungWoo Chae et.al.	2507.02687v1	null
2025-07-03	MEGANet-W: A Wavelet-Driven Edge-Guided Attention Framework for Weak Boundary Polyp Detection	Zhe Yee Tan et.al.	2507.02668v1	null
2025-07-03	AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models	Ziyin Zhou et.al.	2507.02664v1	null
2025-07-03	VRAgent-R1: Boosting Video Recommendation with MLLM-based Agents via Reinforcement Learning	Siran Chen et.al.	2507.02626v1	null
2025-07-03	ArtGS: 3D Gaussian Splatting for Interactive Visual-Physical Modeling and Manipulation of Articulated Objects	Qiaojun Yu et.al.	2507.02600v1	null
2025-07-03	Structure-aware Semantic Discrepancy and Consistency for 3D Medical Image Self-supervised Learning	Tan Pan et.al.	2507.02581v1	null
2025-07-03	Parametric shape models for vessels learned from segmentations via differentiable voxelization	Alina F. Dima et.al.	2507.02576v1	null
2025-07-03	Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning	Buzhen Huang et.al.	2507.02565v1	null
2025-07-03	Multi-Utterance Speech Separation and Association Trained on Short Segments	Yuzhu Wang et.al.	2507.02562v1	null
2025-07-03	Clarifying Before Reasoning: A Coq Prover with Structural Context	Yanzhen Lu et.al.	2507.02541v1	null
2025-07-03	Open-Source System for Multilingual Translation and Cloned Speech Synthesis	Mateo Cámara et.al.	2507.02530v1	null
2025-07-03	MedFormer: Hierarchical Medical Vision Transformer with Content-Aware Dual Sparse Selection Attention	Zunhui Xia et.al.	2507.02488v1	null
2025-07-03	On the width and profiles of cosmic filaments	Qi-Rui Yang et.al.	2507.02476v1	null
2025-07-03	Optimisation of amplification and gas mixture for directional Dark Matter searches with the CYGNO/INITIUM project	Giorgio Dho et.al.	2507.02474v1	null
2025-07-03	Resolving CAP Through Automata-Theoretic Economic Design: A Unified Mathematical Framework for Real-Time Partition-Tolerant Systems	Craig S Wright et.al.	2507.02464v1	null
2025-07-03	Weakly-supervised Contrastive Learning with Quantity Prompts for Moving Infrared Small Target Detection	Weiwei Duan et.al.	2507.02454v1	null
2025-07-03	Network structural change point detection and reconstruction for balanced neuronal networks	Kai Chen et.al.	2507.02450v1	null