arxiv cs.CV
banner
arxiv-cs-cv.bsky.social
arxiv cs.CV
@arxiv-cs-cv.bsky.social
Computer Science -- Computer Vision and Pattern Recognition (cs.CV)

source: export.arxiv.org/rss/cs.CV
maintainer: @tmaehara.bsky.social
Wei Zhang, Yeying Jin, Xin Li, Yan Zhang, Xiaofeng Cong, Cong Wang, Fengcai Qiao, zhichao Lian
UniFit: Towards Universal Virtual Try-on with MLLM-Guided Semantic Alignment
https://arxiv.org/abs/2511.15831
November 21, 2025 at 10:04 AM
Chengxi Zeng, Yuxuan Jiang, Aaron Zhang
EfficientSAM3: Progressive Hierarchical Distillation for Video Concept Segmentation from SAM1, 2, and 3
https://arxiv.org/abs/2511.15833
November 21, 2025 at 10:04 AM
Sajjad Pakdamansavoji, Yintao Ma, Amir Rasouli, Tongtong Cao
WALDO: Where Unseen Model-based 6D Pose Estimation Meets Occlusion
https://arxiv.org/abs/2511.15874
November 21, 2025 at 9:59 AM
Lukas Arzoumanidis, Julius Knechtel, Jan-Henrik Haunert, Youness Dehbi
Automatic Uncertainty-Aware Synthetic Data Bootstrapping for Historical Map Segmentation
https://arxiv.org/abs/2511.15875
November 21, 2025 at 9:58 AM
Yintao Ma, Sajjad Pakdamansavoji, Amir Rasouli, Tongtong Cao
Box6D : Zero-shot Category-level 6D Pose Estimation of Warehouse Boxes
https://arxiv.org/abs/2511.15884
November 21, 2025 at 9:58 AM
Meilong Xu, Di Fu, Jiaxing Zhang, Gong Yu, Jiayu Zheng, Xiaoling Hu, Dongdi Zhao, Feiyang Li, Chao Chen, Yong Cao
RB-FT: Rationale-Bootstrapped Fine-Tuning for Video Classification
https://arxiv.org/abs/2511.15923
November 21, 2025 at 9:57 AM
Zihan Li, Yiqing Wang, Sina Farsiu, Paul Kinahan
Boosting Medical Visual Understanding From Multi-Granular Language Learning
https://arxiv.org/abs/2511.15943
November 21, 2025 at 9:57 AM
Milos Vukadinovic, Hirotaka Ieki, Yuki Sahasi, David Ouyang, Bryan He
Automated Interpretable 2D Video Extraction from 3D Echocardiography
https://arxiv.org/abs/2511.15946
November 21, 2025 at 9:56 AM
Raphael Ruschel, Hardikkumar Prajapati, Awsafur Rahman, B. S. Manjunath
Click2Graph: Interactive Panoptic Video Scene Graphs from a Single Click
https://arxiv.org/abs/2511.15948
November 21, 2025 at 9:54 AM
Muyao Yuan, Yuanhong Zhang, Weizhan Zhang, Lan Ma, Yuan Gao, Jiangyong Ying, Yudeng Xin
InfoCLIP: Bridging Vision-Language Pretraining and Open-Vocabulary Semantic Segmentation via Information-Theoretic Alignment Transfer
https://arxiv.org/abs/2511.15967
November 21, 2025 at 9:53 AM
Jingru Zhang, Saed Moradi, Ashirbani Saha
Externally Validated Multi-Task Learning via Consistency Regularization Using Differentiable BI-RADS Features for Breast Ultrasound Tumor Segmentation
https://arxiv.org/abs/2511.15968
November 21, 2025 at 9:53 AM
Xinyu Nan, Lingtao Mao, Huangyu Dai, Zexin Zheng, Xinyu Sun, Zihan Liang, Ben Chen, Yuqing Ding, Chenyi Lei, Wenwu Ou, Han Li
UniDGF: A Unified Detection-to-Generation Framework for Hierarchical Object Visual Recognition
https://arxiv.org/abs/2511.15984
November 21, 2025 at 9:52 AM
Dawei Li, Zijian Gu, Peng Wang, Chuhan Song, Zhen Tan, Mohan Zhang, Tianlong Chen, Yu Tian, Song Wang
Fairness in Multi-modal Medical Diagnosis with Demonstration Selection
https://arxiv.org/abs/2511.15986
November 21, 2025 at 9:52 AM
Nimeshika Udayangani, Hadi M. Dolatabadi, Sarah Erfani, Christopher Leckie
Exploiting Inter-Sample Information for Long-tailed Out-of-Distribution Detection
https://arxiv.org/abs/2511.16015
November 21, 2025 at 9:51 AM
Dingkun Zhou, Patrick P. K. Chan, Hengxu Wu, Shikang Zheng, Ruiqi Huang, Yuanjie Zhao
Physically Realistic Sequence-Level Adversarial Clothing for Robust Human-Detection Evasion
https://arxiv.org/abs/2511.16020
November 21, 2025 at 9:49 AM
Xiao He, Zhijun Tu, Kun Cheng, Mingrui Zhu, Jie Hu, Nannan Wang, Xinbo Gao
Mixture of Ranks with Degradation-Aware Routing for One-Step Real-World Image Super-Resolution
https://arxiv.org/abs/2511.16024
November 21, 2025 at 9:49 AM
Mohamed Abdallah Salem, Hamdy Ahmed Ashur, Ahmed Elshinnawy
Towards a Safer and Sustainable Manufacturing Process: Material classification in Laser Cutting Using Deep Learning
https://arxiv.org/abs/2511.16026
November 21, 2025 at 9:48 AM
Zijian Wu, Mingfeng Jiang, Zidian Lin, Ying Song, Hanjie Ma, Qun Wu, Dongping Zhang, Guiyang Pu
CuriGS: Curriculum-Guided Gaussian Splatting for Sparse View Synthesis
https://arxiv.org/abs/2511.16030
November 21, 2025 at 9:48 AM
Timilehin T. Ayanlade, Anirudha Powadi, Talukder Z. Jubery, Baskar Ganapathysubramanian, Soumik Sarkar
Crossmodal learning for Crop Canopy Trait Estimation
https://arxiv.org/abs/2511.16031
November 21, 2025 at 9:47 AM
Qing Wang, Chong-Wah Ngo, Ee-Peng Lim, Qianru Sun
LLMs-based Augmentation for Domain Adaptation in Long-tailed Food Datasets
https://arxiv.org/abs/2511.16037
November 21, 2025 at 9:47 AM
Boxun Xu, Yu Wang, Zihu Wang, Peng Li
AMS-KV: Adaptive KV Caching in Multi-Scale Visual Autoregressive Transformers
https://arxiv.org/abs/2511.16047
November 21, 2025 at 9:37 AM
Pei Liu, Songtao Wang, Lang Zhang, Xingyue Peng, Yuandong Lyu, Jiaxin Deng, Songxin Lu, Weiliang Ma, Xueyang Zhang, Yifei Zhan, XianPeng Lang, Jun Ma
LiSTAR: Ray-Centric World Models for 4D LiDAR Sequences in Autonomous Driving
https://arxiv.org/abs/2511.16049
November 21, 2025 at 9:36 AM
Zishan Xu, Yifu Guo, Yuquan Lu, Fengyu Yang, Junxin Li
VideoSeg-R1:Reasoning Video Object Segmentation via Reinforcement Learning
https://arxiv.org/abs/2511.16077
November 21, 2025 at 9:36 AM
Meihua Zhou, Liping Yu, Jiawei Cai, Wai Kin Fung, Ruiguo Hu, Jiarui Zhao, Wenzhuo Liu, Nan Wan
SpectralTrain: A Universal Framework for Hyperspectral Image Classification
https://arxiv.org/abs/2511.16084
November 21, 2025 at 9:35 AM
Renxiang Xiao, Wei Liu, Yuanfan Zhang, Yushuai Chen, Jinming Chen, Zilu Wang, Liang Hu
Rad-GS: Radar-Vision Integration for 3D Gaussian Splatting SLAM in Outdoor Environments
https://arxiv.org/abs/2511.16091
November 21, 2025 at 9:35 AM