arXiv cs.CV Computer Vision and Pattern Recognition
cscv-bot.bsky.social
arXiv cs.CV Computer Vision and Pattern Recognition
@cscv-bot.bsky.social
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Lee, Jung, Chun, Lee, Cai, Huang, Talreja, Dao, Liang, Huang, Huang: TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos https://arxiv.org/abs/2511.21690 https://arxiv.org/pdf/2511.21690 https://arxiv.org/html/2511.21690
November 27, 2025 at 6:35 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Lorenzo Shaikewitz, Charis Georgiou, Luca Carlone: Uncertainty Quantification for Visual Object Pose Estimation https://arxiv.org/abs/2511.21666 https://arxiv.org/pdf/2511.21666 https://arxiv.org/html/2511.21666
November 27, 2025 at 6:35 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Anantha Padmanaban Krishna Kumar (Boston University): Mechanisms of Non-Monotonic Scaling in Vision Transformers https://arxiv.org/abs/2511.21635 https://arxiv.org/pdf/2511.21635 https://arxiv.org/html/2511.21635
November 27, 2025 at 6:34 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Husne Ara Rubaiyeat, Hasan Mahmud, Md Kamrul Hasan: Bangla Sign Language Translation: Dataset Creation Challenges, Benchmarking and Prospects https://arxiv.org/abs/2511.21533 https://arxiv.org/pdf/2511.21533 https://arxiv.org/html/2511.21533
November 27, 2025 at 6:30 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Islam, Hossen, Arif, Al Noman, Rahman: BanglaMM-Disaster: A Multimodal Transformer-Based Deep Learning Framework for Multiclass Disaster Classification in Bangla https://arxiv.org/abs/2511.21364 https://arxiv.org/pdf/2511.21364 https://arxiv.org/html/2511.21364
November 27, 2025 at 6:34 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Yicheng Zhong, Peiji Yang, Zhisheng Wang: Multi-Reward GRPO for Stable and Prosodic Single-Codebook TTS LLMs at Scale https://arxiv.org/abs/2511.21270 https://arxiv.org/pdf/2511.21270 https://arxiv.org/html/2511.21270
November 27, 2025 at 6:35 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Xinyue Guo, Xiaoran Yang, Lipan Zhang, Jianxuan Yang, Zhao Wang, Jian Luan: AV-Edit: Multimodal Generative Sound Effect Editing via Audio-Visual Semantic Joint Control https://arxiv.org/abs/2511.21146 https://arxiv.org/pdf/2511.21146 https://arxiv.org/html/2511.21146
November 27, 2025 at 6:33 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Taejun Kim, Amy Karlson, Aakar Gupta, Tovi Grossman, Jason Wu, Parastoo Abtahi, Christopher Collins, Michael Glueck, Hemant Bhaskar Surale: STAR: Smartphone-analogous Typing in Augmented Reality https://arxiv.org/abs/2511.21143 https://arxiv.org/pdf/2511.21143 https://arxiv.org/html/2511.21143
November 27, 2025 at 6:32 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Chen, Guo, Chu, Luo, Shen, Sun, Hu, Xie, Yang, Shi, Gu, Liu, Han, Wu, Xu, Zhang: SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation https://arxiv.org/abs/2511.21135 https://arxiv.org/pdf/2511.21135 https://arxiv.org/html/2511.21135
November 27, 2025 at 6:34 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Chujie Wang, Jianyu Lu, Zhiyuan Luo, Xi Chen, Chu He: OVOD-Agent: A Markov-Bandit Framework for Proactive Visual Reasoning and Self-Evolving Detection https://arxiv.org/abs/2511.21064 https://arxiv.org/pdf/2511.21064 https://arxiv.org/html/2511.21064
November 27, 2025 at 6:29 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Chen, Liang, Guan, Sun, Zhao, Jiang, Huang, Ding, Han: AerialMind: Towards Referring Multi-Object Tracking in UAV Scenarios https://arxiv.org/abs/2511.21053 https://arxiv.org/pdf/2511.21053 https://arxiv.org/html/2511.21053
November 27, 2025 at 6:34 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Dinanath Padhya, Krishna Acharya, Bipul Kumar Dahal, Dinesh Baniya Kshatri: CNN-LSTM Hybrid Architecture for Over-the-Air Automatic Modulation Classification Using SDR https://arxiv.org/abs/2511.21040 https://arxiv.org/pdf/2511.21040 https://arxiv.org/html/2511.21040
November 27, 2025 at 6:33 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Chicago Y. Park, Michael T. McCann, Cristina Garcia-Cardona, Brendt Wohlberg, Ulugbek S. Kamilov: Deep Parameter Interpolation for Scalar Conditioning https://arxiv.org/abs/2511.21028 https://arxiv.org/pdf/2511.21028 https://arxiv.org/html/2511.21028
November 27, 2025 at 6:36 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Taehoon Kang, Taeyong Kim: Probabilistic Wildfire Spread Prediction Using an Autoregressive Conditional Generative Adversarial Network https://arxiv.org/abs/2511.21019 https://arxiv.org/pdf/2511.21019 https://arxiv.org/html/2511.21019
November 27, 2025 at 6:33 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Wang, Huang, Zhou, Yin, Bao, Lyu, Liu, Zhang, Wu, Fei-Fei, Li: ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction https://arxiv.org/abs/2511.20937 https://arxiv.org/pdf/2511.20937 https://arxiv.org/html/2511.20937
November 27, 2025 at 6:29 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Biagio La Rosa, Leilani H. Gilpin: Guaranteed Optimal Compositional Explanations for Neurons https://arxiv.org/abs/2511.20934 https://arxiv.org/pdf/2511.20934 https://arxiv.org/html/2511.20934
November 27, 2025 at 6:29 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Xiaojiao Xiao, Qinmin Vivian Hu, Tae Hyun Kim, Guanghui Wang: Adversarial Multi-Task Learning for Liver Tumor Segmentation, Dynamic Enhancement Regression, and Classification https://arxiv.org/abs/2511.20793 https://arxiv.org/pdf/2511.20793 https://arxiv.org/html/2511.20793
November 27, 2025 at 6:36 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Thomas Norrenbrock, Timo Kaiser, Sovan Biswas, Neslihan Kose, Ramesh Manuvinakurike, Bodo Rosenhahn: CHiQPM: Calibrated Hierarchical Interpretable Image Classification https://arxiv.org/abs/2511.20779 https://arxiv.org/pdf/2511.20779 https://arxiv.org/html/2511.20779
November 27, 2025 at 6:33 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Megahed, Abou-Alwan, Fuller, Demellawy, Hawken, Chan: Automated Histopathologic Assessment of Hirschsprung Disease Using a Multi-Stage Vision Transformer Framework https://arxiv.org/abs/2511.20734 https://arxiv.org/pdf/2511.20734 https://arxiv.org/html/2511.20734
November 27, 2025 at 6:51 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Ziyuan Gao, Philippe Morel: Prompt-Aware Adaptive Elastic Weight Consolidation for Continual Learning in Medical Vision-Language Models https://arxiv.org/abs/2511.20732 https://arxiv.org/pdf/2511.20732 https://arxiv.org/html/2511.20732
November 27, 2025 at 6:33 AM
Reposted by arXiv cs.CV Computer Vision and Pattern Recognition
Nelson H. T. Lemes, Jos\'e Claudinei Ferreira, Higor V. M. Ferreira: A Fractional Variational Approach to Spectral Filtering Using the Fourier Transform https://arxiv.org/abs/2511.20675 https://arxiv.org/pdf/2511.20675 https://arxiv.org/html/2511.20675
November 27, 2025 at 6:36 AM
Dalva, Qian, Goldenberg, Chen, Aberman, Tulyakov, Yanardag, Wang: Canvas-to-Image: Compositional Image Generation with Multimodal Controls https://arxiv.org/abs/2511.21691 https://arxiv.org/pdf/2511.21691 https://arxiv.org/html/2511.21691
November 27, 2025 at 6:32 AM
Hu, Lin, Long, Ran, Jiang, Wang, Zhu, Xu, Wang, Pang: G$^2$VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning https://arxiv.org/abs/2511.21688 https://arxiv.org/pdf/2511.21688 https://arxiv.org/html/2511.21688
November 27, 2025 at 6:32 AM
Zihui Xue, Kristen Grauman, Dima Damen, Andrew Zisserman, Tengda Han: Seeing without Pixels: Perception from Camera Trajectories https://arxiv.org/abs/2511.21681 https://arxiv.org/pdf/2511.21681 https://arxiv.org/html/2511.21681
November 27, 2025 at 6:32 AM
Pandiyaraju V, Sreya Mynampati, Abishek Karthik, Poovarasan L, D. Saraswathi: Revolutionizing Glioma Segmentation & Grading Using 3D MRI - Guided Hybrid Deep Learning Models https://arxiv.org/abs/2511.21673 https://arxiv.org/pdf/2511.21673 https://arxiv.org/html/2511.21673
November 27, 2025 at 6:32 AM