Zhenjun Zhao
ericzzj.bsky.social
Zhenjun Zhao
@ericzzj.bsky.social
ericzzj1989.github.io
PhD from CUHK. 3D vision, SLAM, SfM, Image Matching (https://github.com/ericzzj1989/Awesome-Image-Matching).
Pinned
🎉 Thrilled to share our CVPR 2025 Award Candidate & Oral paper:

🔹 GlobustVP
Convex Relaxation for Robust Vanishing Point Estimation in Manhattan World

🧱 Global optimality
💥 Tolerates up to 70% outliers
⚡ Fast runtime

📄 Paper: arxiv.org/abs/2505.04788

💻 Code: github.com/WU-CVGL/GlobustVP

1/
This work has been accepted to WACV 2026!
Surgical Gaussian Surfels: Highly Accurate Real-time Surgical Scene Rendering

@idris1.bsky.social, @ericzzj.bsky.social, Samuel Schmidgall, Yumeng Wang, Paul Maria Scheikl, Axel Krieger

tl;dr: Gaussian Surfels->dynamic surgical scenes

arxiv.org/abs/2503.04079
November 22, 2025 at 6:24 PM
CuriGS: Curriculum-Guided Gaussian Splatting for Sparse View Synthesis

Zijian Wu, Mingfeng Jiang, Zidian Lin, Ying Song, Hanjie Ma, Qun Wu, Dongping Zhang, Guiyang Pu

tl;dr: real views+multiple perturbation magnitudes->pseudo-views->optimization

arxiv.org/abs/2511.16030
November 22, 2025 at 6:14 PM
SAM 3D: 3Dfy Anything in Images

tl;dr: 3D version of SAM

arxiv.org/abs/2511.16144
November 22, 2025 at 6:14 PM
Reposted by Zhenjun Zhao
RoMa v2: Harder Better Faster Denser Feature Matching
@parskatt.bsky.social et 11 al.

tl;dr: in title.
Predict covariance per-pixel, more datasets, use DINOv3, adjust architecture.

arxiv.org/abs/2511.15706
November 20, 2025 at 9:08 AM
Reposted by Zhenjun Zhao
RoMa v2 is now out! (github.com/Parskatt/rom..., arxiv.org/abs/2511.15706)

Here are the main improvements we made since RoMa:
November 20, 2025 at 9:25 AM
Reposted by Zhenjun Zhao
We’re live! 🚀 Streaming: tinyurl.com/bdtk2nzs
The International Workshop on AI4Robotics by @naverlabseurope
2dys of Spatial AI, SLAM, robot learning, HRI, autonomy
This AM CET: @martinhumenberger.bsky.social @marcpollefeys.bsky.social Andrea Vedaldi Cordelia Schmid & @andrewdavidson.bsky.social ⬇️
November 20, 2025 at 8:40 AM
IBGS: Image-Based Gaussian Splatting

Hoang Chuong Nguyen, Wei Mao, Jose M. Alvarez, Miaomiao Liu

tl;dr: base color from 3DGS rendering and learned residual inferred from nearby training images->pixel color

arxiv.org/abs/2511.14357
November 19, 2025 at 7:53 PM
Co-Me: Confidence-Guided Token Merging for Visual Geometric Transformers

Yutian Chen, @yuhengqiu.bsky.social, Ruogu Li, Ali Agha, Shayegan Omidshafiei, Jay Patrikar, @smash0190.bsky.social

tl;dr: ViT->distillation->per-token confidence->rank tokens->selective merging

arxiv.org/abs/2511.14751
November 19, 2025 at 7:53 PM
Towards Rotation-only Imaging Geometry: Rotation Estimation

Xinrui Li, Qi Cai, Yuanxin Wu

tl;dr: pose-only->decouple translation from rotation->rotation-only; reprojection error on rotation manifold

arxiv.org/abs/2511.12415
November 18, 2025 at 1:30 PM
CloseUpShot: Close-up Novel View Synthesis from Sparse-views via Point-conditioned Diffusion Model

Yuqi Zhang, Guanying Chen, Jiaxing Chen, Chuanyu Fu, Chuan Huang, Shuguang Cui

tl;dr: enhance the quality of conditioning images

arxiv.org/abs/2511.13121
November 18, 2025 at 1:30 PM
OmniVGGT: Omni-Modality Driven Visual Geometry Grounded

Haosong Peng, Hao Li, Yalun Dai, @yushi-lan.bsky.social, Yihang Luo, Tianyu Qi, Zhengshen Zhang, Yufeng Zhan, Junfei Zhang, Wenchao Xu, Ziwei Liu

tl;dr: depth and camera intrinsics/extrinsics->VGGT

arxiv.org/abs/2511.10560
November 14, 2025 at 3:11 PM
SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields

Sangheon Yang, Yeongin Yoon, Hong Mo Jung, Jongwoo Lim

tl;dr: sparse optical flow->linear and angular velocity; generalized 3D ray-based motion field->different camera models

arxiv.org/abs/2511.09072
November 13, 2025 at 11:30 AM
OUGS: Active View Selection via Object-aware Uncertainty Estimation in 3DGS

Haiyi Li, Qi Chen, Denis Kalkofen, Hsiang-Ting Chen

tl;dr: Gaussian parameters->covariance->diagonal Fisher Information Matrix->uncertainty

arxiv.org/abs/2511.09397
November 13, 2025 at 11:29 AM
ConeGS: Error-Guided Densification Using Pixel Cones for Improved Reconstruction with Fewer Primitives

Bartłomiej Baranowski, @s-esposito.bsky.social, @pgschossmann.bsky.social, @apchen.bsky.social, @andreasgeiger.bsky.social

arxiv.org/abs/2511.06810
November 11, 2025 at 3:42 PM
YoNoSplat: You Only Need One Model for Feedforward 3D Gaussian Splatting

Botao Ye, Boqi Chen, @haofeixu.bsky.social, @danielbarath.bsky.social, @marcpollefeys.bsky.social

arxiv.org/abs/2511.07321
November 11, 2025 at 3:39 PM
4D3R: Motion-Aware Neural Reconstruction and Rendering of Dynamic Scenes from Monocular Videos

Mengqi Guo, Bo Xu, Yanyan Li, Gim Hee Lee

tl;dr: joint optimization of motion mask and scene reconstruction

arxiv.org/abs/2511.05229
November 10, 2025 at 2:39 PM
FastGS: Training 3D Gaussian Splatting in 100 Seconds

Shiwei Ren, Tianci Wen, Yongchun Fang, Biao Lu

tl;dr: multi-view consistency->densification and pruning

arxiv.org/abs/2511.04283
November 7, 2025 at 1:51 PM
Cycle-Sync: Robust Global Camera Pose Estimation through Enhanced Cycle-Consistent Synchronization

Shaohan Li, Yunpeng Shi, Gilad Lerman

tl;dr: translation averaging->Welsch loss->Message-Passing Least Squares with distance-based cycle-consistency

arxiv.org/abs/2511.02329
November 5, 2025 at 1:00 PM
TurboMap: GPU-Accelerated Local Mapping for Visual SLAM

Parsa Hosseininejad, Kimia Khabiri, Shishir Gopinath, Soudabeh Mohammadhashemi, Karthik Dantu, Steven Y. Ko

tl;dr: GPU->triangulation & map point fusion & local BA; CPU->keyframe culling

arxiv.org/abs/2511.02036
November 5, 2025 at 12:59 PM
WildfireX-SLAM: A Large-scale Low-altitude RGB-D Dataset for Wildfire SLAM and Beyond

Zhicong Sun, Jacqueline Lo, Jinxing Hu

tl;dr: synthetic dataset

arxiv.org/abs/2510.27133
November 4, 2025 at 4:33 PM
Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process

Jiayi Chen, Wenxuan Song, Pengxiang Ding, Ziyang Zhou, Han Zhao, Feilong Tang, Donglin Wang, Haoang Li

tl;dr: multiple modalities->single synchronous denoising trajectory

arxiv.org/abs/2511.01718
November 4, 2025 at 4:32 PM
JOGS: Joint Optimization of Pose Estimation and 3D Gaussian Splatting

Yuxuan Li, Tao Wang, Xianben Yang

tl;dr: Lucas-Kanade 3D optical flow+reprojection errors->poses; standard differentiable rendering3DGS parameters

arxiv.org/abs/2510.26117
October 31, 2025 at 8:39 AM
PointSt3R: Point Tracking through 3D Grounded Correspondence

Rhodri Guerrier, Adam W. Harley, @dimadamen.bsky.social

tl;dr: fine-tune MASt3R with point matching loss and visibility head to handle dynamic scenes

arxiv.org/abs/2510.26443
October 31, 2025 at 8:39 AM
The Impact and Outlook of 3D Gaussian Splatting

Bernhard Kerbl

tl;dr: in title

arxiv.org/abs/2510.26694
October 31, 2025 at 8:36 AM
STG-Avatar: Animatable Human Avatars via Spacetime Gaussian

Guangan Jiang, Tianzi Zhang, Dong Li, @ericzzj.bsky.social, Haoang Li, Mingrui Li, Hongyu Wang

tl;dr: 3DGS-based framework for high-fidelity animatable human avatar reconstruction

arxiv.org/abs/2510.22140
October 30, 2025 at 8:34 AM