Zhenjun Zhao
ericzzj.bsky.social
ericzzj1989.github.io
PhD from CUHK. 3D vision, SLAM, SfM, Image Matching (https://github.com/ericzzj1989/Awesome-Image-Matching).
CuriGS: Curriculum-Guided Gaussian Splatting for Sparse View Synthesis

Zijian Wu, Mingfeng Jiang, Zidian Lin, Ying Song, Hanjie Ma, Qun Wu, Dongping Zhang, Guiyang Pu

tl;dr: real views+multiple perturbation magnitudes->pseudo-views->optimization

arxiv.org/abs/2511.16030
November 22, 2025 at 6:14 PM
SAM 3D: 3Dfy Anything in Images

Xingyu Chen, Fu-Jen Chu, Pierre Gleize, Kevin J Liang, Alexander Sax, Hao Tang, Weiyao Wang, Michelle Guo, Thibaut Hardin, Xiang Li, Aohan Lin, Jiawei Liu, Ziqi Ma, Anushka Sagar, Bowen Song, Xiaodong Wang, Jianing Yang, Bowen Zhang, Piotr Dollár, Georgia Gkioxari, Matt Feiszli, Jitendra Malik

tl;dr: 3D version of SAM

arxiv.org/abs/2511.16144
November 22, 2025 at 6:14 PM
IBGS: Image-Based Gaussian Splatting

Hoang Chuong Nguyen, Wei Mao, Jose M. Alvarez, Miaomiao Liu

tl;dr: base color from 3DGS rendering and learned residual inferred from nearby training images->pixel color

arxiv.org/abs/2511.14357
November 19, 2025 at 7:53 PM
Co-Me: Confidence-Guided Token Merging for Visual Geometric Transformers

Yutian Chen, @yuhengqiu.bsky.social, Ruogu Li, Ali Agha, Shayegan Omidshafiei, Jay Patrikar, @smash0190.bsky.social

tl;dr: ViT->distillation->per-token confidence->rank tokens->selective merging

arxiv.org/abs/2511.14751
November 19, 2025 at 7:53 PM
Towards Rotation-only Imaging Geometry: Rotation Estimation

Xinrui Li, Qi Cai, Yuanxin Wu

tl;dr: pose-only->decouple translation from rotation->rotation-only; reprojection error on rotation manifold

arxiv.org/abs/2511.12415
November 18, 2025 at 1:30 PM
CloseUpShot: Close-up Novel View Synthesis from Sparse-views via Point-conditioned Diffusion Model

Yuqi Zhang, Guanying Chen, Jiaxing Chen, Chuanyu Fu, Chuan Huang, Shuguang Cui

tl;dr: enhance the quality of conditioning images

arxiv.org/abs/2511.13121
November 18, 2025 at 1:30 PM
OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer

Haosong Peng, Hao Li, Yalun Dai, @yushi-lan.bsky.social, Yihang Luo, Tianyu Qi, Zhengshen Zhang, Yufeng Zhan, Junfei Zhang, Wenchao Xu, Ziwei Liu

tl;dr: depth and camera intrinsics/extrinsics->VGGT

arxiv.org/abs/2511.10560
November 14, 2025 at 3:11 PM
SMF-VO: Direct Ego-Motion Estimation via Sparse Motion Fields

Sangheon Yang, Yeongin Yoon, Hong Mo Jung, Jongwoo Lim

tl;dr: sparse optical flow->linear and angular velocity; generalized 3D ray-based motion field->different camera models

arxiv.org/abs/2511.09072
November 13, 2025 at 11:30 AM
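
For readers unfamiliar with ray-based motion fields, here is a minimal sketch of the textbook relation between camera velocity and the apparent motion of a unit bearing ray, assuming a static scene point at a known range. It is not taken from the SMF-VO paper, whose exact formulation may differ; the function name `ray_motion_field` and its parameters are hypothetical. Working with bearing rays rather than pixel coordinates is what keeps the relation independent of the camera model.

```python
def cross(a, b):
    """Cross product of two 3-vectors."""
    return [
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    ]


def dot(a, b):
    """Dot product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))


def ray_motion_field(bearing, distance, lin_vel, ang_vel):
    """Time derivative of a unit bearing ray b toward a static point.

    Textbook model (an assumption here, not the paper's code):
        b_dot = -(I - b b^T) v / d  -  w x b
    where v and w are the camera's linear and angular velocity in the
    camera frame and d is the distance to the point along the ray.
    """
    bv = dot(bearing, lin_vel)
    # Component of v perpendicular to the ray: (I - b b^T) v
    v_perp = [lin_vel[i] - bearing[i] * bv for i in range(3)]
    w_x_b = cross(ang_vel, bearing)
    return [-v_perp[i] / distance - w_x_b[i] for i in range(3)]


# Camera translating along +x while looking down +z at a point 2 m away.
flow_translation = ray_motion_field([0.0, 0.0, 1.0], 2.0, [1.0, 0.0, 0.0], [0.0, 0.0, 0.0])

# Camera rotating about +y with no translation: the flow does not depend on distance.
flow_rotation = ray_motion_field([0.0, 0.0, 1.0], 2.0, [0.0, 0.0, 0.0], [0.0, 1.0, 0.0])
```

Because the relation is linear in the two velocities, a set of sparse ray flows can in principle be stacked into a linear system for them, which fits the "sparse optical flow->linear and angular velocity" summary.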
OUGS: Active View Selection via Object-aware Uncertainty Estimation in 3DGS

Haiyi Li, Qi Chen, Denis Kalkofen, Hsiang-Ting Chen

tl;dr: Gaussian parameters->covariance->diagonal Fisher Information Matrix->uncertainty

arxiv.org/abs/2511.09397
November 13, 2025 at 11:29 AM
ConeGS: Error-Guided Densification Using Pixel Cones for Improved Reconstruction with Fewer Primitives

Bartłomiej Baranowski, @s-esposito.bsky.social, @pgschossmann.bsky.social, @apchen.bsky.social, @andreasgeiger.bsky.social

arxiv.org/abs/2511.06810
November 11, 2025 at 3:42 PM
YoNoSplat: You Only Need One Model for Feedforward 3D Gaussian Splatting

Botao Ye, Boqi Chen, @haofeixu.bsky.social, @danielbarath.bsky.social, @marcpollefeys.bsky.social

arxiv.org/abs/2511.07321
November 11, 2025 at 3:39 PM
4D3R: Motion-Aware Neural Reconstruction and Rendering of Dynamic Scenes from Monocular Videos

Mengqi Guo, Bo Xu, Yanyan Li, Gim Hee Lee

tl;dr: joint optimization of motion mask and scene reconstruction

arxiv.org/abs/2511.05229
November 10, 2025 at 2:39 PM
FastGS: Training 3D Gaussian Splatting in 100 Seconds

Shiwei Ren, Tianci Wen, Yongchun Fang, Biao Lu

tl;dr: multi-view consistency->densification and pruning

arxiv.org/abs/2511.04283
November 7, 2025 at 1:51 PM
Cycle-Sync: Robust Global Camera Pose Estimation through Enhanced Cycle-Consistent Synchronization

Shaohan Li, Yunpeng Shi, Gilad Lerman

tl;dr: translation averaging->Welsch loss->Message-Passing Least Squares with distance-based cycle-consistency

arxiv.org/abs/2511.02329
November 5, 2025 at 1:00 PM
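
As background on the Welsch loss mentioned above, here is a minimal sketch of one common parameterization and the weight it induces in iteratively reweighted least squares. The scale parameter `c` and the function names are assumptions for illustration; the paper's scaling and its use inside translation averaging are not reproduced here.

```python
import math


def welsch_loss(residual, c=1.0):
    """Welsch robust loss: rho(r) = (c^2 / 2) * (1 - exp(-(r / c)^2)).

    Behaves like r^2 / 2 for small residuals and saturates at c^2 / 2,
    so gross outliers contribute a bounded cost.
    """
    return 0.5 * c * c * (1.0 - math.exp(-((residual / c) ** 2)))


def welsch_weight(residual, c=1.0):
    """IRLS weight rho'(r) / r = exp(-(r / c)^2) for the loss above."""
    return math.exp(-((residual / c) ** 2))


# An inlier keeps a weight near 1; a large outlier is almost ignored.
inlier_weight = welsch_weight(0.1, c=1.0)
outlier_weight = welsch_weight(5.0, c=1.0)
```

The rapidly decaying weight is what makes the loss attractive for averaging problems with many corrupted pairwise measurements.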
TurboMap: GPU-Accelerated Local Mapping for Visual SLAM

Parsa Hosseininejad, Kimia Khabiri, Shishir Gopinath, Soudabeh Mohammadhashemi, Karthik Dantu, Steven Y. Ko

tl;dr: GPU->triangulation & map point fusion & local BA; CPU->keyframe culling

arxiv.org/abs/2511.02036
November 5, 2025 at 12:59 PM
WildfireX-SLAM: A Large-scale Low-altitude RGB-D Dataset for Wildfire SLAM and Beyond

Zhicong Sun, Jacqueline Lo, Jinxing Hu

tl;dr: synthetic dataset

arxiv.org/abs/2510.27133
November 4, 2025 at 4:33 PM
Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process

Jiayi Chen, Wenxuan Song, Pengxiang Ding, Ziyang Zhou, Han Zhao, Feilong Tang, Donglin Wang, Haoang Li

tl;dr: multiple modalities->single synchronous denoising trajectory

arxiv.org/abs/2511.01718
November 4, 2025 at 4:32 PM
JOGS: Joint Optimization of Pose Estimation and 3D Gaussian Splatting

Yuxuan Li, Tao Wang, Xianben Yang

tl;dr: Lucas-Kanade 3D optical flow+reprojection errors->poses; standard differentiable rendering->3DGS parameters

arxiv.org/abs/2510.26117
October 31, 2025 at 8:39 AM
PointSt3R: Point Tracking through 3D Grounded Correspondence

Rhodri Guerrier, Adam W. Harley, @dimadamen.bsky.social

tl;dr: fine-tune MASt3R with point matching loss and visibility head to handle dynamic scenes

arxiv.org/abs/2510.26443
October 31, 2025 at 8:39 AM
The Impact and Outlook of 3D Gaussian Splatting

Bernhard Kerbl

tl;dr: in title

arxiv.org/abs/2510.26694
October 31, 2025 at 8:36 AM
STG-Avatar: Animatable Human Avatars via Spacetime Gaussian

Guangan Jiang, Tianzi Zhang, Dong Li, @ericzzj.bsky.social, Haoang Li, Mingrui Li, Hongyu Wang

tl;dr: 3DGS-based framework for high-fidelity animatable human avatar reconstruction

arxiv.org/abs/2510.22140
October 30, 2025 at 8:34 AM
Epipolar Geometry Improves Video Generation Models

Orest Kupyn, Fabian Manhardt, Federico Tombari, Christian Rupprecht

tl;dr: Wan->diverse videos->Sampson epipolar error->relative reward signals->Flow-DPO->video generation rankings->3D-consistent videos

arxiv.org/abs/2510.21615
October 30, 2025 at 8:31 AM
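
For context on the Sampson epipolar error named above, here is a minimal sketch of the standard first-order approximation to the geometric epipolar distance for a single correspondence, assuming a known fundamental matrix `F`. The function names are hypothetical, and how the paper aggregates this error into a reward signal is not shown.

```python
def mat_vec(m, v):
    """Multiply a 3x3 matrix (list of rows) by a 3-vector."""
    return [sum(m[i][j] * v[j] for j in range(3)) for i in range(3)]


def transpose(m):
    """Transpose a 3x3 matrix given as a list of rows."""
    return [[m[j][i] for j in range(3)] for i in range(3)]


def sampson_error(F, p1, p2):
    """Sampson error of the correspondence p1 <-> p2 (pixel coordinates).

    d = (x2^T F x1)^2 / ((F x1)_0^2 + (F x1)_1^2 + (F^T x2)_0^2 + (F^T x2)_1^2)
    with x1, x2 the homogeneous versions of p1, p2.
    """
    x1 = [p1[0], p1[1], 1.0]
    x2 = [p2[0], p2[1], 1.0]
    Fx1 = mat_vec(F, x1)
    Ftx2 = mat_vec(transpose(F), x2)
    numerator = sum(a * b for a, b in zip(x2, Fx1)) ** 2
    denominator = Fx1[0] ** 2 + Fx1[1] ** 2 + Ftx2[0] ** 2 + Ftx2[1] ** 2
    return numerator / denominator


# Fundamental matrix of an ideal rectified pair (pure horizontal shift):
# the epipolar constraint reduces to "matching points share the same row".
F_rectified = [
    [0.0, 0.0, 0.0],
    [0.0, 0.0, -1.0],
    [0.0, 1.0, 0.0],
]

consistent = sampson_error(F_rectified, (10.0, 5.0), (3.0, 5.0))    # same row
inconsistent = sampson_error(F_rectified, (10.0, 5.0), (3.0, 7.0))  # 2 px row offset
```

A low average error over tracked correspondences indicates that two frames are consistent with a single rigid camera motion, which is the property the summary ties to 3D-consistent video.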
PlanarGS: High-Fidelity Indoor 3D Gaussian Splatting Guided by Vision-Language Planar Priors

Xirui Jin, Renbiao Jin, Boying Li, Danping Zou, Wenxian Yu

tl;dr: depth & normal priors from DUSt3R+semantic planar priors from Grounded SAM & geometric cues & cross-view fusion

arxiv.org/abs/2510.23930
October 30, 2025 at 8:30 AM
Kineo: Calibration-Free Metric Motion Capture From Sparse RGB Cameras

Charles Javerliat, Pierre Raimbaud, Guillaume Lavoué

tl;dr: confidence-driven reliable correspondences+graph-based global optimization

arxiv.org/abs/2510.24464
October 30, 2025 at 8:29 AM