PhD from CUHK. 3D vision, SLAM, SfM, Image Matching (https://github.com/ericzzj1989/Awesome-Image-Matching).
🔹 GlobustVP
Convex Relaxation for Robust Vanishing Point Estimation in Manhattan World
🧱 Global optimality
💥 Tolerates up to 70% outliers
⚡ Fast runtime
📄 Paper: arxiv.org/abs/2505.04788
💻 Code: github.com/WU-CVGL/GlobustVP
1/
@idris1.bsky.social, @ericzzj.bsky.social, Samuel Schmidgall, Yumeng Wang, Paul Maria Scheikl, Axel Krieger
tl;dr: Gaussian Surfels->dynamic surgical scenes
arxiv.org/abs/2503.04079
Zijian Wu, Mingfeng Jiang, Zidian Lin, Ying Song, Hanjie Ma, Qun Wu, Dongping Zhang, Guiyang Pu
tl;dr: real views+multiple perturbation magnitudes->pseudo-views->optimization
arxiv.org/abs/2511.16030
Zijian Wu, Mingfeng Jiang, Zidian Lin, Ying Song, Hanjie Ma, Qun Wu, Dongping Zhang, Guiyang Pu
tl;dr: real views+multiple perturbation magnitudes->pseudo-views->optimization
arxiv.org/abs/2511.16030
@parskatt.bsky.social et 11 al.
tl;dr: in title.
Predict covariance per-pixel, more datasets, use DINOv3, adjust architecture.
arxiv.org/abs/2511.15706
@parskatt.bsky.social et 11 al.
tl;dr: in title.
Predict covariance per-pixel, more datasets, use DINOv3, adjust architecture.
arxiv.org/abs/2511.15706
Here are the main improvements we made since RoMa:
Here are the main improvements we made since RoMa:
The International Workshop on AI4Robotics by @naverlabseurope
2dys of Spatial AI, SLAM, robot learning, HRI, autonomy
This AM CET: @martinhumenberger.bsky.social @marcpollefeys.bsky.social Andrea Vedaldi Cordelia Schmid & @andrewdavidson.bsky.social ⬇️
The International Workshop on AI4Robotics by @naverlabseurope
2dys of Spatial AI, SLAM, robot learning, HRI, autonomy
This AM CET: @martinhumenberger.bsky.social @marcpollefeys.bsky.social Andrea Vedaldi Cordelia Schmid & @andrewdavidson.bsky.social ⬇️
Hoang Chuong Nguyen, Wei Mao, Jose M. Alvarez, Miaomiao Liu
tl;dr: base color from 3DGS rendering and learned residual inferred from nearby training images->pixel color
arxiv.org/abs/2511.14357
Hoang Chuong Nguyen, Wei Mao, Jose M. Alvarez, Miaomiao Liu
tl;dr: base color from 3DGS rendering and learned residual inferred from nearby training images->pixel color
arxiv.org/abs/2511.14357
Yutian Chen, @yuhengqiu.bsky.social, Ruogu Li, Ali Agha, Shayegan Omidshafiei, Jay Patrikar, @smash0190.bsky.social
tl;dr: ViT->distillation->per-token confidence->rank tokens->selective merging
arxiv.org/abs/2511.14751
Yutian Chen, @yuhengqiu.bsky.social, Ruogu Li, Ali Agha, Shayegan Omidshafiei, Jay Patrikar, @smash0190.bsky.social
tl;dr: ViT->distillation->per-token confidence->rank tokens->selective merging
arxiv.org/abs/2511.14751
Xinrui Li, Qi Cai, Yuanxin Wu
tl;dr: pose-only->decouple translation from rotation->rotation-only; reprojection error on rotation manifold
arxiv.org/abs/2511.12415
Xinrui Li, Qi Cai, Yuanxin Wu
tl;dr: pose-only->decouple translation from rotation->rotation-only; reprojection error on rotation manifold
arxiv.org/abs/2511.12415
Yuqi Zhang, Guanying Chen, Jiaxing Chen, Chuanyu Fu, Chuan Huang, Shuguang Cui
tl;dr: enhance the quality of conditioning images
arxiv.org/abs/2511.13121
Yuqi Zhang, Guanying Chen, Jiaxing Chen, Chuanyu Fu, Chuan Huang, Shuguang Cui
tl;dr: enhance the quality of conditioning images
arxiv.org/abs/2511.13121
Haosong Peng, Hao Li, Yalun Dai, @yushi-lan.bsky.social, Yihang Luo, Tianyu Qi, Zhengshen Zhang, Yufeng Zhan, Junfei Zhang, Wenchao Xu, Ziwei Liu
tl;dr: depth and camera intrinsics/extrinsics->VGGT
arxiv.org/abs/2511.10560
Haosong Peng, Hao Li, Yalun Dai, @yushi-lan.bsky.social, Yihang Luo, Tianyu Qi, Zhengshen Zhang, Yufeng Zhan, Junfei Zhang, Wenchao Xu, Ziwei Liu
tl;dr: depth and camera intrinsics/extrinsics->VGGT
arxiv.org/abs/2511.10560
Sangheon Yang, Yeongin Yoon, Hong Mo Jung, Jongwoo Lim
tl;dr: sparse optical flow->linear and angular velocity; generalized 3D ray-based motion field->different camera models
arxiv.org/abs/2511.09072
Sangheon Yang, Yeongin Yoon, Hong Mo Jung, Jongwoo Lim
tl;dr: sparse optical flow->linear and angular velocity; generalized 3D ray-based motion field->different camera models
arxiv.org/abs/2511.09072
Haiyi Li, Qi Chen, Denis Kalkofen, Hsiang-Ting Chen
tl;dr: Gaussian parameters->covariance->diagonal Fisher Information Matrix->uncertainty
arxiv.org/abs/2511.09397
Haiyi Li, Qi Chen, Denis Kalkofen, Hsiang-Ting Chen
tl;dr: Gaussian parameters->covariance->diagonal Fisher Information Matrix->uncertainty
arxiv.org/abs/2511.09397
Bartłomiej Baranowski, @s-esposito.bsky.social, @pgschossmann.bsky.social, @apchen.bsky.social, @andreasgeiger.bsky.social
arxiv.org/abs/2511.06810
Bartłomiej Baranowski, @s-esposito.bsky.social, @pgschossmann.bsky.social, @apchen.bsky.social, @andreasgeiger.bsky.social
arxiv.org/abs/2511.06810
Botao Ye, Boqi Chen, @haofeixu.bsky.social, @danielbarath.bsky.social, @marcpollefeys.bsky.social
arxiv.org/abs/2511.07321
Botao Ye, Boqi Chen, @haofeixu.bsky.social, @danielbarath.bsky.social, @marcpollefeys.bsky.social
arxiv.org/abs/2511.07321
Mengqi Guo, Bo Xu, Yanyan Li, Gim Hee Lee
tl;dr: joint optimization of motion mask and scene reconstruction
arxiv.org/abs/2511.05229
Mengqi Guo, Bo Xu, Yanyan Li, Gim Hee Lee
tl;dr: joint optimization of motion mask and scene reconstruction
arxiv.org/abs/2511.05229
Shiwei Ren, Tianci Wen, Yongchun Fang, Biao Lu
tl;dr: multi-view consistency->densification and pruning
arxiv.org/abs/2511.04283
Shiwei Ren, Tianci Wen, Yongchun Fang, Biao Lu
tl;dr: multi-view consistency->densification and pruning
arxiv.org/abs/2511.04283
Shaohan Li, Yunpeng Shi, Gilad Lerman
tl;dr: translation averaging->Welsch loss->Message-Passing Least Squares with distance-based cycle-consistency
arxiv.org/abs/2511.02329
Shaohan Li, Yunpeng Shi, Gilad Lerman
tl;dr: translation averaging->Welsch loss->Message-Passing Least Squares with distance-based cycle-consistency
arxiv.org/abs/2511.02329
Parsa Hosseininejad, Kimia Khabiri, Shishir Gopinath, Soudabeh Mohammadhashemi, Karthik Dantu, Steven Y. Ko
tl;dr: GPU->triangulation & map point fusion & local BA; CPU->keyframe culling
arxiv.org/abs/2511.02036
Parsa Hosseininejad, Kimia Khabiri, Shishir Gopinath, Soudabeh Mohammadhashemi, Karthik Dantu, Steven Y. Ko
tl;dr: GPU->triangulation & map point fusion & local BA; CPU->keyframe culling
arxiv.org/abs/2511.02036
Zhicong Sun, Jacqueline Lo, Jinxing Hu
tl;dr: synthetic dataset
arxiv.org/abs/2510.27133
Zhicong Sun, Jacqueline Lo, Jinxing Hu
tl;dr: synthetic dataset
arxiv.org/abs/2510.27133
Jiayi Chen, Wenxuan Song, Pengxiang Ding, Ziyang Zhou, Han Zhao, Feilong Tang, Donglin Wang, Haoang Li
tl;dr: multiple modalities->single synchronous denoising trajectory
arxiv.org/abs/2511.01718
Jiayi Chen, Wenxuan Song, Pengxiang Ding, Ziyang Zhou, Han Zhao, Feilong Tang, Donglin Wang, Haoang Li
tl;dr: multiple modalities->single synchronous denoising trajectory
arxiv.org/abs/2511.01718
Yuxuan Li, Tao Wang, Xianben Yang
tl;dr: Lucas-Kanade 3D optical flow+reprojection errors->poses; standard differentiable rendering3DGS parameters
arxiv.org/abs/2510.26117
Yuxuan Li, Tao Wang, Xianben Yang
tl;dr: Lucas-Kanade 3D optical flow+reprojection errors->poses; standard differentiable rendering3DGS parameters
arxiv.org/abs/2510.26117
Rhodri Guerrier, Adam W. Harley, @dimadamen.bsky.social
tl;dr: fine-tune MASt3R with point matching loss and visibility head to handle dynamic scenes
arxiv.org/abs/2510.26443
Rhodri Guerrier, Adam W. Harley, @dimadamen.bsky.social
tl;dr: fine-tune MASt3R with point matching loss and visibility head to handle dynamic scenes
arxiv.org/abs/2510.26443
Bernhard Kerbl
tl;dr: in title
arxiv.org/abs/2510.26694
Bernhard Kerbl
tl;dr: in title
arxiv.org/abs/2510.26694
Guangan Jiang, Tianzi Zhang, Dong Li, @ericzzj.bsky.social, Haoang Li, Mingrui Li, Hongyu Wang
tl;dr: 3DGS-based framework for high-fidelity animatable human avatar reconstruction
arxiv.org/abs/2510.22140
Guangan Jiang, Tianzi Zhang, Dong Li, @ericzzj.bsky.social, Haoang Li, Mingrui Li, Hongyu Wang
tl;dr: 3DGS-based framework for high-fidelity animatable human avatar reconstruction
arxiv.org/abs/2510.22140