Publications

(* equal contribution)

[1] VertexRegen: Mesh Generation with Continuous Level of Detail [pdf] [project page]
Xiang Zhang, Yawar Siddiqui, Armen Avetisyan, Chris Xie, Jakob Engel, Henry Howard-Jenkins

ICCV 2025

[2] DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion [pdf] [project page]
Qingcheng Zhao*, Xiang Zhang*, Haiyang Xu, Zeyuan Chen, Jianwen Xie, Yuan Gao, Zhuowen Tu

ICCV 2025

[3] Lay-Your-Scene: Natural Scene Layout Generation with Diffusion Transformers [pdf]
Divyansh Srivastava, Xiang Zhang, He Wen, Chenru Wen, Zhuowen Tu

ICCV 2025

[4] YOLO-Count: Differentiable Object Counting for Text-to-Image Generation [pdf]
Guanning Zeng, Xiang Zhang, Zirui Wang, Haiyang Xu, Zeyuan Chen, Bingnan Li, Zhuowen Tu

ICCV 2025

[5] MONSTERMASH: Multidirectional, Overlapping, Nested, Spiral Text Extraction for Recognition Models of Arabic-Script Handwriting [pdf]
Danlu Chen, Jacob Murel, Taimoor Shahid, Xiang Zhang, Jonathan Parkes Allen, Taylor Berg-Kirkpatrick, David A. Smith

ICDAR 2024 Workshops

[6] Bayesian Diffusion Models for 3D Shape Reconstruction [pdf] [project page]
Haiyang Xu*, Yu Lei*, Zeyuan Chen, Xiang Zhang, Yue Zhao, Yilin Wang, Zhuowen Tu

CVPR 2024

[7] OmniControlNet: Dual-stage Integration for Conditional Image Generation [pdf]
Yilin Wang*, Haiyang Xu*, Xiang Zhang, Zeyuan Chen, Zhizhou Sha, Zirui Wang, Zhuowen Tu

CVPRW 2024

[8] Uni-3D: A Universal Model for Panoptic 3D Scene Reconstruction [pdf] [code]
Xiang Zhang*, Zeyuan Chen*, Fangyin Wei, Zhuowen Tu

ICCV 2023

[9] Text Spotting Transformers [pdf] [code]
Xiang Zhang, Yongwen Su, Subarna Tripathi, Zhuowen Tu

CVPR 2022

[10] Pose Recognition with Cascade Transformers [pdf] [code]
Ke Li*, Shijie Wang*, Xiang Zhang*, Yifan Xu, Weijian Xu, Zhuowen Tu

CVPR 2021