Sitemap

A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.

Pages

Posts

publications

PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision

Published in CVPR (Oral), 2022

Self-supervised co-evolution of 3D human pose estimation, imitation, and hallucination. Oral presentation.

Recommended citation: Kehong Gong, Bingbing Li, Jianfeng Zhang, Tao Wang, Jing Huang, Michael Bi Mi, Jiashi Feng, Xinchao Wang. (2022). "PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision." CVPR (Oral).
Download Paper

SWiT-4D: Sliding-Window Transformer for Lossless and Parameter-Free Temporal 4D Generation

Published in arXiv, 2025

Sliding-window transformer for lossless and parameter-free temporal 4D generation. arXiv preprint.

Recommended citation: Kehong Gong, Zhengyu Wen, Mingxi Xu, Weixia He, Qi Wang, Ning Zhang, Zhengyu Li, Chenbin Li, Dongze Lian, Wei Zhao, Xiaoyu He, Mingyuan Zhang. (2025). "SWiT-4D: Sliding-Window Transformer for Lossless and Parameter-Free Temporal 4D Generation." arXiv preprint.
Download Paper

MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons

Published in arXiv, 2026

End-to-end motion capture for arbitrary skeletons. arXiv preprint.

Recommended citation: Kehong Gong, Zhengyu Wen, Dao Thien Phong, Mingxi Xu, Weixia He, Qi Wang, Ning Zhang, Zhengyu Li, Guanli Hou, Dongze Lian, Xiaoyu He, Mingyuan Zhang, Hanwang Zhang. (2026). "MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons." arXiv preprint.
Download Paper

MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos

Published in CVPR, 2026

Unified 3D motion capture for arbitrary skeletons from monocular videos. Deployed in the virtual-pet feature of Huawei smartphones.

Recommended citation: Kehong Gong, Zhengyu Wen, Weixia He, Mingxi Xu, Qi Wang, Ning Zhang, Zhengyu Li, Dongze Lian, Wei Zhao, Xiaoyu He, Mingyuan Zhang. (2026). "MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos." CVPR.
Download Paper