Kimodo: Scaling Controllable Human Motion Generation
Technical Report, 2026

Staff research scientist at Nvidia. Email = wilsonwanguoft @ .com
I am a staff research scientist on Nvidia's Generalist Embodied Agent Research (GEAR) team. Previously, I obtained my PhD from the University of Toronto, where I was proudly advised by Prof. Sanja Fidler and Prof. Jimmy Ba. My name is pronounced Ting-Wu, and it means "midday" in Chinese.
My research experience ranges widely from developing animation motion engines to deploying real robots. Since my PhD, I have been dedicated to exploring a central theme: creating scalable, robust, and generalist motion skills that seamlessly bridge the virtual and real worlds.
Technical Report, 2026
PhD Thesis, 2022
International Conference on Computer Vision, ICCV, 2021
Arxiv, 2020
7th International Conference on Learning Representations (ICLR), 2019
6th International Conference on Learning Representations (ICLR), 2018
IEEE Wireless Communications and Networking Conference (WCNC), 2016
IEEE 83rd Vehicular Technology Conference (VTC), 2016
Patent: CN105447529 A, 2016.
Work done as an intern at SenseTime. Contributions used in DeepFashion.
or Ting-Wu Wang)
I was born in Changsha. My grandfather named me: "Ting-wu" means "the exact midday". The name comes from the Commentary on the Water Classic, written by the ancient Chinese geographer Li Daoyuan:
"To witness the Sun and the Moon, one shall wait until the exact midday or midnight."