Zhiyuan Gao

PhD Student

Department of Computer Science

Viterbi School of Engineering, USC

Email: gaozhiyu [AT] usc.edu

Research Topics

  • Differentiable Simulation
  • Real-to-Sim
  • High Fidelity Digital Twin and Simulation
  • Robotics, Computer Graphics, 3D Computer Vision
I am a PhD student in Computer Science at the University of Southern California, advised by Prof. Jernej Barbic. I completed my Master's degree in CS at USC, advised by Prof. Yue Wang and Prof. Jernej Barbic. I also interned at the Vision & Graphics Lab, where I was fortunate to work with Prof. Yajie Zhao.

News

  • [2026-01] One paper was accepted to ICLR 2026!
  • [2025-10] One paper was accepted to NeurIPS 2025!
  • [2025-07] One paper was accepted to ICCV 2025!
  • [2024-12] One paper was accepted as an oral presentation at 3DV 2025!
  • [2024-09] One paper was accepted to WACV 2025!

Publications

Seeing the Wind from a Falling Leaf

Zhiyuan Gao*, Jiageng Mao*, Hong-Xing Yu, Haozhe Lou, Emily Yue-ting Jia, Jernej Barbic, Jiajun Wu, Yue Wang (* equal contribution)
NeurIPS 2025

Differentiable pipeline that recovers wind forces from leaf videos and enables physics-consistent video editing and wind manipulation.

Abstract

A longstanding goal in computer vision is to model motions from videos, while the representations behind motions, i.e. the invisible physical interactions that cause objects to deform and move, remain largely unexplored. In this work, we present an end-to-end differentiable inverse graphics framework, which jointly models object geometry, physical properties, and interactions directly from videos. By backpropagating through physics simulations, we can recover force representations from object movements. We validate our approach on both synthetic and real-world scenarios, demonstrating the ability to estimate plausible force fields—such as wind patterns affecting a falling leaf. Our method shows promise for physics-based video generation and editing, bridging computer vision with physics by understanding the physical processes underlying visual data.

Skyeyes: Ground Roaming using Aerial View Images

Zhiyuan Gao*, Wenbin Teng*, Gonglin Chen, Jinsen Wu, Ningli Xu, Rongjun Qin, Andrew Feng, Yajie Zhao (* equal contribution)
WACV 2025

Skyeyes is a framework that can generate photorealistic sequences of ground view images using only aerial view inputs, thereby creating a ground roaming experience.

Abstract

Integrating aerial imagery-based scene generation into applications like autonomous driving and gaming enhances realism in 3D environments, but challenges remain in creating detailed content for occluded areas and ensuring real-time, consistent rendering. In this paper, we introduce Skyeyes, a novel framework that can generate photorealistic sequences of ground view images using only aerial view inputs, thereby creating a ground roaming experience. More specifically, we combine a 3D representation with a view consistent generation model, which ensures coherence between generated images. This method allows for the creation of geometrically consistent ground view images, even with large view gaps. The images maintain improved spatial-temporal coherence and realism, enhancing scene comprehension and visualization from aerial perspectives. To the best of our knowledge, there are no publicly available datasets that contain pairwise geo-aligned aerial and ground view imagery. Therefore, we build a large, synthetic, and geo-aligned dataset using Unreal Engine. Both qualitative and quantitative analyses on this synthetic dataset display superior results compared to other leading synthesis approaches.

Experience

Fun Facts

  • My Erdős number is 3: Zhiyuan Gao → Jitendra Malik → Fan Chung Graham → Paul Erdős.
  • I built phdstat, a tool for visualizing grad school application data from TheGradCafe with near real-time updates. It has helped 2.7k+ applicants track their applications. If you're interested in contributing to this project, feel free to contact me.
  • I've contributed to several games:
    Flappy Balloon
    Alt-controller party game guiding a balloon through obstacles with custom inputs.
    I was one of the producers.
    PL-23
    AI-driven detective/puzzle game in the spirit of story-rich investigation titles.
    I am the lead producer and a former backend engineer.

Contact

300 N Beaudry Ave, Los Angeles, CA 90012, United States