Boyang Wang

Hello! This is Boyang Wang (ēŽ‹åšę“‹). I am a first-year PhD student at the University of Virginia, supervised by Zezhou Cheng.

I graduated with a M.S. in Robotics and a B.S. in Computer Science from the University of Michigan (UMich). At UMich, I worked with Prof. JJ Park, Prof. Nima Fazeli, Prof. Hun-Seok Kim, and Prof. Chad Jenkins. During the summer of 2025, I interned at Shanghai AI Lab, working closely with Xudong Xu on the embodied AI department.

My research interests mainly lie in Generative Models (especially video) for General, Robotics, and Anime domains!

Email  Ā·  Twitter  Ā·  Google Scholar  Ā·  Github  Ā·  HuggingFace

profile photo
News

  • 01/2026: 🥳 RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation is released on Arxiv.
  • 09/2025: 🎉 Frame In-N-Out is accepted by NeurIPS 2025! See you in San Diego!
  • 08/2025: 🥳 Begin my journey as PhD student at the University of Virginia.
  • 05/2025: 🎉 Frame In-N-Out: Unbounded Controllable Image-to-Video Generation is released on Arxiv.
  • 02/2025: 🎉 One paper is accepted by CVPR 2025. See you in Nashville!
  • 01/2025: 🎉 One paper is accepted by ICRA 2025. See you in Atlanta!
  • 01/2025: 🎉 One paper is accepted by RA-L.
  • 12/2024: 🎉 Graduated from University of Michigan as Master Degree.
  • 08/2024: 🎉 Being selected as GSI for the EECS498 Computer Graphics and Generative Models.
  • 07/2024: 🎉 This&That: Language-Gesture Controlled Video Generation for Robot Planning is released on Arxiv.
  • 02/2024: 🎉 One paper is accepted by CVPR 2024.
  • 10/2023: 🎉 One paper is accepted by WACV 2024.

  • Selected Publications

    (* indicates equal contribution)



    RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation
    Boyang Wang *, Haoran Zhang *, Shujie Zhang *, Jinkun Hao, Mingda Jia, Qi Lv, Yucheng Mao, Zhaoyang Lyu, Jia Zeng, Xudong Xu, Jiangmiao Pang
    In Submission
    Paper Ā· Website Ā· Github

    A Multi-View Inpainting-based Video Diffusion Model with Identity Reference as conditions to Augment Robotics Manipulation data in both Simulation and Real-Robot setup.

    Frame In-N-Out: Unbounded Controllable Image-to-Video Generation
    Boyang Wang, Xuweiyi Chen, Matheus Gadelha, Zezhou Cheng
    NeurIPS 2025
    Paper Ā· Website Ā· Github Ā· HuggingFace

    First paper in Video Generation that studies Frame In and Frame Out cinematics effect, which is conditioned on First Frame Expanded Canvas, Text Prompt, Motion Trajectory and Identity Reference.

    Sparse Image Sets Restoration with Multi-View Diffusion Model
    Yucheng Mao *, Boyang Wang *, Nilesh Kulkarni, Jeong Joon Park
    CVPR 2025
    Paper Ā· Website

    Reconstruct a 3D scene with degraded image sets by Mutliview Diffusion Model.

    This&That: Language-Gesture Controlled Video Generation for Robot Planning
    Boyang Wang, Nikhil Sridhar, Chao Feng, Mark Van der Merwe, Adam Fishman, Nima Fazeli, Jeong Joon Park
    ICRA 2025
    Paper Ā· Website Ā· Github Ā· HuggingFace

    First paper with Language & Gesture conditioned Video Generative Model for the robot action mentoring.

    APISR: Anime Production Inspired Real-World Anime Super-Resolution
    Boyang Wang, Fengyu Yang, Xihang Yu, Chao Zhang, Hanbin Zhao
    CVPR 2024
    Paper Ā· Github Ā· HuggingFace

    SOTA Anime Super-Resolution and Restoration.

    VCISR: Blind Single Image Super-Resolution with Video Compression Synthetic Data
    Boyang Wang*, Bowen Liu*, Shiyu Liu*, Fengyu Yang
    WACV 2024
    Paper Ā· Github

    First paper studying the Video Compression Degradation on Image Super-Resolution.



    Teaching Assistant Experience

  • Teaching Assistant, CS 4501: Computer Vision, Spring 2026
  • Graduate Student Instructor (GSI), EECS 498: Computer Graphics and Generative Models, Fall 2024
  • Graduate Student Instructor (GSI), EECS 367/ROB 380/ROB 511: AutoRob, Winter 2024
  • Teaching Assistant (IA), EECS 367 / ROB 380 / ROB 511: AutoRob, Winter 2023


  • Intern Work Experience

  • Research Intern, Shanghai AI Lab , 05/2025 - 08/2025
  • Machine Learning Engineer, ByteDance , 04/2022 - 08/2022


  • Award

  • UVA Provost Fellowship
  • Conference Travel Award: NeurIPS 2025


  • Something want to share at the end

    I believe that the optimizer to the life is the faith one has.