Dongyu Yan 💫 (闫栋宇)

I'm a PhD student (2024 ~ now), working with Prof. Ying-Cong Chen at AI Thrust, Information Hub, Hong Kong University of Science and Technology (Guangzhou).

I received my Master's degree (2021 ~ 2024) in the School of Mechanical Engineering and Automation at Harbin Institute of Technology (Shenzhen), supervised by Prof. Haoyao Chen in nROS-lab. I obtained a B.Eng. degree (2017 ~ 2021) from Harbin Institute of Technology.

I'm now a research intern at Lightspeed Studios of Tencent IEG. Before that, I was a research intern in robotics at the Department of Flight System of DJI (Dec. 2021 ~ Oct. 2022), and a research intern in computer vision at the Department of Transformer of Megvii (May 2021 ~ Aug. 2021).

Dongyu Yan profile photo

Research

My research interests lie in computer vision, video world model, 3D generation, and robotics. I used to work on topics that combine implicit 3D representation with robotics tasks, including neural reconstruction, neural SLAM, and implicit next-best-view planning. I've also worked on 3D diffusion models for geometry and texture generation. My research objective now is to build interactive world models that can serve as the next generation game engine, requiring it to be high-quality, consistent, and efficient. Below are some of my selected papers. Some papers are highlighted.

Selected Publications

MSI-NeRF preview

MSI-NeRF: Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance Field
Dongyu Yan, Guanyu Huang, Fengyu Quan, Haoyao Chen.
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2025)
Arxiv / Paper / Video / Github

We create a generalizable NeRF that takes four fisheye images as input and outputs a 3D MSI representation for novel view synthesis and depth estimation. It can be trained with synthetic depth data only and can generalize to a wide range of scenarios. We also released a fisheye multi-view dataset for training and evaluation.

Active implicit reconstruction preview

Active Implicit Object Reconstruction using Uncertainty-guided Next-Best-View Optimization
Dongyu Yan*, Jianheng Liu*, Fengyu Quan, Haoyao Chen.
IEEE Robotics and Automation Letters (RA-L 2023)
Arxiv / Paper / Video / Github

We propose an active implicit object reconstruction method leveraging direct uncertainty evaluation from an implicit occupancy map and next-best-view optimization. It directly optimizes a virtual camera trajectory in the uncertainty field to maximize information gain for the current reconstruction.

EINRUL preview

Efficient Implicit Neural Reconstruction Using LiDAR
Dongyu Yan, Xiaoyang Lyu, Jieqi Shi, Yi Lin.
IEEE International Conference on Robotics and Automation (ICRA 2023)
Arxiv / Paper / Video / Github / Project Page

We propose an implicit reconstruction method that uses LiDAR scans as input, which is efficient, accurate, and applicable in various scenarios. This method can be applied to real-world scenes, even with sparse LiDAR scans and poorly aligned poses, as shown in the self-collected dataset we released.

Competitions

RoboMaster competition RoboMaster competition

RoboMaster 2021: Ranked 2nd
As a member of the electronic control team, I was responsible for the control system of the Sentry robot, and also served as a robot operator.

Robocon 2020 China Division: Ranked 3rd
As the leader of the electronic control team, I was responsible for the development of the PMSM Driver system.

RoboMaster 2019: Ranked 5-6th
As the leader of the computer vision team, I was responsible for the development of the SLAM and Navigation system.

My time in HITCRT taught me a lot. Because of it, I embarked on the path of scientific research today. I will always be grateful to my teammates who fought alongside me. Hope our team can get better and better.

About Me

Me in my racing car. Me playing guitar and keytar. I can also compose and arrange music. Me riding my Ita-Bike.

Maniac of Racing and Car Modification
I love to free myself on the race track. I have a Porsche 987.2 boxster which is well modified by myself, and I hope to take it to every famous track in the world. I'm also interested in sim racing and have a cockpit in my dormitory to practice. I'm currently participating in iRacing's sim racing championships.

Big Fan of Music
I'm a big fan of music. My favorite music genres are Vocaloid and Japanese Pop. I can play violin and now I'm learning to play guitar and keytar. I'm also a beginner in music composition and arrangement. I'm currently learning to compose and arrange music for Vocaloid. I hope to become a Vocaloid producer in the future.

Standard Otaku
I have been watching anime since the 2010s. You can call me using my nijigen ID by 星空_StarrySky. My favorite character is Hatsune Miku, a famous virtual singer from Japan. I also have a Miku Itasha, the Porsche mentioned above, which makes me a member of one of the biggest Itasha groups in China called Hatsune Miku Itasha Lab.

Skills
Python   /   C++   /   Matlab   /   Linux   /   Embedded Systems
CAD   /   PCB Design   /   Fluent English & Japanese