Introduction
The world is three-dimensional. This fact was first seen by trilobites, the first organisms capable of sensing light. From that moment, nervous systems began to evolve, gradually transforming mere sight into insight, understanding, and action. All these combined gives rise to intelligence.
Despite remarkable technological advancements in recent decades, modern embodied systems remain far from achieving full intelligence. They fall short in several key aspects: (i) contain information necessary for physical interaction, such as temporal dynamics of the scene; (ii) have a prior over semantic relevance, and should focus on task-relevant features like objects and their relationships; and (iii) be compact, avoiding the inclusion of irrelevant details, such as background elements. Attempts have been made, including integrating foundational models and utilizing large-scale data. Yet, the path to true intelligence remains long, with significant progress still required.
Contact
-
Contact us via
[email protected]
-
Join Discord to chat with the organizers
-
Join discussions in our WeChat group
Schedule
-
Li CHen
HKU, China -
Ankit Goyal
NVIDIA, USABiography
Ankit Goyal is a Research Scientist in Robotics at NVIDIA working with Dieter Fox. He did his Ph.D. in Computer Science at Princeton University, where he was advised by Prof. Jia Deng. He completed Masters from University of Michigan and Bachelors from IIT Kanpur.
-
Track Organizer and Winners
-
Coffee Break
-
Yunzhu Li
Columbia, USABiography
Yunzhu Li is an Assistant Professor of Computer Science at Columbia University.
Before joining Columbia, he was an Assistant Professor at UIUC CS. He also spent time as a Postdoc at the Stanford Vision and Learning Lab (SVL), working with Fei-Fei Li and Jiajun Wu. He received his PhD from the Computer Science and Artificial Intelligence Laboratory (CSAIL) at MIT, where he was advised by Antonio Torralba and Russ Tedrake, and he obtained his bachelor's degree from Peking University. -
Track Organizer and Winners
-
Matthias Nießner
TUM, GermanyBiography
Matthias Nießner is a Professor at the Technical University of Munich, where he leads the Visual Computing Lab. Before, he was a Visiting Assistant Professor at Stanford University. Prof. Nießner's research lies at the intersection of computer vision, graphics, and machine learning, where he is particularly interested in cutting-edge techniques for 3D reconstruction, semantic 3D scene understanding, video editing, and AI-driven video synthesis. In total, he has published over 150 academic publications, including 25 papers at the prestigious ACM Transactions on Graphics (SIGGRAPH / SIGGRAPH Asia) journal and 55 works at the leading vision conferences (CVPR, ECCV, ICCV); several of these works won best paper awards, including at SIGCHI'14, HPG'15, SPG'18, and the SIGGRAPH'16 Emerging Technologies Award for the best Live Demo.
Prof. Nießner's work enjoys wide media coverage, with many articles featured in main-stream media including the New York Times, Wall Street Journal, Spiegel, MIT Technological Review, and many more, and his was work led to several TV appearances such as on Jimmy Kimmel Live, where Prof. Nießner demonstrated the popular Face2Face technique; Prof. Nießner's academic Youtube channel currently has over 5 million views.
For his work, Prof. Nießner received several awards: he is a TUM-IAS Rudolph Moessbauer Fellow (2017 - ongoing), he won the Google Faculty Award for Machine Perception (2017), the Nvidia Professor Partnership Award (2018), as well as the prestigious ERC Starting Grant 2018 which comes with 1.5 million Euro in research funding; in 2019, he received the Eurographics Young Researcher Award honoring the best upcoming graphics researcher in Europe.
In addition to his academic impact, Prof. Nießner is a co-founder and director of Synthesia Inc., a leading startup dedicated to democratize synthetic media generation with cutting-edge AI-driven video synthesis technology. -
Lunch Break
-
Li Chen
HKU, ChinaBiography
Li Chen am currently pursuing a PhD at the University of Hong Kong (HKU), while also conducting research at OpenDriveLab. He workw under the supervision of Prof. Ping LUO and Prof. Hongyang LI. He earned his Master's in Robotics from the Robotics Institute at the University of Michigan, Ann Arbor, in 2020, after a Bachelor's degree in Mechanical Engineering from Shanghai Jiao Tong University in 2019.
His research interests primarily focus on robotics, encompassing areas such as autonomous driving, robot learning, and robotic systems. Representative projects are highlighted below. -
Pulkit Agrawal
MIT, USABiography
Pulkit Agrawal is an Associate Professor in the department of Electrical Engineering and Computer Science (EECS) at MIT. His lab is a part of the Computer Science and Artificial Intelligence Lab (CSAIL), is affiliated with the Laboratory for Information and Decision Systems (LIDS) and involved with NSF AI Institute for Artificial Intelligence and Fundamental Interactions ( IAIFI ).
He completed his Ph.D. at UC Berkeley; undergraduate studies from IIT Kanpur. Co-founded SafelyYou Inc. that builds fall prevention technology. Advisor to Tutor Intelligence, Lab0 Inc., and Common Sense Machines. -
Coffee Break
-
Katerina Fragkiadaki
CMU, USABiography
Katerina Fragkiadaki is a JPMorgan Chase Associate Professor of Computer Science in the Machine Learning Department at Carnegie Mellon University. She works in Artificial Intelligence at the intersection of Computer Vision, Machine Learning, Language Understanding and Robotics. Prior to joining MLD's faculty she spent three wonderful years as a post doctoral researcher first at UC Berkeley working with Jitendra Malik and then at Google Research in Mountain View working with the video group. She completed her Ph.D. in GRASP, UPenn with Jianbo Shi . She did her undergraduate studies at the National Technical University of Athens and before that she was in Crete.
-
Jia Deng
Princeton, USABiography
Jia Deng is a Professor of Computer Science at Princeton University. His research focuses on computer vision and machine learning. He received his Ph.D. from Princeton University and his B.Eng. from Tsinghua University, both in computer science. He is a recipient of the Sloan Research Fellowship, the NSF CAREER award, the ONR Young Investigator award, an ICCV Marr Prize, a CVPR test-of-time award and two ECCV Best Paper Awards.
-
Speakers and Organizers
-
Closing Remarks
Organizers
- | Hongyang Li | HKU |
- | Philipp Krähenbühl | UT Austin |
- | Kashyap Chitta | NVIDIA |
- | Eric Jang | 1X Technologies |
- | Andrei Bursuc | Valeo |
- | Huijie Wang | OpenDriveLab |