IAPR Distinguished Lectures
Date | Speaker | Title |
---|---|---|
May 8 | Daniel Cremers | Direct Methods for 3D Reconstruction & Visual SLAM |
May 9 | Jianbo Shi | Connecting the Dots: Embodied Visual Perception from First-person Cameras |
May 10 | Hiroshi Murase | Image Recognition for Driver Assistance in Intelligent Vehicles |
Direct Methods for 3D Reconstruction & Visual SLAM
- IAPR Distinguished Speaker
- Prof. Daniel Cremers
- Departments of Informatics and Mathematics, Technical University of Munich, Germany
- Synopsis
-
The reconstruction of the 3D world from images is among the central challenges in computer vision. Starting in the 2000s, researchers have pioneered algorithms which can reconstruct camera motion and sparse feature-points in real-time. In my talk, I will introduce spatially dense methods for camera tracking and 3D reconstruction which do not require feature point estimation, which exploit all available input data and which recover dense or semi-dense geometry rather than sparse point clouds. Applications include 3D photography, 3D television, and autonomous vehicles.
Connecting the Dots: Embodied Visual Perception from First-person Cameras
- IAPR Distinguished Speaker
- Prof. Jianbo Shi
- School of Engineering and Applied Science, University of Pennsylvania, USA
- Synopsis
-
A computer has a complete photographical memory. It creates massive but isolated sensory moments. Unlike such fragmented photographic memory, human memories are highly connected through episodes that allow us to relate past experiences and predict future actions. How to computationally model a human like episodic memory system that connects photographically accurate sensory moments? Our insight is that an active interaction is a key to link between episodes because sensory moments are fundamentally centered on an active person-self. Our experiences are created by and shared through our social and physical interactions, i.e., we connect episodes driven by similar actions and, in turn, recall these past connected episodes to take a future actions. Therefore, connecting the dotted moments to create an episodic memory requires understanding the purposeful interaction between human (person-self) and world.
Photographs are only half of our world experience: it records what are out there. What are in our head, our intention-attention-physiological states during the social and physical interactions, are missing from the memory recording. This needs creating an embodied memory link between our inner ‘selves’ with the external episode, and a first person camera is an ideal sensor to capture, model, and predict the embodied memory link because it encodes a complete visual audio sensation of the camera wearer’s interaction with the world. We leverage purposeful actions measured by first person cameras to reveal the internal states of the camera wearer, and use the similar internal states to connect the wearer’s episodic sensations of the world.
Image Recognition for Driver Assistance in Intelligent Vehicles
- IAPR Distinguished Speaker
- Prof. Hiroshi Murase
- Graduate School of Information Science Nagoya University, Japan
- Synopsis
-
Due to our aging society, driver assistance and automated driving have been intensely researched. Recognizing the surrounding environment utilizing image processing is a core technological element for driving intelligence. Specifically, technology must accurately detect pedestrians, obstacles such as other vehicles, condition of a driver, weather, etc. using cameras and various sensors. In this presentation, I introduce image recognition technology necessary for driver assistance.