Physically grounded Multimodal foundation Models
![Thumbnail](http://memento.epfl.ch/image/28274/1440x810.jpg)
Event details
Date | 29.08.2024 |
Hour | 15:00 › 17:00 |
Speaker | Kunal Pratap Singh |
Location | |
Category | Conferences - Seminars |
EDIC candidacy exam
Exam president: Prof. Caglar Gulchere
Thesis advisor: Prof. Amir Zamir
Co-examiner: Prof. Mackenize Mathis
Abstract
On a broader sense, I'm interested in studying the role of the agent's embodiment and environment in the emergence of intelligence, through the lens of multimodality and active perception.
To this end, I'm more specifically trying to study, how an agent can bootstrap its visual capabilities by solely observing the environments based on its physical sensors.
Moreover, I'm also interested in studying the impact of actions on its perception, thereby closing the action perception loop. Specifically, I want to understand how can we design an agent that can take appropriate actions to learn a certain visual property.
Background papers
coming soon
Exam president: Prof. Caglar Gulchere
Thesis advisor: Prof. Amir Zamir
Co-examiner: Prof. Mackenize Mathis
Abstract
On a broader sense, I'm interested in studying the role of the agent's embodiment and environment in the emergence of intelligence, through the lens of multimodality and active perception.
To this end, I'm more specifically trying to study, how an agent can bootstrap its visual capabilities by solely observing the environments based on its physical sensors.
Moreover, I'm also interested in studying the impact of actions on its perception, thereby closing the action perception loop. Specifically, I want to understand how can we design an agent that can take appropriate actions to learn a certain visual property.
Background papers
coming soon
Practical information
- General public
- Free