Deep learning-based illuminant estimation for mobile devices

Humans possess the ability to see objects as having the same color even when viewed under different illuminations. Cameras inherently lack this capability. A process called auto white balance (AWB) has to be applied by the camera to mimic this behavior of the human visual system. AWB is one of the first steps in a […]

Read More
Efficient Learning Methods for Multimodal Understanding

The main focus of this research is to develop representation learning architectures and algorithms that can help perform various multimodal understanding tasks, and at the same time reduce the need for human supervision in the form of costly annotations. To achieve this goal, a learning system must be able to: (1) learn new tasks or […]

Read More
Weakly supervised representation learning for sequential and composite actions.

Camera enabled AI-based personal assistants will need to recognize human actions in order to be safe and effective. Current machine learning approaches for action recognition require extensive datasets of annotated videos that depicting the actions to be recognized. Such datasets are expensive to acquire. The goal of this project is to decrease the annotation required […]

Read More