Lesson 5: Vision and Language

Tim · March 1, 2019, 1:30pm

Lesson 5.2: From Language to Vision and Back Again

Description: Using higher level knowledge to improve object detection, language-vision model that simultaneously processes sentences and recognizes image objects and events, performing tasks like image/video retrieval, generating descriptions, and question answering.

Instructor: Andrei Barbu

Click here for the lesson transcript

Click here for the lesson slides