CS6187 - Vision and Language
* The offering term is subject to change without prior notice.
This course introduces algorithms and techniques for integrating computer vision and natural language processing in innovative applications such as robot dialog systems, image/video captioning, cross-media multimedia search, and question answering. The course discusses the latest technologies for bridging the gap between vision and language, with topics ranging from machine translation and feature extraction and learning to the design of deep neural network architectures.
Assessment (indicative only; please check the detailed course information)
Continuous Assessment: 70%
Examination Duration: 2 hours
Detailed Course Information
Department of Computer Science