15. Vision and Language

How language helps computer vision?

What is language grounding?

Explicit grounding

  • Joint syntax learning
  • Joint structured learning and explicit parsing

Implicit grounding

  • RNN revisited
  • Large VL model with transformer

Grounding language on visual concepts and programs

Emergence of language & communication

Previous
Next