Listen, Look And Deliberate: Visual Context-Aware Speech Recognition Using Pre-Trained Text-Video Representations
14:57

Listen, Look And Deliberate: Visual Context-Aware Speech Recognition Using Pre-Trained Text-Video Representations

Log in

or