Cross Audio-Visual Recognition using 3D Convolutional Neural Networks
This project provides an implementation of coupled 3D convolutional neural networks for audio-visual matching. Lip-reading is one specific application of this work.
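To make "audio-visual matching" concrete, here is a minimal sketch of how a matching decision could be made once each stream has been mapped to an embedding vector. It assumes, without confirmation from this project, that the two networks emit fixed-length embeddings compared by Euclidean distance. The names `match_score` and `is_match` and the `threshold` value are hypothetical and are not part of this project's code.

```python
import math


def match_score(audio_emb, visual_emb):
    """Return the Euclidean distance between two embeddings (smaller = closer).

    Assumption: both embeddings are equal-length sequences of floats,
    e.g. the outputs of the audio and visual networks.
    """
    # math.dist raises ValueError if the two vectors differ in length.
    return math.dist(audio_emb, visual_emb)


def is_match(audio_emb, visual_emb, threshold=1.0):
    """Declare a match when the distance falls below a threshold.

    The default threshold is an illustrative placeholder; in practice it
    would be tuned on held-out data.
    """
    return match_score(audio_emb, visual_emb) < threshold
```

For example, `is_match([0.1, 0.2], [0.1, 0.25])` treats the pair as matching under the default threshold, while a pair of embeddings that lie far apart would be rejected.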