Multimodal Machine Learning Group (MMLG)

All

19 repositories

multimodal-ml-reading-list
Public
Reading list for research topics in multimodal machine learning
MIT License
•859•8•0•0•Updated Mar 14, 2023Mar 14, 2023
iPerceive
Public
Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering | Python3 | PyTorch | CNNs | Causality | Reasoning | LSTMs | Transformers | Multi-Head Self Attention
Python
•
MIT License
•20•1•0•0•Updated Nov 10, 2020Nov 10, 2020
CVSE
Public
The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020)
Python
•19•0•0•0•Updated Oct 21, 2020Oct 21, 2020
Video-guided-Machine-Translation
Public
Starter code for the VMT task and challenge
Python
•6•0•0•0•Updated Jul 29, 2020Jul 29, 2020
Multimodal-Emotion-Recognition
Public
A real time Multimodal Emotion Recognition web app for text, sound and video inputs
Jupyter Notebook
•
Apache License 2.0
•296•2•0•0•Updated Jul 6, 2020Jul 6, 2020
Grounding_in_Dialogue
Public
ACL 2020 Tutorial by Malihe Alikhani and Matthew Stone
3•0•0•0•Updated Jun 29, 2020Jun 29, 2020
awesome-multimodal-knowledge-graph
Public
A curated list of awesome papers, datasets and tutorials within Multimodal Knowledge Graph.
TeX
•
MIT License
•45•1•0•0•Updated Feb 5, 2020Feb 5, 2020
awesome-multimodal-machine-translation
Public
A curated list of awesome papers, datasets and tutorials within Multimodal Machine Learning.
TeX
•4•0•0•0•Updated Feb 2, 2020Feb 2, 2020
Multimodal-Transformer
Public
[ACL'19] [PyTorch] Multimodal Transformer
Python
•152•0•0•0•Updated Dec 12, 2019Dec 12, 2019
visualbert
Public
Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"
Python
•
Other
•105•0•0•0•Updated Nov 2, 2019Nov 2, 2019
MTN
Public
Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)
Python
•
MIT License
•25•0•0•0•Updated Oct 19, 2019Oct 19, 2019
how2-dataset
Public
This repository contains code and metadata of How2 dataset
Python
•17•0•0•0•Updated Oct 7, 2019Oct 7, 2019
lxmert
Public
PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".
Python
•
MIT License
•159•0•0•0•Updated Sep 27, 2019Sep 27, 2019
pythia
Public
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Python
•
Other
•939•0•0•0•Updated Sep 23, 2019Sep 23, 2019
vilbert_beta
Public
Jupyter Notebook
•96•0•0•0•Updated Sep 21, 2019Sep 21, 2019
nmtpytorch
Public
Sequence-to-Sequence Framework in PyTorch
Jupyter Notebook
•
Other
•51•0•0•0•Updated Aug 12, 2019Aug 12, 2019
multimodal-sentiment-analysis
Public
Attention-based multimodal fusion for sentiment analysis
Python
•
MIT License
•75•0•0•0•Updated Jan 12, 2019Jan 12, 2019
contextual-multimodal-fusion
Public
Contextual inter modal attention for multimodal sentiment analysis
Python
•
MIT License
•8•0•0•0•Updated Dec 13, 2018Dec 13, 2018
MultimodalNMT
Public
Python
•
MIT License
•1•1•0•0•Updated Jul 20, 2018Jul 20, 2018