-
sbvqa2.0 Public
The official implementation of the paper: SBVQA 2.0: Robust End-to-End Speech-Based Visual Question Answering for Open-Ended Questions
-
PSU Sentiment Analysis Session Code
-
python Public
Forked from HassanAlgoz/pythonكتاب للمبتدئين في البرمجة بلغة بايثون بطريقة عملية ومتدرجة باللغة العربية
Jupyter Notebook Apache License 2.0 UpdatedDec 21, 2024 -
ctcdecode Public
Forked from WayenVan/ctcdecodePyTorch CTC Decoder bindings, modified for better installation and compatibility
C++ MIT License UpdatedJun 27, 2024 -
VQVAE-Pytorch Public
Forked from explainingai-code/VQVAE-PytorchThis repo implements VQVAE on mnist and as well as colored version of mnist images. It also implements simple LSTM for generating sample numbers using the encoder outputs of trained VQVAE
Python UpdatedFeb 6, 2024 -
TTS Public
Forked from coqui-ai/TTS🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Python Mozilla Public License 2.0 UpdatedJan 12, 2024 -
DeepFilterNet Public
Forked from Rikorose/DeepFilterNetNoise supression using deep filtering
Python Other UpdatedAug 8, 2023 -
recurrent-memory-transformer-pytorch Public
Forked from lucidrains/recurrent-memory-transformer-pytorchImplementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch
Python MIT License UpdatedMay 27, 2023 -
transformer Public
Forked from hyunwoongko/transformerPyTorch Implementation of "Attention Is All You Need"
Python UpdatedMay 18, 2023 -
nanoGPT Public
Forked from karpathy/nanoGPTThe simplest, fastest repository for training/finetuning medium-sized GPTs.
Python MIT License UpdatedMay 2, 2023 -
pydub Public
Forked from jiaaro/pydubManipulate audio with a simple and easy high level interface
Python MIT License UpdatedApr 30, 2023 -
NeMo Public
Forked from NVIDIA/NeMoNeMo: a toolkit for conversational AI
Python Apache License 2.0 UpdatedJan 28, 2023 -
The code of the "Language Models and Their Applications" session
-
BLIP Public
Forked from salesforce/BLIPPyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Jupyter Notebook BSD 3-Clause "New" or "Revised" License UpdatedDec 21, 2022 -
pyannote-audio Public
Forked from pyannote/pyannote-audioNeural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Python MIT License UpdatedNov 8, 2022 -
deepspeech.pytorch Public
Forked from SeanNaren/deepspeech.pytorchSpeech Recognition using DeepSpeech2.
Python MIT License UpdatedSep 30, 2022 -
-
This repo contains a notebook that illustrates how to train Transformer-XL on 🤗 Transformers library
Jupyter Notebook MIT License UpdatedAug 14, 2022 -
Train a CNN model on MNIST dataset and use it to develop an adversarial example to fool the model
-
bottom-up-attention Public
Forked from peteanderson80/bottom-up-attentionBottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Jupyter Notebook MIT License UpdatedFeb 28, 2022 -
Face-Transformer Public
Forked from zhongyy/Face-TransformerFace Transformer for Recognition
Python MIT License UpdatedJan 3, 2022 -
CTDNN Public
Forked from chenllliang/CTDNNMMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition
Python MIT License UpdatedDec 4, 2021 -
sequitur-g2p Public
Forked from sequitur-g2p/sequitur-g2pThis is a github repository of the abandonware Sequitur G2P by Bisani & Ney
Python GNU General Public License v2.0 UpdatedNov 23, 2021 -
Speech-Transformer Public
Forked from foamliu/Speech-TransformerPyTorch re-implementation of Speech-Transformer
Python MIT License UpdatedNov 19, 2021 -
kaldi-serve Public
Forked from skit-ai/kaldi-serveServer framework for Kaldi ASR Toolkit
C++ Apache License 2.0 UpdatedNov 8, 2021 -
wav2vec2-kenlm Public
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
-
warp-ctc Public
Forked from SeanNaren/warp-ctcPytorch Bindings for warp-ctc FIX ERRORS related to CUDA 10.1
Cuda Apache License 2.0 UpdatedOct 7, 2021 -
facenet-pytorch-glint360k Public
Forked from tamerthamoqa/facenet-pytorch-glint360kA PyTorch implementation of the 'FaceNet' paper for training a facial recognition model with Triplet Loss using the glint360k dataset. A pre-trained model using Triplet Loss is available for download.
Python MIT License UpdatedSep 16, 2021 -
-