Skip to content
View farisalasmary's full-sized avatar

Block or report farisalasmary

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • sbvqa2.0 Public

    The official implementation of the paper: SBVQA 2.0: Robust End-to-End Speech-Based Visual Question Answering for Open-Ended Questions

    Python 7 MIT License Updated Feb 12, 2025
  • PSU Sentiment Analysis Session Code

    Jupyter Notebook 4 5 MIT License Updated Jan 29, 2025
  • python Public

    Forked from HassanAlgoz/python

    كتاب للمبتدئين في البرمجة بلغة بايثون بطريقة عملية ومتدرجة باللغة العربية

    Jupyter Notebook Apache License 2.0 Updated Dec 21, 2024
  • ctcdecode Public

    Forked from WayenVan/ctcdecode

    PyTorch CTC Decoder bindings, modified for better installation and compatibility

    C++ MIT License Updated Jun 27, 2024
  • This repo implements VQVAE on mnist and as well as colored version of mnist images. It also implements simple LSTM for generating sample numbers using the encoder outputs of trained VQVAE

    Python Updated Feb 6, 2024
  • TTS Public

    Forked from coqui-ai/TTS

    🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

    Python Mozilla Public License 2.0 Updated Jan 12, 2024
  • Noise supression using deep filtering

    Python Other Updated Aug 8, 2023
  • Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch

    Python MIT License Updated May 27, 2023
  • PyTorch Implementation of "Attention Is All You Need"

    Python Updated May 18, 2023
  • nanoGPT Public

    Forked from karpathy/nanoGPT

    The simplest, fastest repository for training/finetuning medium-sized GPTs.

    Python MIT License Updated May 2, 2023
  • pydub Public

    Forked from jiaaro/pydub

    Manipulate audio with a simple and easy high level interface

    Python MIT License Updated Apr 30, 2023
  • NeMo Public

    Forked from NVIDIA/NeMo

    NeMo: a toolkit for conversational AI

    Python Apache License 2.0 Updated Jan 28, 2023
  • The code of the "Language Models and Their Applications" session

    Jupyter Notebook 9 MIT License Updated Jan 10, 2023
  • BLIP Public

    Forked from salesforce/BLIP

    PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

    Jupyter Notebook BSD 3-Clause "New" or "Revised" License Updated Dec 21, 2022
  • Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

    Python MIT License Updated Nov 8, 2022
  • Speech Recognition using DeepSpeech2.

    Python MIT License Updated Sep 30, 2022
  • shieldrnn Public

    The implementation of ShieldRNN

    Python 3 1 MIT License Updated Aug 30, 2022
  • This repo contains a notebook that illustrates how to train Transformer-XL on 🤗 Transformers library

    Jupyter Notebook MIT License Updated Aug 14, 2022
  • Train a CNN model on MNIST dataset and use it to develop an adversarial example to fool the model

    Jupyter Notebook 2 1 MIT License Updated Jul 16, 2022
  • Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

    Jupyter Notebook MIT License Updated Feb 28, 2022
  • Face Transformer for Recognition

    Python MIT License Updated Jan 3, 2022
  • CTDNN Public

    Forked from chenllliang/CTDNN

    MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition

    Python MIT License Updated Dec 4, 2021
  • This is a github repository of the abandonware Sequitur G2P by Bisani & Ney

    Python GNU General Public License v2.0 Updated Nov 23, 2021
  • PyTorch re-implementation of Speech-Transformer

    Python MIT License Updated Nov 19, 2021
  • kaldi-serve Public

    Forked from skit-ai/kaldi-serve

    Server framework for Kaldi ASR Toolkit

    C++ Apache License 2.0 Updated Nov 8, 2021
  • Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding

    Python 74 12 MIT License Updated Oct 11, 2021
  • warp-ctc Public

    Forked from SeanNaren/warp-ctc

    Pytorch Bindings for warp-ctc FIX ERRORS related to CUDA 10.1

    Cuda Apache License 2.0 Updated Oct 7, 2021
  • A PyTorch implementation of the 'FaceNet' paper for training a facial recognition model with Triplet Loss using the glint360k dataset. A pre-trained model using Triplet Loss is available for download.

    Python MIT License Updated Sep 16, 2021
  • elgen Public

    Forked from sikora507/elgen
    Jupyter Notebook Updated Aug 1, 2021
  • arastance Public

    Forked from Tariq60/arastance
    Updated Jul 27, 2021