-
Microsoft Research
- Redmond, WA, USA
- liyuanlucasliu.github.io
- @LiyuanLucas
-
TransformerEngine-MoE Public
Forked from NVIDIA/TransformerEngineA library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
-
OpenRLHF Public
Forked from OpenRLHF/OpenRLHFAn Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Python Apache License 2.0 UpdatedJan 2, 2025 -
streaming Public
Forked from mosaicml/streamingA Data Streaming Library for Efficient Neural Network Training
-
-
-
vllm Public
Forked from ykim362/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedMay 10, 2024 -
LiyuanLucasLiu.github.io Public
S-Theme: Simple & Beautiful Responsive Jekyll theme.
-
DeepSpeed Public
Forked from microsoft/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
-
-
-
DALLE-pytorch Public
Forked from lucidrains/DALLE-pytorchImplementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
Python MIT License UpdatedMay 24, 2023 -
-
LM-LSTM-CRF Public
Empower Sequence Labeling with Task-Aware Language Model
-
Transformer-Clinic Public
Understanding the Difficulty of Training Transformers
-
ColossalAI Public
Forked from hpcaitech/ColossalAIColossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training
Python Apache License 2.0 UpdatedMay 18, 2022 -
fairseq Public
Forked from facebookresearch/fairseqFacebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License UpdatedApr 29, 2022 -
NvidiaRTX-LHRv2Unlocker Public
Forked from liji3278/NvidiaRTX-LHRv2UnlockerThe first fully automated BIOS modifier for RTX cards with LHR v2 lock.
UpdatedFeb 23, 2022 -
RAdam Public
On the Variance of the Adaptive Learning Rate and Beyond
-
FasterTransformer Public
Forked from NVIDIA/FasterTransformerTransformer related optimization, including BERT, GPT
C++ Apache License 2.0 UpdatedMay 8, 2021 -
NN-CUDA-Example Public template
Forked from godweiyang/NN-CUDA-ExampleSeveral simple examples for popular neural network toolkits calling custom CUDA operators.
Python Apache License 2.0 UpdatedApr 29, 2021 -
Convex-Optimization-Course Public
Forked from niaohe/Convex-Optimization-CourseRepository for my convex optimization course.
UpdatedFeb 6, 2021 -
-
spike2py Public
Forked from MartinHeroux/spike2pyspike2py provides a simple interface to analyse and visualise data
Python GNU General Public License v3.0 UpdatedJan 5, 2021 -
emnlp-2020-virtual-conference-images Public
Forked from acl-org/emnlp-2020-virtual-conference-imagesMakefile UpdatedNov 16, 2020 -
NREPapers Public
Forked from thunlp/NREPapersMust-read papers on neural relation extraction (NRE)
-
NLP-progress Public
Forked from sebastianruder/NLP-progressRepository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Python MIT License UpdatedOct 31, 2020 -
APART Public
Forked from zichaoli/APARTOverfitting or Underfitting? Understand Robustness Drop in Adversarial Training
Python UpdatedOct 21, 2020 -
ipyplot Public
Forked from karolzak/ipyplotIPyPlot is a small python package offering fast and efficient plotting of images inside Jupyter Notebooks cells. It's using IPython with HTML for faster, richer and more interactive way of displayi…
Python MIT License UpdatedJun 15, 2020 -
SciencePlots Public
Forked from garrettj403/SciencePlotsMatplotlib styles for scientific plotting
Python MIT License UpdatedMay 28, 2020 -
marian Public
Forked from marian-nmt/marianFast Neural Machine Translation in C++
C++ Other UpdatedMay 17, 2020