-
veScale Public
Forked from volcengine/veScaleA PyTorch Native LLM Training Framework
Python Apache License 2.0 UpdatedMar 15, 2024 -
ByteTransformer Public
Forked from bytedance/ByteTransformeroptimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
C++ Apache License 2.0 UpdatedMar 14, 2024 -
pylcs Public
super fast cpp implementation of longest common subsequence/substring
-
CUDALibrarySamples Public
Forked from NVIDIA/CUDALibrarySamplesCUDA Library Samples
Cuda Other UpdatedDec 10, 2022 -
flash-attention Public
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
-
-
open-gpu-kernel-modules Public
Forked from NVIDIA/open-gpu-kernel-modulesNVIDIA Linux open GPU kernel module source
C Other UpdatedMay 13, 2022 -
cutlass Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
C++ BSD 3-Clause "New" or "Revised" License UpdatedJan 17, 2022 -
tvm Public
Forked from apache/tvmOpen deep learning compiler stack for cpu, gpu and specialized accelerators
Python Apache License 2.0 UpdatedDec 6, 2021 -
torchscript-example Public
Forked from codekansas/torchscript-cmake-exampleExample CMake project for TorchScript
Python UpdatedNov 23, 2021 -
effective_transformer Public
Forked from bytedance/effective_transformerRunning BERT without Padding
C++ Apache License 2.0 UpdatedNov 16, 2021 -
tvm-rfcs Public
Forked from apache/tvm-rfcsA home for the final text of all TVM RFCs.
Apache License 2.0 UpdatedSep 13, 2021 -
Cpp_Primer_Practice Public
Forked from applenob/Cpp_Primer_Practice搞定C++:punch:。C++ Primer 中文版第5版学习仓库,包括笔记和课后练习答案。
-
assignment2-2018 Public
Forked from dlsys-course/assignment2-2018(Spring 2018) Assignment 2: Graph Executor with TVM
Python UpdatedJun 20, 2020 -
tensorflow Public
Forked from tensorflow/tensorflowAn Open Source Machine Learning Framework for Everyone
C++ Apache License 2.0 UpdatedJun 16, 2020 -
CUDA_by_practice Public
Forked from eegkno/CUDA_by_practiceCUDA by practice
-
-
-
-
bert Public
Forked from google-research/bertTensorFlow code and pre-trained models for BERT
Python Apache License 2.0 UpdatedSep 11, 2019 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
C++ Other UpdatedAug 26, 2019 -
tutorials Public
Forked from pytorch/tutorials -
-
fairseq Public
Forked from facebookresearch/fairseqFacebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License UpdatedAug 15, 2019 -
flaskweb Public
An easy to start yet full-featured web framework
-
translate Public
Forked from pytorch/translateTranslate - a PyTorch Language Library
Python BSD 3-Clause "New" or "Revised" License UpdatedAug 9, 2019 -
cuBERT Public
Forked from zhihu/cuBERTFast implementation of BERT inference directly on NVIDIA (CUDA, CUBLAS) and Intel MKL
C MIT License UpdatedJul 22, 2019 -
bert-as-service Public
Forked from jina-ai/clip-as-serviceMapping a variable-length sentence to a fixed-length vector using BERT model
-
pyflame2 Public
Forked from uber-archive/pyflame🔥 Pyflame: A Ptracing Profiler For Python
-
libfsm Public
Forked from katef/libfsmDFA regular expression library & friends
C BSD 2-Clause "Simplified" License UpdatedJul 3, 2019