Skip to content

Commit

Permalink
doc: 更新readme内容
Browse files Browse the repository at this point in the history
  • Loading branch information
nl8590687 committed Feb 11, 2022
1 parent 6b70ec7 commit d4e4759
Show file tree
Hide file tree
Showing 2 changed files with 13 additions and 7 deletions.
10 changes: 6 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,7 @@ ASRT是一个基于深度学习的中文语音识别系统,如果您觉得喜
[![GPL-3.0 Licensed](https://img.shields.io/badge/License-GPL3.0-blue.svg?style=flat)](https://opensource.org/licenses/GPL-3.0)
[![TensorFlow Version](https://img.shields.io/badge/Tensorflow-1.15+-blue.svg)](https://www.tensorflow.org/)
[![Python Version](https://img.shields.io/badge/Python-3.6+-blue.svg)](https://www.python.org/)
[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.5808435.svg)](https://doi.org/10.5281/zenodo.5808435)

**ReadMe Language** | 中文版 | [English](https://github.com/nl8590687/ASRT_SpeechRecognition/blob/master/README_EN.md) |

Expand Down Expand Up @@ -32,6 +33,7 @@ ASRT是一个基于深度学习的中文语音识别系统,如果您觉得喜
有关AI柠檬ASRT语音项目的相关信息亦可使用[AI柠檬站内搜索引擎](https://s.ailemon.net/)进行相关信息的搜索。

## ASRT相关资料
* [查看ASRT项目的Wiki文档](https://wiki.ailemon.net/docs/asrt-doc)

ASRT的原理请查看本文:
* [ASRT:一个中文语音识别系统](https://blog.ailemon.net/2018/08/29/asrt-a-chinese-speech-recognition-system/)
Expand Down Expand Up @@ -75,14 +77,14 @@ $ git clone https://github.com/nl8590687/ASRT_SpeechRecognition.git
```shell
$ cd ASRT_SpeechRecognition

$ mkdir dataset
$ mkdir /data/speech_data

$ tar zxf <数据集压缩文件名> -C dataset/
$ tar zxf <数据集压缩文件名> -C /data/speech_data/
```

然后需要将datalist目录下的文件全部拷贝到 `dataset/` 目录下,也就是将其跟数据集放在一起。
下载默认数据集的拼音标签文件:
```shell
$ cp -rf datalist/* dataset/
$ python download_default_datalist.py
```

目前可用的模型有24、25和251
Expand Down
10 changes: 7 additions & 3 deletions README_EN.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,7 @@
[![GPL-3.0 Licensed](https://img.shields.io/badge/License-GPL3.0-blue.svg?style=flat)](https://opensource.org/licenses/GPL-3.0)
[![TensorFlow Version](https://img.shields.io/badge/Tensorflow-1.15+-blue.svg)](https://www.tensorflow.org/)
[![Python Version](https://img.shields.io/badge/Python-3.6+-blue.svg)](https://www.python.org/)
[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.5808435.svg)](https://doi.org/10.5281/zenodo.5808435)

**ReadMe Language** | [中文版](https://github.com/nl8590687/ASRT_SpeechRecognition/blob/master/README.md) | English |

Expand All @@ -16,6 +17,8 @@ If you have any questions in your works with this project, welcome to put up iss

You can check the [FAQ Page (Chinese)](https://wiki.ailemon.net/docs/asrt-doc/asrt-doc-1deoeud494h4f) first before asking questions to avoid repeating questions.

* [ASRT project's Wiki document](https://wiki.ailemon.net/docs/asrt-doc)

A post about ASRT's introduction
* [ASRT: Chinese Speech Recognition System (Chinese)](https://blog.ailemon.net/2018/08/29/asrt-a-chinese-speech-recognition-system/)

Expand Down Expand Up @@ -51,17 +54,18 @@ After cloning the repository via git, go to the project root directory; create a
```shell
$ cd ASRT_SpeechRecognition

$ mkdir dataset
$ mkdir /data/speech_data

$ tar zxf <dataset zip files name> -C dataset/
$ tar zxf <dataset zip files name> -C /data/speech_data/
```

Then, you need to copy all the files in the 'datalist' directory to the dataset directory, that is, put them together with the data set.

Note that in the current version, in the configuration file, two data sets, Thchs30 and ST-CMDS, are added by default, please delete them if you don’t need them. If you want to use other data sets, you need to add data configuration yourself, and use the standard format supported by ASRT to organize the data in advance.

To download pinyin syllable list files for default dataset:
```shell
$ cp -rf datalist/* dataset/
$ python download_default_datalist.py
```

Currently available models are 24, 25 and 251
Expand Down

0 comments on commit d4e4759

Please sign in to comment.