GitHub - wang-keran/sherpa-onnx: Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter

Supported functions

Speech recognition	Speech synthesis	Speaker verification	Speaker identification
✔️	✔️	✔️	✔️

Spoken Language identification	Audio tagging	Voice activity detection
✔️	✔️	✔️

Keyword spotting	Add punctuation
✔️	✔️

Supported platforms

Architecture	Android	iOS	Windows	macOS	linux
x64	✔️		✔️	✔️	✔️
x86	✔️		✔️
arm64	✔️	✔️	✔️	✔️	✔️
arm32	✔️				✔️
riscv64					✔️

Supported programming languages

1. C++	2. C	3. Python	4. C#	5. Java	6. JavaScript
✔️	✔️	✔️	✔️	✔️	✔️

7. Kotlin	8. Swift	9. Go	10. Dart	11. Rust
✔️	✔️	✔️	✔️	✔️

For Rust support, please see https://github.com/thewh1teagle/sherpa-rs

It also supports WebAssembly.

Introduction

This repository supports running the following functions locally

Speech-to-text (i.e., ASR); both streaming and non-streaming are supported
Text-to-speech (i.e., TTS)
Speaker identification
Speaker verification
Spoken language identification
Audio tagging
VAD (e.g., silero-vad)
Keyword spotting

on the following platforms and operating systems:

x86, x86_64, 32-bit ARM, 64-bit ARM (arm64, aarch64), RISC-V (riscv64)
Linux, macOS, Windows, openKylin
Android, WearOS
iOS
NodeJS
WebAssembly
Raspberry Pi
RV1126
LicheePi4A
VisionFive 2
旭日X3派
etc

with the following APIs

C++, C, Python, Go, C#
Java, Kotlin, JavaScript
Swift
Dart

Links for pre-built Android APKs

Description	URL	中国用户
Streaming speech recognition	Address	点此
Text-to-speech	Address	点此
Voice activity detection (VAD)	Address	点此
VAD + non-streaming speech recognition	Address	点此
Two-pass speech recognition	Address	点此
Audio tagging	Address	点此
Audio tagging (WearOS)	Address	点此
Speaker identification	Address	点此
Spoken language identification	Address	点此
Keyword spotting	Address	点此

Links for pre-built Flutter APPs

Real-time speech recognition

Description	URL	中国用户
Streaming speech recognition	Address	点此

Text-to-speech

Description	URL	中国用户
Android (arm64-v8a, armeabi-v7a, x86_64)	Address	点此
Linux (x64)	Address	点此
macOS (x64)	Address	点此
macOS (arm64)	Address	点此
Windows (x64)	Address	点此

Note: You need to build from source for iOS.

Links for pre-trained models

Description	URL
Speech recognition (speech to text, ASR)	Address
Text-to-speech (TTS)	Address
VAD	Address
Keyword spotting	Address
Audio tagging	Address
Speaker identification (Speaker ID)	Address
Spoken language identification (Language ID)	See multi-lingual Whisper ASR models from Speech recognition
Punctuation	Address

Useful links

Documentation: https://k2-fsa.github.io/sherpa/onnx/
Bilibili 演示视频: https://search.bilibili.com/all?keyword=%E6%96%B0%E4%B8%80%E4%BB%A3Kaldi

How to reach us

Please see https://k2-fsa.github.io/sherpa/social-groups.html for 新一代 Kaldi 微信交流群 and QQ 交流群.

Name		Name	Last commit message	Last commit date
Latest commit History 748 Commits
.github		.github
android		android
c-api-examples		c-api-examples
cmake		cmake
dart-api-examples		dart-api-examples
dotnet-examples		dotnet-examples
ffmpeg-examples		ffmpeg-examples
flutter-examples		flutter-examples
flutter		flutter
go-api-examples		go-api-examples
ios-swift		ios-swift
ios-swiftui		ios-swiftui
java-api-examples		java-api-examples
kotlin-api-examples		kotlin-api-examples
mfc-examples		mfc-examples
nodejs-addon-examples		nodejs-addon-examples
nodejs-examples		nodejs-examples
python-api-examples		python-api-examples
rust-api-examples		rust-api-examples
scripts		scripts
sherpa-onnx		sherpa-onnx
swift-api-examples		swift-api-examples
toolchains		toolchains
wasm		wasm
.clang-format		.clang-format
.clang-tidy		.clang-tidy
.flake8		.flake8
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CMakeLists.txt		CMakeLists.txt
CPPLINT.cfg		CPPLINT.cfg
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
build-aarch64-linux-gnu.sh		build-aarch64-linux-gnu.sh
build-android-arm64-v8a.sh		build-android-arm64-v8a.sh
build-android-armv7-eabi.sh		build-android-armv7-eabi.sh
build-android-x86-64.sh		build-android-x86-64.sh
build-android-x86.sh		build-android-x86.sh
build-arm-linux-gnueabihf.sh		build-arm-linux-gnueabihf.sh
build-ios-no-tts.sh		build-ios-no-tts.sh
build-ios-shared.sh		build-ios-shared.sh
build-ios.sh		build-ios.sh
build-riscv64-linux-gnu.sh		build-riscv64-linux-gnu.sh
build-swift-macos.sh		build-swift-macos.sh
build-wasm-simd-asr.sh		build-wasm-simd-asr.sh
build-wasm-simd-kws.sh		build-wasm-simd-kws.sh
build-wasm-simd-nodejs.sh		build-wasm-simd-nodejs.sh
build-wasm-simd-tts.sh		build-wasm-simd-tts.sh
release.sh		release.sh
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Supported functions

Supported platforms

Supported programming languages

Introduction

Links for pre-built Android APKs

Links for pre-built Flutter APPs

Real-time speech recognition

Text-to-speech

Links for pre-trained models

Useful links

How to reach us

About

Releases

Packages

Languages

License

wang-keran/sherpa-onnx

Folders and files

Latest commit

History

Repository files navigation

Supported functions

Supported platforms

Supported programming languages

Introduction

Links for pre-built Android APKs

Links for pre-built Flutter APPs

Real-time speech recognition

Text-to-speech

Links for pre-trained models

Useful links

How to reach us

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages