-
-
Notifications
You must be signed in to change notification settings - Fork 2
/
Copy pathrequirements.txt
49 lines (38 loc) · 1.23 KB
/
requirements.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
#Requirements for recasepunc:
git+https://github.com/benob/mosestokenizer.git
numpy
regex
tqdm
transformers
wheel #Technically not necessary but it'll make your life easier.
keyring #to store sensitive parameters such as the API keys
#Requirements for pyttsx3 TTS:
pyttsx3
soundfile
sounddevice
#Requirements for 11.ai TTS:
elevenlabslib
requests
#Requirements for the vosk recognition:
vosk
#PyAudio; sys.platform != 'win32'
#PyAudioWPatch; sys.platform == 'win32' #This is a fork of pyaudio that support WASAPI loopback devices, so you can use an output as an input.
#So as it turns out, vosk can only recognize input from single-channel sources. In addition, WASAPI loopback devices seem unable to use any number of channels
#that isn't the maximum, so I had to scrap this whole idea. You'll need to rely on something like VBCable if you want to use an audio output as input.
PyAudio
#Requirements for the whisper recognition:
#For the local version:
faster-whisper
openai-whisper
SpeechRecognition
#For the API
openai
#Requirements for the azure recognition:
azure-cognitiveservices-speech
pycaw; sys.platform == 'win32'
comtypes; sys.platform == 'win32'
#For the OBS subtitles.
obsws-python
#For the translation
googletrans>=4.0.0rc1
deepl