GitHub - asuzukosi/audiobook-generator: A Python package that generates audiobooks from readily available PDF versions of books. It uses the Microsoft SpeechT5 text-to-speech model from Hugging Face to generate audiobooks from PDFs

Audiobook generator that uses the Microsoft SpeechT5 text-to-speech model from Hugging Face to generate audiobooks from PDFs. (NOTE: the project is still under active development and experimentation. Breaking changes were made recently, so it will not work right now.)

Introduction

Getting access to quality audiobooks is difficult or expensive, but PDF versions of books are much easier to find. If you are someone who is always on the move, finding time to read PDF books can be unenjoyable and sometimes impossible. I am one such person, so I decided to build an audiobook generator using a text-to-speech model.

Structure

The project mainly consists of main.py, which contains the main executable code and is the entry point of the application, and aiReader.py, which handles text cleaning and speech synthesis.
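
The text cleaning step might look something like the sketch below. It is an illustration only and is not taken from aiReader.py: the function names `clean_text` and `split_into_chunks` and the `max_chars` default are assumptions. The idea is to repair line-break damage that is common in text extracted from PDFs, then group whole sentences into short chunks, since text-to-speech models generally work on limited-length input.

```python
import re


def clean_text(raw: str) -> str:
    """Normalize text extracted from a PDF page (hypothetical helper)."""
    # Rejoin words hyphenated across a line break: "synthe-\nsis" -> "synthesis"
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", raw)
    # Collapse every run of whitespace, including newlines, into one space
    text = re.sub(r"\s+", " ", text)
    return text.strip()


def split_into_chunks(text: str, max_chars: int = 400) -> list[str]:
    """Group whole sentences into chunks of at most max_chars where possible.

    A single sentence longer than max_chars is kept as its own chunk.
    """
    # Split after sentence-ending punctuation followed by whitespace
    sentences = re.split(r"(?<=[.!?])\s+", text)
    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # Adding this sentence would exceed the limit: start a new chunk
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

Each chunk could then be passed to the speech model in turn and the resulting audio segments joined into one file.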

To run the application

To run the application, first install the requirements using the following command.

pip install -r requirements.txt

After successfully installing all the requirements, you can run the application by specifying your input PDF file and your output MP3 file. Here is an example command:

python main.py -i snowcrash.pdf -o snowcrash.mp3

This generates the audiobook for you. The time it takes varies with the size of the book.
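
For reference, the -i and -o flags from the example command could be parsed along the lines of the sketch below, using the standard argparse module. This is a guess at the interface rather than the actual contents of main.py: the function name `build_parser` and the long option names `--input` and `--output` are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for the -i/-o flags (hypothetical, not from main.py)."""
    parser = argparse.ArgumentParser(
        description="Generate an audiobook from a PDF file."
    )
    # The long option names are assumed; only -i and -o appear in the README
    parser.add_argument("-i", "--input", required=True,
                        help="path to the input PDF file")
    parser.add_argument("-o", "--output", required=True,
                        help="path to the output MP3 file")
    return parser


# Parse the arguments from the example command explicitly
args = build_parser().parse_args(["-i", "snowcrash.pdf", "-o", "snowcrash.mp3"])
print(args.input, args.output)
```

In a real entry point, `parse_args()` would be called without a list so that it reads the arguments from the command line.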

