Skip to content

NeoAdonis/fwmc-transcript

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿพ FUWAMOCO Transcripts Repository

This repo contains scripts to generate automatic transcriptions, as well as summaries, for FUWAMOCO content, starting with FWMC Morning.

This is a fan made project. The contents in this repository follow the hololive production Derivative Works Guidelines set forth by Cover Corp.

Note

In its initial state, the transcriptions provided in this repository were generated automatically using speech recognition software. The summaries were created aided by a large language model virtual assistant. As a result, there may be inaccuracies or errors in these documents that are not representative of the original content. If you notice significant errors, feel free to submit corrections or raise issues through the repository's issue tracker.

๐ŸŒ… FUWAMOCO Morning

FUWAMOCO Morning is an online, short-format morning show hosted by the fuzzy and fluffy guard dog sisters FUWAMOCO. Their aim with this show is to bring a smile to everyone's face and to help them start the day on the right paw!

An index of all FWMC Morning episodes, summaries and transcripts can be found at morning/index.md.

โš’๏ธ Building

Some of the files here are generated automatically.

  1. Media is extracted directly from YouTube along with its metadata. Audio is converted into .wav format for easier processing.
  2. Audio is automatically transcribed using a mix of automatic speech recognition plus some manual workarounds to provide the most accurate transcript possible from the get-go.
  3. A basic summary is created. A LLM/RAG tool can be leveraged to summarize the information from the transcript and help with the initial draft.
  4. Using the basic summary and metadata from each media, a fancier summary document can be created programmatically.
  5. An index of all summaries can be automatically generated.

Prerequisites

  • Bun or Node.js
  • FFmpeg
  • Optional: Miniforge to handle virtual Python environments.
  • Otherwise, make sure that Python 3.12 is installed.

If using Windows, you can install all these prerequisites with WinGet.

๐Ÿ”ฐ Quick start

Install prerequisites in Windows

Make sure that WinGet is installed, then run the following:

winget install Oven-sh.Bun
winget install Gyan.FFmpeg
winget install CondaForge.Miniforge3

Install prerequisites in Linux

WhisperX requires Python 3.12. Make sure that such version is installed on your system before creating an environment.

You can install Bun or Node.js from your distro package manager, or directly by following the instructions on the Bun or Node.js download websites.

Set up Miniconda environment

  1. If using PowerShell, init Miniconda: conda init powershell
  2. Create environment with some base packages: conda env create --file environment.yml
  3. Activate environment: conda activate transcripts
  4. Proceed with WhisperX installation.

Set up venv

  1. Create environment: python3.12 -m venv transcripts
  2. Activate environment:
    • PowerShell: .\transcripts\Scripts\Activate.ps1
    • Bash: source ./transcripts/bin/activate
  3. Install base packages: pip install -r requirements_0_base.txt
  4. Proceed with installation of other packages.

Install WhisperX

  1. Install requirements: pip install -r requirements_1_torch_cu129.txt (use requirements_1_torch_cpu_<platform>.txt instead if you don't have an Nvidia GPU)
  2. Install WhisperX: pip install -r requirements_2_whisperx.txt
  3. On Linux, if you encounter an error similar to ImportError: libctranslate2-d3638643.so.4.4.0: cannot enable executable stack as shared object requires, run ./workaround_libctranslate2_clear-execstack.sh to clear conflicting GNU_STACK executable flag from the library; see OpenNMT/CTranslate2#1849.
  4. On Linux, if you encounter an error similar to Could not load library libcudnn_ops_infer.so.8, install CUDNN 11. You can also install nvidia-cudnn-cu11 from pip, then set LD_LIBRARY_PATH to the libraries location (e.g. export LD_LIBRARY_PATH="$VIRTUAL_ENV/lib/python3.x/site-packages/nvidia/cublas/lib:$VIRTUAL_ENV/lib/python3.x/site-packages/nvidia/cudnn/lib:$LD_LIBRARY_PATH").

๐ŸŽถ FUWAMOCO songs

A compilation of lyrics, fanchants & other content related to their songs.

โœจ Thanks

  • Cover Corp.: For hololive and for giving FUWAMOCO a chance to shine through.
  • Dylan Mendes, Kami-bako: For the timestamps in each video's comments sections; it was a useful resource to easily find sections and compare against my own timestamps.
  • FUWAMOCO: For their content and for being an inspiration to many, including myself!

About

๐Ÿพ Transcripts and summaries for FUWAMOCO content.

Resources

License

Contributing

Stars

Watchers

Forks