This is a Northernlion quote aggregator.
The current workflow is:
- extract audio from all videos using yt-dlp (3k videos from https://www.youtube.com/@TheLibraryofLetourneau ); the main channel and older videos are NOT available
- save the audio to Google Drive as webm
- transcribe with insanely-fast-whisper (https://github.com/Vaibhavs10/insanely-fast-whisper)
- parse the transcriptions
- load into MongoDB
- serve
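
A minimal sketch of the "parse the transcriptions" step follows. It is not the project's actual parser. It assumes each transcript is a JSON object with a `chunks` list whose items carry a `timestamp` pair and a `text` string, which should be checked against real output files. The function name `parse_transcript` and the document fields `video_id`, `start`, `end` are hypothetical.

```python
import json
from typing import Any


def parse_transcript(transcript: dict[str, Any], video_id: str) -> list[dict[str, Any]]:
    """Turn one transcript into a list of flat quote documents.

    Assumed input shape (verify against real transcript files):
    {"text": "...", "chunks": [{"timestamp": [start, end], "text": "..."}]}
    """
    docs = []
    for chunk in transcript.get("chunks", []):
        text = (chunk.get("text") or "").strip()
        if not text:
            # Skip empty or whitespace-only chunks.
            continue
        # Pad so a missing or short timestamp still yields (start, end).
        ts = list(chunk.get("timestamp") or []) + [None, None]
        docs.append(
            {
                "video_id": video_id,  # hypothetical field name
                "start": ts[0],        # seconds, or None if unknown
                "end": ts[1],          # seconds, or None if unknown
                "text": text,
            }
        )
    return docs


if __name__ == "__main__":
    sample = json.loads(
        '{"text": " hello world", "chunks": ['
        '{"timestamp": [0.0, 2.5], "text": " hello"},'
        '{"timestamp": [2.5, null], "text": " world"},'
        '{"timestamp": [4.0, 5.0], "text": "  "}]}'
    )
    for doc in parse_transcript(sample, "abc123"):
        print(doc)
```

The returned list is plain dicts, so it could be handed to pymongo's `insert_many` for the load step. Storing `video_id` and `start` with each quote would let the serving layer link back to the moment in the video.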