Video to Text: Natural language description generator for some given video. [Video Captioning]
Updated May 3, 2022 - Python
YouTube, Apple Podcasts (and more) to readable Markdown.
A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and create subtitles.
A real-time video-caption-to-conversation bot that captures frames, generates captions, and creates conversational responses using a large language model to produce interactive video descriptions.
Summary of video-to-text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*.
Everything is very simple: you either download an image file or specify its link when running a Python script, and you get a text file as output. You can immediately preview the result of the conversion on the command line.
Generate captions for videos using the power of OpenAI's Whisper API
A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.
Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*
Generate subtitles for all the videos in a folder with OpenAI's Whisper, privately on your computer.
Generating video descriptions using deep learning in Keras
Convert images or videos to ASCII in the terminal
A curated list of zero-shot captioning papers
Convert a video tutorial into a blog post using Claude 3
A video call application that recognizes gestures (sign language) and converts them into text and sound.
Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*
Chrome extension that helps students learn from YouTube videos
Generate automatic transcripts and subtitles for your videos with the help of the neural network-based.
An AI tool that helps analyze any YouTube video, gives the sentiment of the video, and suggests a description and topics related to the content. It also extracts subtitles from the video by understanding the audio, transcribes them into any language with timestamps, and embeds the subtitles into the video.
Text from the video is extracted and saved into a .docx file in the form of notes.