Skip to content

Commit

Permalink
add logo and discord to all notebooks (#636)
Browse files Browse the repository at this point in the history
* add ppt rag notebook

* verified notebooks

* Delete docs/docs/examples/ppt_rag.ipynb

* verified ppt notebook

* add logo
  • Loading branch information
rishiraj authored May 31, 2024
1 parent 3e2920b commit 1bb4b9c
Show file tree
Hide file tree
Showing 20 changed files with 127 additions and 5 deletions.
6 changes: 6 additions & 0 deletions docs/docs/examples/GifSearch.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,12 @@
"source": [
"# **GIF search with MiniLM-L6 and CLIP embeddings**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>\n",
"\n",
"In this notebook, we'll create a semantic GIF search functionality with Indexify and Tumblr GIF dataset https://github.com/raingo/TGIF-Release. We'll use Indexify CLIP and MiniLM-L6 extractors to create embeddings for the GIFs and the search queries. We'll then use the embeddings to find the most similar GIFs to the search query."
]
},
Expand Down
6 changes: 6 additions & 0 deletions docs/docs/examples/HOA_Invoice_Data_Extraction.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,12 @@
"source": [
"# **Extracting Tabular Data from a PDF using Indexify**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>\n",
"\n",
"In this notebook, we're going to learn how we can extract transactional data from a PDF using Indexify. For that, we'll be using a sample PDF that contains transactional data from a Home Owners Association (HOA).\n",
"\n",
"We will explore several way to extract this data from the PDF using Indexify Extractor into a structured format that we can use further for RAG pipeline. This is the preview of the data that we will extract from the PDF.\n",
Expand Down
6 changes: 6 additions & 0 deletions docs/docs/examples/Image_RAG_Structured_Extraction.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,12 @@
"source": [
"# **Accurate Image RAG using Yolo and CodeGemma**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>\n",
"\n",
"Most Language Models(especially smaller ones) don't have vision capabilities. In this example, we will augment them with vision capabilities by automatically injecting structured data from images. The pipeline is tested to work at any scale, on laptops and with 10s of 1000s images on the cloud.\n",
"\n",
"What happens behind the scenes:\n",
Expand Down
8 changes: 7 additions & 1 deletion docs/docs/examples/Instructor_RAG.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,13 @@
"id": "defe2f2f",
"metadata": {},
"source": [
"## **Setup**"
"## **Setup**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>"
]
},
{
Expand Down
8 changes: 7 additions & 1 deletion docs/docs/examples/Invoices.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,13 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"## **Setup**"
"## **Setup**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>"
]
},
{
Expand Down
6 changes: 6 additions & 0 deletions docs/docs/examples/Moondream_Visual_Description_Index.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,12 @@
"source": [
"# **Indexing Images Based on Visual Description**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>\n",
"\n",
"In this notebook we show how you can index images by visual description. We use a small visual description model called MoonDream. \n",
"\n",
"Once you setup Indexify, it will continoulsy extract visual descriptions using Moondream and index the description as images are ingested. You can build reliable applications which have to react to images in real-time. The use of such pipelines spans security, retail, and robotics."
Expand Down
6 changes: 6 additions & 0 deletions docs/docs/examples/SEC_10_K_docs.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,12 @@
"source": [
"# **Introduction**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>\n",
"\n",
"This notebook demonstrates how Indexify can make it easier to quickly extract insights from complex SEC filings like the Form 10-K annual report. Using Uber's 10-K as an example, we show how the Indexify library can enable question answering on the filing text to get rapid answers. We also illustrate how schema-based extraction can pull key data points from the unstructured document. The combination of question answering and schema-based extraction provides a powerful toolkit to derive insights from dense financial filings."
]
},
Expand Down
6 changes: 6 additions & 0 deletions docs/docs/examples/Scientific_Journals.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,12 @@
"source": [
"# Q and A on Scientific Journal\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>\n",
"\n",
"In this notebook we show text extraction from a scientific paper. We are going to chunk and embed the text after text extraction. We show Langchain based extraction."
]
},
Expand Down
7 changes: 7 additions & 0 deletions docs/docs/examples/Sixt.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -5,6 +5,13 @@
"metadata": {},
"source": [
"# **RAG on Multiple Terms and Conditions Documents Varying By Geography**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>\n",
"\n",
"In this demo we are going to build a pipeline to build and update policy documents which vary by geography. \n",
"\n",
"Approach:\n",
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,13 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"## **Setup**"
"## **Setup**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>"
]
},
{
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,13 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"## **Setup**"
"## **Setup**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>"
]
},
{
Expand Down
6 changes: 6 additions & 0 deletions docs/docs/examples/Video_RAG.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,12 @@
"source": [
"# **RAG using Video as Context**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>\n",
"\n",
"This notebook will guide you on creating a RAG pipeline with a video as the knowledge source. The pipeline will be able to answer questions based on the video content."
]
},
Expand Down
7 changes: 7 additions & 0 deletions docs/docs/examples/Visual_Understanding_Clip_Yolo.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,13 @@
"metadata": {},
"source": [
"# Querying Images using CLIP and YOLO\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>\n",
"\n",
"This notebook demonstrates creating CLIP embeddings to search images based on a text query and YOLO object detection allowing you to query images containing specific objects."
]
},
Expand Down
6 changes: 6 additions & 0 deletions docs/docs/examples/asrdiarization_rag.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,12 @@
"source": [
"# **Transcribing Audio and Question Answering with ASR, Diarization, and Retrieval-Augmented Generation**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>\n",
"\n",
"This notebook demonstrates a powerful pipeline for transcribing audio, such as podcasts, and performing question answering using Retrieval-Augmented Generation (RAG). The pipeline combines Automatic Speech Recognition (ASR), diarization, and speculative decoding techniques to efficiently process audio data and generate informative responses.\n",
"\n",
"## Key Components\n",
Expand Down
6 changes: 6 additions & 0 deletions docs/docs/examples/audio_rag.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,12 @@
"source": [
"# **RAG using Audio as a Context**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>\n",
"\n",
"This notebook will show you how to use audio files as a context for your RAG pipeline. We are going to use 2 Indexify Extractors:\n",
"\n",
"- `tensorlake/whisper-asr`: This extractor will convert the audio file into text.\n",
Expand Down
6 changes: 6 additions & 0 deletions docs/docs/examples/audio_transcription.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,12 @@
"source": [
"# **Transcribing Audio with Indexify**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>\n",
"\n",
"In this notebook, we will use an Indexify Extractor (Whisper ASR) to transcribe audio files to texts."
]
},
Expand Down
6 changes: 6 additions & 0 deletions docs/docs/examples/efficient_rag.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,12 @@
"source": [
"# Efficient and supercharged RAG for mixed context texts with Indexify's framework, Gemini's 1M context & Arctic's embeddings\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>\n",
"\n",
"## Introduction\n",
"\n",
"Retrieval-augmented generation (RAG) systems have emerged as a groundbreaking approach in natural language processing, enabling the generation of accurate and contextually relevant responses by leveraging external knowledge. These systems have the potential to revolutionize various applications, from question answering and content generation to dialogue systems and beyond. However, despite their immense promise, modern RAG systems face a significant challenge when it comes to efficiently processing large mixed context texts.\n",
Expand Down
6 changes: 6 additions & 0 deletions docs/docs/examples/pdfqa.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,12 @@
"source": [
"## **Installation and Setup**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>\n",
"\n",
"1. Install the `indexify-extractor-sdk` package using pip."
]
},
Expand Down
6 changes: 6 additions & 0 deletions docs/docs/examples/ppt_rag.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,12 @@
"source": [
"# **Introduction**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>\n",
"\n",
"This notebook demonstrates how Indexify can make it easier to quickly extract insights from complex real-world PowerPoint presentations like a talk given on \"[A little guide to building Large Language Models in 2024](https://docs.google.com/presentation/d/1IkzESdOwdmwvPxIELYJi8--K3EZ98_cL6c5ZcLKSyVg/edit?usp=sharing)\" by Thomas, the co-founder of Hugging Face. Using the slides as an example, we show how the Indexify library can enable question answering on the talk to get rapid answers."
]
},
Expand Down
8 changes: 7 additions & 1 deletion docs/docs/examples/rag.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,13 @@
"id": "2a784b7a",
"metadata": {},
"source": [
"## **Setup**"
"## **Setup**\n",
"\n",
"<div class=\"align-center\">\n",
" <a href=\"https://getindexify.ai/\"><img src=\"https://getindexify.ai/Indexify_Logo_Wordmark.svg\" width=\"145\"></a>\n",
" <a href=\"https://discord.com/invite/kF8UZACA7r\"><img src=\"https://raw.githubusercontent.com/rishiraj/random/main/Discord%20button.png\" width=\"145\"></a><br>\n",
" Join Discord if you need help + ⭐ <i>Star us on <a href=\"https://github.com/tensorlakeai/indexify\">Github</a></i> ⭐\n",
"</div>"
]
},
{
Expand Down

0 comments on commit 1bb4b9c

Please sign in to comment.