fix old notebook
richardsliu committed Apr 1, 2024
1 parent 929bbd9 commit 3bd07cd
Showing 1 changed file with 13 additions and 3 deletions.
16 changes: 13 additions & 3 deletions applications/rag/example_notebooks/rag-kaggle-ray-sql-latest.ipynb
@@ -1,5 +1,15 @@
{
"cells": [
+ {
+ "cell_type": "markdown",
+ "id": "afb6fda4-ffde-4831-88a5-ae41144492b2",
+ "metadata": {},
+ "source": [
+ "# RAG-on-GKE Application\n",
+ "\n",
+ "This is a Python notebook for generating the vector embeddings used by the RAG on GKE application. For full information, please checkout the GitHub documentation [here](https://github.com/GoogleCloudPlatform/ai-on-gke/blob/main/applications/rag/README.md).\n"
+ ]
+ },
{
"cell_type": "code",
"execution_count": null,
@@ -10,7 +20,7 @@
"# Replace these with your settings\n",
"# Navigate to https://www.kaggle.com/settings/account and generate an API token to be used to setup the env variable. See https://www.kaggle.com/docs/api#authentication how to create one.\n",
"KAGGLE_USERNAME = \"<username>\"\n",
- "KAGGLE_KEY = \"<token>\"\n"
+ "KAGGLE_KEY = \"<token>\""
]
},
{
@@ -36,8 +46,8 @@
"\n",
"# Download the zip file to local storage and then extract the desired contents directly to the GKE GCS CSI mounted bucket. The bucket is mounted at the \"/persist-data\" path in the jupyter pod.\n",
"!kaggle datasets download -d shivamb/netflix-shows -p ~/data --force\n",
- "!mkdir /persist-data/netflix-shows -p\n",
- "!unzip -o ~/data/netflix-shows.zip -d /persist-data/netflix-shows"
+ "!mkdir /data/netflix-shows -p\n",
+ "!unzip -o ~/data/netflix-shows.zip -d /data/netflix-shows"
]
},
{
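Note on the last hunk: the `mkdir -p` and `unzip -o` shell steps can also be expressed in plain Python, which may be easier to reuse when the target mount path changes again. The following is a minimal sketch using only the standard library; `extract_dataset` is a hypothetical helper name and is not part of the notebook in this commit.

```python
import zipfile
from pathlib import Path
from typing import List


def extract_dataset(zip_path: str, dest_dir: str) -> List[str]:
    """Extract every member of zip_path into dest_dir and return the member names.

    Hypothetical helper for illustration only; the notebook itself uses
    the shell commands shown in the diff.
    """
    dest = Path(dest_dir)
    # Create the target directory and any missing parents, like `mkdir -p`.
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        # extractall overwrites files that already exist, like `unzip -o`.
        archive.extractall(dest)
        return archive.namelist()
```

Assuming the archive was downloaded to `~/data` as in the notebook, a call might look like `extract_dataset(os.path.expanduser("~/data/netflix-shows.zip"), "/data/netflix-shows")`, with `os` imported first.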