This repository contains a comprehensive tutorial demonstrating how to use Couchbase Capella's AI Services auto-vectorization feature to automatically convert your data into vector embeddings and perform semantic search using LangChain.
The main tutorial is contained in the Jupyter notebook autovec_langchain.ipynb, which walks you through:
- Couchbase Capella Setup - Creating account, cluster, and access controls
- Data Upload & Processing - Using sample data
- Model Deployment - Deploying embedding models for vectorization
- Auto-Vectorization Workflow - Setting up automated embedding generation
- LangChain Integration - Building semantic search applications with vector similarity
- Python 3.8 or higher
- A Couchbase Capella account
- Basic understanding of vector databases and embeddings
-
Clone or download this repository
git clone <repository-url> cd autovec
-
Install Python dependencies
pip install jupyter pip install couchbase pip install langchain-couchbase pip install langchain-nvidia-ai-endpoints
-
Start Jupyter Notebook
jupyter notebook
or
jupyter lab
-
Open the tutorial notebook
- Navigate to
autovec_langchain.ipynbin the Jupyter interface - Follow the step-by-step instructions in the notebook
- Navigate to
**Note**: This tutorial is designed for educational purposes. For production deployments, ensure proper security configurations and SSL/TLS verification.