- Python: Runtime Platform
- OpenAI: Text Embedding and Class Inference
- Qdrant: Vector Database
- Set up a virtual environment for the runtime
- Linux or macOS

      python -m venv .venv
      source .venv/bin/activate
      pip install -r requirements.txt
- Windows

      python -m venv .venv
      .venv\Scripts\activate
      pip install -r requirements.txt
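To confirm that the environment is active before installing dependencies, a quick check along these lines may help. This is a minimal sketch; `in_virtualenv` is a hypothetical helper name and is not part of this repository.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the interpreter runs inside a venv-created environment."""
    # Inside a venv, sys.prefix points at the environment directory,
    # while sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    print("virtual environment active:", in_virtualenv())
```

If this prints `False` after activation, the shell is most likely still resolving `python` to the system interpreter.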
- Configure the environment variables
- Create a new `.env` file.
- Edit the `.env` file, replacing each placeholder with your own value:

      OPENAI_API_KEY="OPENAI_API_KEY"
      OPENAI_ORGANIZATION_KEY="OPENAI_ORGANIZATION_KEY"
      QDRANT_API_KEY="QDRANT_API_KEY"
      QDRANT_CLUSTER="QDRANT_CLUSTER"
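Before running the scripts, it can be useful to verify that all four variables are filled in. The sketch below uses only the standard library and assumes the simple `KEY="value"` layout shown above; how the project itself reads `.env` is not stated here, and the helper names `parse_env_text` and `missing_keys` are hypothetical.

```python
from pathlib import Path

# The four variable names listed in the .env template.
REQUIRED_KEYS = (
    "OPENAI_API_KEY",
    "OPENAI_ORGANIZATION_KEY",
    "QDRANT_API_KEY",
    "QDRANT_CLUSTER",
)


def parse_env_text(text: str) -> dict[str, str]:
    """Parse simple KEY="value" lines; blank lines and # comments are skipped."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional single or double quotes.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(values: dict[str, str]) -> list[str]:
    """Return required keys that are absent, empty, or still the placeholder."""
    return [
        key
        for key in REQUIRED_KEYS
        if not values.get(key) or values[key] == key
    ]


if __name__ == "__main__":
    env_path = Path(".env")
    values = parse_env_text(env_path.read_text()) if env_path.exists() else {}
    missing = missing_keys(values)
    print("missing keys:", missing if missing else "none")
```

A value that still equals its own key name is treated as unset, since that is what the template contains before editing.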
- Parse XML files to prepare the data

      python xml_parse.py
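The internals of `xml_parse.py` are not shown here, so the following is only an illustrative sketch of the general idea: flattening XML records into JSON-ready dictionaries. The function name `xml_to_records`, the `record` tag, and the sample fields are all hypothetical and may not match the project's actual data.

```python
import json
import xml.etree.ElementTree as ET


def xml_to_records(xml_text: str, record_tag: str = "record") -> list[dict[str, str]]:
    """Turn each <record_tag> element into a dict of child tag -> text."""
    root = ET.fromstring(xml_text)
    records = []
    for element in root.iter(record_tag):
        # Only direct children are read; missing text becomes an empty string.
        record = {child.tag: (child.text or "").strip() for child in element}
        records.append(record)
    return records


if __name__ == "__main__":
    # Hypothetical sample input, not taken from the project's data.
    sample = "<records><record><title>Example</title><label>A</label></record></records>"
    print(json.dumps(xml_to_records(sample), indent=2))
```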
- Set up Qdrant and populate the vector database

      python data_retrieve.py data/output.json
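What `data_retrieve.py` does internally is not documented here. As a rough sketch of how such a step could work, the code below embeds text with the OpenAI client and upserts the vectors into Qdrant. Several things are assumptions: that `QDRANT_CLUSTER` holds the cluster URL, that the variables are already exported to the process environment, that the JSON file is a list of objects with a `text` field, and the collection name, embedding model, batch size, and the helpers `chunked` and `upload` are all hypothetical.

```python
import json
import os
import sys
import uuid


def chunked(items, size):
    """Yield consecutive slices of `items` with at most `size` elements."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def upload(records, collection="documents", model="text-embedding-3-small"):
    """Embed each record's text and store it in a Qdrant collection (sketch)."""
    # Imported lazily so `chunked` stays usable without these packages.
    from openai import OpenAI
    from qdrant_client import QdrantClient
    from qdrant_client.models import Distance, PointStruct, VectorParams

    openai_client = OpenAI(
        api_key=os.environ["OPENAI_API_KEY"],
        organization=os.environ["OPENAI_ORGANIZATION_KEY"],
    )
    # Assumption: QDRANT_CLUSTER contains the full cluster URL.
    qdrant = QdrantClient(
        url=os.environ["QDRANT_CLUSTER"],
        api_key=os.environ["QDRANT_API_KEY"],
    )
    if not qdrant.collection_exists(collection):
        qdrant.create_collection(
            collection_name=collection,
            # text-embedding-3-small returns 1536-dimensional vectors by default.
            vectors_config=VectorParams(size=1536, distance=Distance.COSINE),
        )
    for batch in chunked(records, 64):
        response = openai_client.embeddings.create(
            model=model, input=[record["text"] for record in batch]
        )
        points = [
            PointStruct(id=str(uuid.uuid4()), vector=item.embedding, payload=record)
            for record, item in zip(batch, response.data)
        ]
        qdrant.upsert(collection_name=collection, points=points)


if __name__ == "__main__":
    # Assumption: the file is a JSON list of objects with a "text" field.
    with open(sys.argv[1], encoding="utf-8") as handle:
        upload(json.load(handle))
```

Batching keeps each embedding request small; the batch size of 64 is arbitrary and can be tuned.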