ask-me-anything

Minimal System Requirements

Storage >= 16GB
RAM >= 8GB
CPU cores >= 4
Core Frequency >= 3GHz

Local Envr. Setup

python3 -m venv agents_env
source agents_env/bin/activate
python -m pip install --upgrade pip

pip install -r requirements-agents.txt

huggingface-cli download ojjsaw/reranking_model
huggingface-cli download ojjsaw/embedding_model
huggingface-cli download OpenVINO/Phi-3-mini-128k-instruct-int4-ov

Local Run Steps

Ideal Setup: Run LLM model on GPU and other 2 models on CPU (default all models run on CPU)
To switch to gpu, update LLM_DEVICE to "GPU" in llamaindex-minimal.py

uvicorn llamaindex-minimal:app

To run via Swagger (non streaming responses), navigate to http://127.0.0.1:8000/docs
To use upload api via terminal CURL:

curl -X 'POST' \
  'http://127.0.0.1:8000/upload-pdf-vector-index' \
  -H 'accept: application/json' \
  -H 'Content-Type: multipart/form-data' \
  -F '[email protected];type=application/pdf'

To run ask-me-anything api via terminal preferred for streaming responses.

Note: Use -N param to prevent buffer cache and see live chunked response data

curl -N -X 'POST' \
  'http://127.0.0.1:8000/ask-me-anything' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "question": "What is the range of Thermal Design Power (TDP) for Intel Xeon 6 processors with E-cores?"
}'

Intel GPU Setup on Linux and WSL2 Linux for Ubuntu 22.04

Repo Signing

wget -qO - https://repositories.intel.com/gpu/intel-graphics.key |
     sudo gpg --yes --dearmor --output /usr/share/keyrings/intel-graphics.gpg

APT Repo Setup

wget -qO - https://repositories.intel.com/gpu/intel-graphics.key | \
  sudo gpg --yes --dearmor --output /usr/share/keyrings/intel-graphics.gpg
echo "deb [arch=amd64,i386 signed-by=/usr/share/keyrings/intel-graphics.gpg] https://repositories.intel.com/gpu/ubuntu jammy client" | \
  sudo tee /etc/apt/sources.list.d/intel-gpu-jammy.list
sudo apt update

Install minimal GPU deps for AI workload

apt-get install -y ocl-icd-libopencl1 intel-opencl-icd intel-level-zero-gpu level-zero

Troubleshooting/alternate OS Steps:

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README-dev.md		README-dev.md
README.md		README.md
fastapi-test.py		fastapi-test.py
llamaindex-minimal.py		llamaindex-minimal.py
requirements-agents.txt		requirements-agents.txt
text_example_en.pdf		text_example_en.pdf
xeon6-QnA		xeon6-QnA
xeon6-e-cores-network-and-edge-brief.pdf		xeon6-e-cores-network-and-edge-brief.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ask-me-anything

Minimal System Requirements

Local Envr. Setup

Local Run Steps

Intel GPU Setup on Linux and WSL2 Linux for Ubuntu 22.04

About

Releases

Packages

Languages

License

gitist/graph-then-ask-me-anything

Folders and files

Latest commit

History

Repository files navigation

ask-me-anything

Minimal System Requirements

Local Envr. Setup

Local Run Steps

Intel GPU Setup on Linux and WSL2 Linux for Ubuntu 22.04

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages