[Bug] dataprep/retriever is not working as expected in ChatQnA with Qdrant #1473

Open
2 of 8 tasks
lianhao opened this issue Jan 26, 2025 · 0 comments
Labels
bug Something isn't working

Comments

@lianhao
Collaborator

lianhao commented Jan 26, 2025

Priority

P2 - High

OS type

Ubuntu

Hardware type

Xeon-GNR

Installation method

  • Pull docker images from hub.docker.com
  • Build docker images from source
  • Other

Deploy method

  • Docker
  • Docker Compose
  • Kubernetes Helm Charts
  • Kubernetes GMC
  • Other

Running nodes

Single Node

What's the version?

git commit ed16308

Description

When launching ChatQnA with the Qdrant vector DB, a document uploads successfully but cannot be retrieved afterwards. The retriever always returns 0 documents, which is not what we expect.

The retrieval result is:

{"id":"6d08404f2c77900d78baeb58bbb97d6c","retrieved_docs":[],"initial_query":"What is the revenue of Nike in 2023?","top_n":1,"metadata":[]}

The expected result is a non-empty list of valid documents in the retrieved_docs field.

The qdrant-vector-db log shows that dataprep and the retriever are not using the same collection in Qdrant: requests go to both rag-qdrant (hyphen) and rag_qdrant (underscore).
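One way to confirm the mismatch without reading the log is to ask Qdrant which collections exist. The following is a minimal sketch, not part of the original report: it assumes Qdrant's REST API is reachable at http://localhost:6333 (Qdrant's default REST port; the port actually published by compose_qdrant.yaml may differ), and the helper names are hypothetical.

```python
import json
import urllib.request


def collection_names(payload):
    """Extract sorted collection names from a Qdrant `GET /collections` response body."""
    collections = payload.get("result", {}).get("collections", [])
    return sorted(c["name"] for c in collections)


def fetch_collection_names(base_url="http://localhost:6333"):
    # ASSUMPTION: base_url points at the Qdrant REST endpoint; adjust the
    # host and port to whatever the compose file actually exposes.
    with urllib.request.urlopen(f"{base_url}/collections", timeout=5) as resp:
        return collection_names(json.load(resp))
```

If `fetch_collection_names()` returns both rag-qdrant and rag_qdrant after one upload and one retrieval, the two services are writing to and reading from different collections.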

Reproduce steps

cd GenAIExamples/ChatQnA/docker_compose/intel/cpu/xeon
source ./set_env.sh
docker compose -f compose_qdrant.yaml up

Upload the document:

curl http://localhost:6043/v1/dataprep/ingest -X POST -H "Content-Type: multipart/form-data" -F "files=@./nike.txt"

Retrieve the document:

export your_embedding=$(python3 -c "import random; embedding = [random.uniform(-1, 1) for _ in range(768)]; print(embedding)")
curl http://localhost:6045/v1/retrieval -X POST -H 'Content-Type: application/json' -d "{\"text\":\"What is the revenue of Nike in 2023?\",\"embedding\":${your_embedding}}"
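The JSON body of the retrieval request is easy to break with shell quoting. As a hedged alternative, the sketch below builds the same body (a text field plus a random 768-dimensional embedding) with json.dumps so that escaping is handled automatically. The function name and the seed parameter are hypothetical and only serve the illustration.

```python
import json
import random


def build_retrieval_body(text, dim=768, seed=None):
    """Return a JSON string with the query text and a random mock embedding."""
    rng = random.Random(seed)  # seed only makes the mock embedding repeatable
    embedding = [rng.uniform(-1, 1) for _ in range(dim)]
    return json.dumps({"text": text, "embedding": embedding})
```

The returned string can be written to a file, for example body.json, and sent with curl's `-d @body.json` form, which avoids nested quotes on the command line.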

Raw log

docker compose -f compose_qdrant.yaml logs qdrant-vector-db

qdrant-vector-db  | 2025-01-26T08:27:08.808448Z  INFO actix_server::builder: Starting 151 workers
qdrant-vector-db  | 2025-01-26T08:27:08.808458Z  INFO actix_server::server: Actix runtime found; starting in Actix runtime
qdrant-vector-db  | 2025-01-26T08:27:35.611864Z  INFO actix_web::middleware::logger: 172.20.0.5 "GET /collections/rag-qdrant HTTP/1.1" 404 92 "-" "qdrant-client/1.13.2 python/3.11.11" 0.000409
qdrant-vector-db  | 2025-01-26T08:27:35.612829Z  INFO storage::content_manager::toc::collection_meta_ops: Deleting collection rag-qdrant
qdrant-vector-db  | 2025-01-26T08:27:35.612918Z  INFO actix_web::middleware::logger: 172.20.0.5 "DELETE /collections/rag-qdrant HTTP/1.1" 200 58 "-" "qdrant-client/1.13.2 python/3.11.11" 0.000167
qdrant-vector-db  | 2025-01-26T08:27:35.614417Z  INFO storage::content_manager::toc::collection_meta_ops: Creating collection rag-qdrant
qdrant-vector-db  | 2025-01-26T08:27:35.637944Z  INFO actix_web::middleware::logger: 172.20.0.5 "PUT /collections/rag-qdrant HTTP/1.1" 200 57 "-" "qdrant-client/1.13.2 python/3.11.11" 0.023616
qdrant-vector-db  | 2025-01-26T08:27:35.652448Z  INFO actix_web::middleware::logger: 172.20.0.5 "PUT /collections/rag-qdrant/points?wait=true HTTP/1.1" 200 79 "-" "qdrant-client/1.13.2 python/3.11.11" 0.000784
qdrant-vector-db  | 2025-01-26T08:28:03.911038Z  INFO actix_web::middleware::logger: 172.20.0.4 "GET /collections/rag_qdrant/exists HTTP/1.1" 200 66 "-" "qdrant-client/1.13.2 python/3.11.11" 0.000387
qdrant-vector-db  | 2025-01-26T08:28:03.913743Z  INFO actix_web::middleware::logger: 172.20.0.4 "GET /collections/rag_qdrant/exists HTTP/1.1" 200 66 "-" "qdrant-client/1.13.2 python/3.11.11" 0.000072
qdrant-vector-db  | 2025-01-26T08:28:03.914621Z  INFO storage::content_manager::toc::collection_meta_ops: Creating collection rag_qdrant
qdrant-vector-db  | 2025-01-26T08:28:03.938203Z  INFO actix_web::middleware::logger: 172.20.0.4 "PUT /collections/rag_qdrant HTTP/1.1" 200 57 "-" "qdrant-client/1.13.2 python/3.11.11" 0.023654
qdrant-vector-db  | 2025-01-26T08:28:03.945781Z  INFO actix_web::middleware::logger: 172.20.0.4 "POST /collections/rag_qdrant/points/query HTTP/1.1" 200 66 "-" "qdrant-client/1.13.2 python/3.11.11" 0.00130
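
The mismatch can also be pulled out of the raw log mechanically. The sketch below groups the collection names seen in the access-log lines by client IP. The regular expression and function name are assumptions written for this log format only. Reading 172.20.0.5 as dataprep (it writes points) and 172.20.0.4 as the retriever (it queries points) is an inference from the requests, not something the log states.

```python
import re
from collections import defaultdict

# Matches e.g.: 172.20.0.5 "PUT /collections/rag-qdrant/points?wait=true HTTP/1.1"
ACCESS_RE = re.compile(
    r'(\d{1,3}(?:\.\d{1,3}){3}) "(?:GET|PUT|POST|DELETE) /collections/([^/?\s]+)'
)


def collections_by_client(log_text):
    """Map each client IP to the set of collection names it requested."""
    seen = defaultdict(set)
    for match in ACCESS_RE.finditer(log_text):
        client, collection = match.groups()
        seen[client].add(collection)
    return dict(seen)
```

Applied to the log above, this reports rag-qdrant for one client and rag_qdrant for the other, which matches the hyphen versus underscore difference visible in the request paths.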

Attachments

No response

@lianhao lianhao added the bug Something isn't working label Jan 26, 2025