fix storeindex cosmosnosql query issue - (BadRequest) One of the inpu… #17385

nqtung · 2024-12-28T22:12:17Z

Fix the CosmosNoSQL query parameters and update the dependency version to llama-index-core = "^0.12.0".

Fixes #17384

New Package?

Did I fill in the tool.llamahub section in the pyproject.toml and provide a detailed README.md for my new integration or package?

[ ] Yes
[x] No

Version Bump?

Did I bump the version in the pyproject.toml file of the package I am updating? (Except for the llama-index-core package)

[x] Yes
[ ] No

Type of Change

Please delete options that are not relevant.

[x] Bug fix (non-breaking change which fixes an issue)
[ ] New feature (non-breaking change which adds functionality)
[ ] Breaking change (fix or feature that would cause existing functionality to not work as expected)
[ ] This change requires a documentation update

How Has This Been Tested?

Your pull-request will likely not be merged unless it is covered by some form of impactful unit testing.

[ ] I added new unit tests to cover this change
[x] I believe this change is already covered by existing unit tests

Suggested Checklist:

[x] I have performed a self-review of my own code
[ ] I have commented my code, particularly in hard-to-understand areas
[ ] I have made corresponding changes to the documentation
[ ] I have added Google Colab support for the newly added notebooks.
[ ] My changes generate no new warnings
[ ] I have added tests that prove my fix is effective or that my feature works
[x] New and existing unit tests pass locally with my changes
[ ] I ran make format; make lint to appease the lint gods

…t values is invalid.

logan-markewich · 2024-12-29T00:56:28Z

...lama-index-vector-stores-azurecosmosnosql/llama_index/vector_stores/azurecosmosnosql/base.py

@@ -325,25 +325,31 @@ def _query(self, query: VectorStoreQuery, **kwargs: Any) -> VectorStoreQueryResu

        # If limit_offset_clause is not specified, add TOP clause
        if pre_filter is None or pre_filter.get("limit_offset_clause") is None:
-            query += "TOP @limit "
+            # query += "TOP @limit "
+            query += f"TOP {params.get('k', 2)} "


Generally, if you are making a change, just remove the old code rather than commenting it out.

logan-markewich · 2024-12-29T00:57:10Z

...lama-index-vector-stores-azurecosmosnosql/llama_index/vector_stores/azurecosmosnosql/base.py

@@ -325,25 +325,31 @@ def _query(self, query: VectorStoreQuery, **kwargs: Any) -> VectorStoreQueryResu

        # If limit_offset_clause is not specified, add TOP clause
        if pre_filter is None or pre_filter.get("limit_offset_clause") is None:
-            query += "TOP @limit "
+            # query += "TOP @limit "
+            query += f"TOP {params.get('k', 2)} "


Also curious, can you give some explanation on this change? This seems to be changing a lot of the query text

logan-markewich · 2024-12-29T00:57:29Z

...lama-index-vector-stores-azurecosmosnosql/llama_index/vector_stores/azurecosmosnosql/base.py


        # Add limit_offset_clause if specified
        if pre_filter is not None and pre_filter.get("limit_offset_clause") is not None:
            query += " {}".format(pre_filter["limit_offset_clause"])
        parameters = [
-            {"name": "@limit", "value": params["k"]},
-            {"name": "@embeddingKey", "value": self._embedding_key},
+            # {"name": "@limit", "value": params["k"]},


Why remove the limit param?

The limit is now written into the query as a literal value, so we don't need to pass @limit as a parameter.

logan-markewich · 2024-12-29T00:58:46Z

...lama-index-vector-stores-azurecosmosnosql/llama_index/vector_stores/azurecosmosnosql/base.py

-            {"name": "@limit", "value": params["k"]},
-            {"name": "@embeddingKey", "value": self._embedding_key},
+            # {"name": "@limit", "value": params["k"]},
+            # {"name": "@embeddingKey", "value": self._embedding_key},
            {"name": "@embeddings", "value": params["vector"]},


params["vector"] is not the same as self._embedding_key? looking at line 316 above

        params: Dict[str, Any] = {
            "vector": query.query_embedding,
            "path": self._embedding_key,
            "k": query.similarity_top_k,
        }

You cannot use the @embeddingKey parameter as a container field name in the query. The query needs the literal field name (for example c.embedding) instead of c.@embeddingKey.

The limit falls back to a default of 2 if no value is provided.
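The approach described in this thread (a literal TOP value and a literal field name, with only the embedding vector left as a bound parameter) might look roughly like the sketch below. It is not the code in this PR: `build_vector_query` is a hypothetical helper, the SELECT shape is an assumption, and the identifier check is an added safeguard, since any value interpolated into query text should be validated first.

```python
import re
from typing import Any, Dict, List, Optional, Tuple

# Assumption: the embedding field is a simple top-level property name.
_IDENTIFIER = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*$")


def build_vector_query(
    embedding_key: str,
    vector: List[float],
    k: Optional[int] = None,
    default_k: int = 2,
) -> Tuple[str, List[Dict[str, Any]]]:
    """Hypothetical helper: build query text plus its bound parameters."""
    # The field name is interpolated into the query text, so reject anything
    # that is not a plain identifier.
    if not _IDENTIFIER.match(embedding_key):
        raise ValueError(f"invalid embedding key: {embedding_key!r}")

    # Fall back to the default when no limit is given, and coerce to int so
    # only a number can end up in the TOP clause.
    top_k = default_k if k is None else int(k)
    if top_k <= 0:
        raise ValueError("k must be a positive integer")

    distance = f"VectorDistance(c.{embedding_key}, @embeddings)"
    query = (
        f"SELECT TOP {top_k} c.id, {distance} AS SimilarityScore "
        f"FROM c ORDER BY {distance}"
    )
    # Only the embedding vector stays a bound parameter.
    parameters = [{"name": "@embeddings", "value": vector}]
    return query, parameters


if __name__ == "__main__":
    text, bound = build_vector_query("embedding", [0.1, 0.2, 0.3], k=5)
    print(text)
    print(bound)
```

Keeping the vector as a parameter avoids serializing a long float list into the query string, while the two interpolated values are restricted to an integer and a validated identifier.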

fix storeindex cosmosnosql query issue - (BadRequest) One of the inpu…

1d66549

…t values is invalid.

dosubot bot added the size:S This PR changes 10-29 lines, ignoring generated files. label Dec 28, 2024

logan-markewich reviewed Dec 29, 2024
