
[BUG] opensearch-py doesn't support chunked encoding with compression enabled with sigv4 AuthN/AuthZ #176

Open
kumjiten opened this issue Jun 25, 2022 · 3 comments
Labels
bug, good first issue, performance

Comments

@kumjiten

What is the bug?
The OpenSearch Python client always sends a Content-Length header and does not support chunked transfer encoding when compression is enabled.

How can one reproduce the bug?
Steps to reproduce the behavior:

  1. Create an OpenSearch domain in AWS, which supports IAM-based AuthN/AuthZ.
  2. Send a signed request to the OpenSearch cluster using the Python REST client (https://docs.aws.amazon.com/opensearch-service/latest/developerguide/request-signing.html#request-signing-python).
  3. Create the client with compression enabled:
search = OpenSearch(
    hosts = [{'host': host, 'port': 443}],
    http_auth = awsauth,
    use_ssl = True,
    verify_certs = True,
    http_compress = True, # enables gzip compression for request bodies <---------
    connection_class = RequestsHttpConnection
)
  4. Observe that it sends a Content-Length header by default:
python3  client.py
-----------START-----------
PUT https://xxxxxxxx:443/movies/_doc/1?refresh=true
content-type: application/json
user-agent: opensearch-py/2.0.0 (Python 3.8.9)
accept-encoding: gzip,deflate
content-encoding: gzip
Content-Length: 78 <--------------
x-amz-date: 20220625T131237Z
x-amz-content-sha256: 70ced8b1d2572d31b43dcf4ad0c58867d4f23bbbdb3bb24d7cb0059a87465816
Authorization: AWS4-HMAC-SHA256 Credential=AKIAV7BDGZUCRKUTEG7B/20220625/eu-west-1/es/aws4_request, SignedHeaders=content-type;host;x-amz-content-sha256;x-amz-date, Signature=5e8d252a9bd11728ec2e3305a74f2cc2eeddb29e69ae102cc815ed90bcb27d34

repro code:

from opensearchpy import OpenSearch, RequestsHttpConnection
from requests_aws4auth import AWS4Auth
import boto3
import pdb

host = '' # e.g. my-test-domain.us-east-1.es.amazonaws.com
region = 'eu-west-1' # e.g. us-west-1
service = 'es'
credentials = boto3.Session().get_credentials()
awsauth = AWS4Auth(credentials.access_key, credentials.secret_key, region, service, session_token=credentials.token)

# Create the client.
search = OpenSearch(
    hosts = [{'host': host, 'port': 443}],
    http_auth = awsauth,
    use_ssl = True,
    verify_certs = True,
    http_compress = True, # enables gzip compression for request bodies
    connection_class = RequestsHttpConnection
)

document = {
  "title": "Moneyball",
  "director": "Bennett Miller",
  "year": "2011"
}

# Send the request.
print(search.index(index='movies', id='1', body=document, refresh=True))

This causes the call to pass, but what if the content is too large and one wants to use chunked transfer encoding with compression?
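
For context, this is roughly what happens under the hood with http_compress=True (a minimal sketch, not the library's exact code): the whole serialized body is gzip-compressed in memory, so its final size is known up front and a Content-Length header is sent instead of chunked Transfer-Encoding.

import gzip

# Sketch of the current http_compress behavior (an assumption about
# the mechanism, not the library's exact code): the whole body is
# compressed in one shot, so its length is known and a Content-Length
# header can be sent. No chunking is ever needed.
body = b'{"title": "Moneyball", "director": "Bennett Miller", "year": "2011"}'
compressed = gzip.compress(body)
headers = {
    'content-type': 'application/json',
    'content-encoding': 'gzip',
    'content-length': str(len(compressed)),  # known up front, so no chunking
}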

What is the expected behavior?
It should support chunked transfer encoding with SigV4 so that large payloads work.

similar issue: opensearch-project/OpenSearch#3640

What is your host/environment?

  • OS: [e.g. iOS]
  • Version [e.g. 22]
  • Plugins

Do you have any screenshots?
If applicable, add screenshots to help explain your problem.

Do you have any additional context?
opensearch-project/OpenSearch#3640

@harshavamsi
Collaborator

@jiten1551 Are you saying that this is a bug or a feature that you might want? The default RequestsHttpConnection does not support chunked encoding; it would take a new flag in the connection class to allow for that. But just to separate things: SigV4 already works with compressed requests using http_compress. What you're asking for is compressing and chunking, which could be a new feature?
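
For illustration only, one hypothetical shape such a flag could take. ChunkedHttpConnection below is an untested sketch, not part of opensearch-py; it relies on requests switching to chunked Transfer-Encoding when the body is a generator, and it assumes http_compress is left off so the client doesn't try to gzip the generator as if it were bytes.

from opensearchpy import RequestsHttpConnection

# Hypothetical, untested sketch; not part of opensearch-py.
# Wrapping the body in a generator makes the underlying requests
# session send it with chunked Transfer-Encoding instead of a
# Content-Length header. Assumes http_compress=False.
class ChunkedHttpConnection(RequestsHttpConnection):
    def perform_request(self, method, url, params=None, body=None,
                        timeout=None, ignore=(), headers=None):
        if body is not None:
            body = iter([body])  # generator body => chunked encoding
        return super().perform_request(method, url, params=params,
                                       body=body, timeout=timeout,
                                       ignore=ignore, headers=headers)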

@dblock
Member

dblock commented Sep 30, 2022

I think it's a feature request: enable chunked transfer encoding (and ensure it works with SigV4). A similar problem in the Java client was that setting compression would also automatically turn on chunked transfer encoding, which would work, except with SigV4.

@wbeckler removed the untriaged label Nov 3, 2022
@fabioasdias

Python requests does chunked transfer encoding automatically if a generator is passed. In fact, one could arguably bypass the API and go straight to connection.perform_request with a generator, as long as http_compress is disabled (so gzip.compress doesn't run) and the body argument is just passed along to requests...
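
To make that concrete, a minimal sketch of the technique with plain requests (the endpoint, file name, and chunk size below are made up; note this is exactly where AWS4Auth breaks, since SigV4 signing wants a hash of the full payload up front):

import zlib
import requests

def gzip_stream(chunks):
    # Incrementally gzip an iterable of byte chunks;
    # wbits=31 selects the gzip container format.
    comp = zlib.compressobj(wbits=31)
    for chunk in chunks:
        data = comp.compress(chunk)
        if data:
            yield data
    yield comp.flush()

def read_chunks(path, size=64 * 1024):
    with open(path, 'rb') as f:
        while chunk := f.read(size):
            yield chunk

# requests switches to chunked Transfer-Encoding whenever the body
# is a generator, because the total length is unknown up front.
response = requests.post(
    'https://localhost:9200/_bulk',              # placeholder endpoint
    data=gzip_stream(read_chunks('bulk.json')),  # placeholder file
    headers={'content-type': 'application/x-ndjson',
             'content-encoding': 'gzip'},
)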

@wbeckler added the good first issue label Sep 19, 2023
@dblock added the performance label Dec 5, 2023