Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

parquet.ParquetFormatException: Unsupported encoding: RLE_DICTIONARY #86

Open
d33tah opened this issue Mar 28, 2024 · 0 comments
Open

Comments

@d33tah
Copy link

d33tah commented Mar 28, 2024

> echo -e 'hi' | parquet-fromcsv  --input-file /dev/stdin  --schema <(  echo 'message schema { OPTIONAL BYTE_ARRAY key (STRING); }' ) --output-file test.parquet
> base64 -w 0 < test.pq 
UEFSMRUEFRoVGkwVBBUAEgAAAwAAAGtleQIAAABoaRUAFRIVEiwVBBUQFQYVBhxYA2tleRgCaGkAAAACAAAABAEBAwIVDBk1AAYQGRgDa2V5FQAWBBaAARaAASY2JgAcWANrZXkYAmhpAAAZEQIZGAJoaRkYA2tleRUAGRYAABkcFj4VShYAAAAVAhksSAxhcnJvd19zY2hlbWEVAgAVDCUCGANrZXklAEwcAAAAFgQZHBkcJogBHBUMGTUABhAZGANrZXkVABYEFoABFoABJj4mCBxYA2tleRgCaGkAABb+ARUUFtYBFSgAFoABFgQmCBaAARQAABkcGAxBUlJPVzpzY2hlbWEYnAEvLy8vLzJ3QUFBQVFBQUFBQUFBS0FBd0FDZ0FKQUFRQUNnQUFBQkFBQUFBQUFRUUFDQUFJQUFBQUJBQUlBQUFBQkFBQUFBRUFBQUFVQUFBQUVBQVVBQkFBRGdBUEFBUUFBQUFJQUJBQUFBQVlBQUFBREFBQUFBQUFBUVVRQUFBQUFBQUFBQVFBQkFBRUFBQUFBd0FBQUd0bGVRQT0AGBlwYXJxdWV0LXJzIHZlcnNpb24gNDYuMC4wADoBAABQQVIx
> python3 -m parquet test.pq
Traceback (most recent call last):
  File "<frozen runpy>", line 198, in _run_module_as_main
  File "<frozen runpy>", line 88, in _run_code
  File "/tmp/pq2/parquet-python/parquet/__main__.py", line 63, in <module>
    main()
  File "/tmp/pq2/parquet-python/parquet/__main__.py", line 59, in main
    parquet.dump(args.file, args)
  File "/tmp/pq2/parquet-python/parquet/__init__.py", line 526, in dump
    return _dump(file_obj, options=options, out=out)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/tmp/pq2/parquet-python/parquet/__init__.py", line 506, in _dump
    for row in DictReader(file_obj, options.col):
  File "/tmp/pq2/parquet-python/parquet/__init__.py", line 415, in DictReader
    for row in reader(file_obj, columns):
  File "/tmp/pq2/parquet-python/parquet/__init__.py", line 464, in reader
    values = read_data_page(file_obj, schema_helper, page_header, cmd,
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/tmp/pq2/parquet-python/parquet/__init__.py", line 376, in read_data_page
    raise ParquetFormatException("Unsupported encoding: {}".format(
parquet.ParquetFormatException: Unsupported encoding: RLE_DICTIONARY
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant