Generate expected data for integration tests as feather files (#73)
Showing 29 changed files with 93 additions and 89 deletions.
Binary file added (+492 KB): tests/integration/data/expected_arrow/TestOrcFile.columnProjection.feather
3 further binary files not shown.
Binary file added (+415 KB): tests/integration/data/expected_arrow/TestOrcFile.testDate1900.feather
Binary file added (+1.23 MB): tests/integration/data/expected_arrow/TestOrcFile.testDate2038.feather
Binary file added (+37.3 KB): tests/integration/data/expected_arrow/TestOrcFile.testMemoryManagementV11.feather
Binary file added (+30.4 KB): tests/integration/data/expected_arrow/TestOrcFile.testMemoryManagementV12.feather
Binary file added (+40.1 KB): tests/integration/data/expected_arrow/TestOrcFile.testPredicatePushdown.feather
2 further binary files not shown.
Binary file added (+866 Bytes): tests/integration/data/expected_arrow/TestOrcFile.testStringAndBinaryStatistics.feather
Binary file added (+44.7 KB): tests/integration/data/expected_arrow/TestOrcFile.testStripeLevelStats.feather
Binary file added (+5.02 KB): tests/integration/data/expected_arrow/TestOrcFile.testUnionAndTimestamp.feather
Binary file added (+533 KB): tests/integration/data/expected_arrow/TestOrcFile.testWithoutIndex.feather
Binary file added (+263 KB): tests/integration/data/expected_arrow/TestStringDictionary.testRowIndex.feather
Binary file added (+178 KB): tests/integration/data/expected_arrow/TestVectorOrcFile.testLz4.feather
Binary file added (+178 KB): tests/integration/data/expected_arrow/TestVectorOrcFile.testLzo.feather
9 further binary files not shown.
@@ -0,0 +1,17 @@
# Requires pyarrow to be installed
import glob
from pyarrow import orc, feather

files = glob.glob("data/expected/*")
files = [file.removeprefix("data/expected/").removesuffix(".jsn.gz") for file in files]

ignore_files = [
    "TestOrcFile.testTimestamp" # Root data type isn't struct
]

files = [file for file in files if file not in ignore_files]

for file in files:
    print(f"Converting {file} from ORC to feather")
    table = orc.read_table(f"data/{file}.orc")
    feather.write_feather(table, f"data/expected_arrow/{file}.feather")
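
The name handling in the script above (strip the `data/expected/` prefix and the `.jsn.gz` suffix, skip ignored cases, then build the ORC input and feather output paths) can be sketched on its own without pyarrow. This is only an illustrative sketch: the helper names `stem_of` and `conversion_pairs` and the sample paths are hypothetical and not part of the commit, and it assumes every expected file ends in `.jsn.gz`, as the script does.

```python
from pathlib import PurePosixPath

# Hypothetical constants; values mirror the strings used in the script above.
EXPECTED_SUFFIX = ".jsn.gz"
IGNORED = {"TestOrcFile.testTimestamp"}  # root data type isn't a struct


def stem_of(expected_path):
    """Return the test-case name for a path like data/expected/<name>.jsn.gz."""
    name = PurePosixPath(expected_path).name
    # Same effect as str.removesuffix, but also works before Python 3.9.
    if name.endswith(EXPECTED_SUFFIX):
        name = name[: -len(EXPECTED_SUFFIX)]
    return name


def conversion_pairs(expected_paths, ignored=IGNORED):
    """Yield (orc_path, feather_path) for each expected file that is not ignored."""
    for stem in sorted(stem_of(p) for p in expected_paths):
        if stem in ignored:
            continue
        yield f"data/{stem}.orc", f"data/expected_arrow/{stem}.feather"


if __name__ == "__main__":
    sample = [
        "data/expected/TestOrcFile.testDate1900.jsn.gz",
        "data/expected/TestOrcFile.testTimestamp.jsn.gz",
    ]
    for orc_path, feather_path in conversion_pairs(sample):
        print(orc_path, "->", feather_path)
```

Keeping the path derivation separate from the conversion loop makes it possible to check which files would be converted, and which are skipped, before any ORC file is read.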