Changing upload process for lazy loaded DataFrames #165

AnimatorJoe · 2024-01-09T19:24:34Z

In the original DataFrame upload implementation for Snowflake and PySpark DataFrames, the DataFrames are loaded entire into memory first before being uploaded (PySpark, Snowflake).

However, this can cause problems if the DataFrames are larger than the driver's memory (PySpark, Snowflake).

This PR processes the DataFrames by batch, which solves the memory problem.

…dataframes by batch instead of all into memory at once

…into batch-loading-large-dataframes

changing dataframe upload implementation to load spark and snowflake …

e0d6ec5

…dataframes by batch instead of all into memory at once

AnimatorJoe requested a review from axl1313 January 9, 2024 19:25

AnimatorJoe added 7 commits January 9, 2024 20:31

Merge branch 'main' into batch-loading-large-dataframes

0469f19

removing benchmarking code

6611695

Merge remote-tracking branch 'origin/batch-loading-large-dataframes' …

ccd89f9

…into batch-loading-large-dataframes

Merge branch 'main' into batch-loading-large-dataframes

fe5df8c

using new backend endpoint for file upload

b004a98

fixing bugs

a623fe4

implementing changes based on backend pr suggestions

4a421f6

AnimatorJoe force-pushed the batch-loading-large-dataframes branch from 49f5de7 to 4a421f6 Compare January 26, 2024 22:31

AnimatorJoe added 5 commits January 26, 2024 14:31

Merge branch 'main' into batch-loading-large-dataframes

b0dfc7f

appeasing the linter god save the queen

119bad9

fixing type errors

deef6b3

fixing type errors

be76f88

casting int to int to appease the CI type checker

0b0c289

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Changing upload process for lazy loaded DataFrames #165

Changing upload process for lazy loaded DataFrames #165

AnimatorJoe commented Jan 9, 2024 •

edited

Loading

Changing upload process for lazy loaded DataFrames #165

Are you sure you want to change the base?

Changing upload process for lazy loaded DataFrames #165

Conversation

AnimatorJoe commented Jan 9, 2024 • edited Loading

AnimatorJoe commented Jan 9, 2024 •

edited

Loading