Assignment 1 #66

Closed
wants to merge 3 commits into from

Conversation

lookatme818

What changes are you trying to make? (e.g. Adding or removing code, refactoring existing code, adding reports)

Adding new features to a dataset stored in Parquet files, using Dask.

What did you learn from the changes you have made?

Converting a large Dask DataFrame to a pandas DataFrame with .compute() loads the entire dataset into memory, which can cause memory issues and slow performance.

Was there another approach you were thinking about making? If so, what approach(es) were you thinking of?

Instead of converting the Dask DataFrame to pandas, I could compute the rolling averages directly in Dask.

Were there any challenges? If so, what issue(s) did you face? How did you overcome it?

I ran into errors caused by duplicate index labels during the calculations, such as reindexing failures.
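The PR does not say what caused the duplicate labels or how they were resolved. One common way they arise is an index that repeats after several files or groups are concatenated. The sketch below, with hypothetical column names `ticker` and `close`, shows how such duplicates can be detected and removed in pandas before a per-group lag is computed:

```python
import pandas as pd

# Hypothetical frame whose index repeats, as can happen after
# concatenating several files that each start counting from 0.
df = pd.DataFrame(
    {"ticker": ["A", "A", "B", "B"], "close": [1.0, 2.0, 10.0, 20.0]},
    index=[0, 1, 0, 1],
)

# Detect the problem before it surfaces as a reindexing error.
has_dupes = df.index.has_duplicates

# Replace the index with a fresh, unique RangeIndex.
clean = df.reset_index(drop=True)

# With unique labels, the per-group lag aligns back to the frame cleanly.
clean["lag_1"] = clean.groupby("ticker")["close"].shift(1)
print(has_dupes)
print(clean)
```

Whether `reset_index(drop=True)` is appropriate depends on whether the original index carried meaning, such as a date. If it did, moving it into a column first is the safer choice.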

How were these changes tested?

I tested the results by computing and displaying a sample with dd_feat.head(), which returns a pandas DataFrame. This helps verify that the new columns, such as lags and rolling averages, were calculated correctly.

A reference to a related issue in your repository (if applicable)

Checklist

  • I can confirm that my changes are working as intended

Hello, thank you for your contribution. If you are a participant, please close this pull request and open it in your own forked repository instead of here. Please read the instructions on your onboarding Assignment Submission Guide more carefully. If you are not a participant, please give us up to 72 hours to review your PR. Alternatively, you can reach out to us directly to expedite the review process.

@lookatme818
Author

Pull request opened in the wrong place.
