-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Module for sending datasets to Google cloud #18
base: dev
Are you sure you want to change the base?
Module for sending datasets to Google cloud #18
Conversation
@akiraonstarknet and @ManvithaMolakala Can you pls review |
@@ -0,0 +1,86 @@ | |||
import logging |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please add comments throughtout.
src/data/datasets.py
Outdated
from google.cloud import bigquery | ||
from google.cloud import storage | ||
|
||
import yaml |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please update pyproject.toml file
tests/test_datasets.py
Outdated
@@ -0,0 +1,30 @@ | |||
import unittest |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In this project, we use pytest. Please write the code using pytest for uniformity
tests/test_datasets.py
Outdated
@@ -0,0 +1,30 @@ | |||
import unittest | |||
from unittest.mock import Mock, patch |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Explicitly mention in the README or comments if the script depends on gcloud auth for credentials or requires a GOOGLE_APPLICATION_CREDENTIALS environment variable.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If additional configurations are needed (e.g., a secrets file or service account JSON), document their usage and location.
tests/test_datasets.py
Outdated
# Assert no exceptions raised (would require more detailed mocks for deeper validation) | ||
|
||
if __name__ == '__main__': | ||
unittest.main() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please address the above comments before I take a final look.
…ate, delete, and list functionalities
…ate, delete, and list functionalities
Closes #17
Created a bucket to monitors a Cloud Storage for file uploads or changes.
Wrote a function for uploading the data
Uploaded a file and tested it on big query to see the datasets