
5th Group Assessment for the Data Science Toolbox module at the University of Bristol. This assessment was about using parallelism to improve the performance of our code.

dsbristol/DST-Assessment-05

 
 


DST-Assessment-05

5th Group Assessment for the Data Science Toolbox module at the University of Bristol.

Project Members

  • Gabriel Grant
  • Alex Caian
  • Matt Corrie

Our project equity is split 30/35/35, in the order of the list above.

Reading Order

All report content is in the /Report directory and should be read in the following order:

  • 01 - Introduction
  • 02 - Word Scraping
  • 03 - PySpark Implementation
  • 04 - Other methods and analysis
  • 05 - Conclusion

Data

Data accessed in the report is found in the /Data directory of the GitHub repository.

Individual Work

Each member's individual work is in their own directory in the repository:

  • Gabriel Grant
  • Alex Caian
  • Matt Corrie

Languages

  • Jupyter Notebook 100.0%