Skip to content

Latest commit

 

History

History
126 lines (87 loc) · 4.46 KB

README.md

File metadata and controls

126 lines (87 loc) · 4.46 KB

Useful Data Science

A curated list of new or uncommon Python (R, etc.) libraries that are useful for Data Science analysis.

Inspired by awesome-python.


Printing

Libraries for enhanced print functions

  • printy - lite and cross-platform library that extends the functionalities of the built-in functions print() and input().
  • rich - Python library for rich text and beautiful formatting in the terminal.

Visualization

Libraries for visual representation of data

  • pandas-bokeh - Bokeh Plotting Backend for Pandas and GeoPandas

Machine Learning

Libraries for machine learning

  • yellowbrick - extends the Scikit-Learn API to make model selection and hyperparameter tuning easier

Database

Libraries or 3rd party tools for manipulating database systems

  • Beekeper Studio - IDE for databases. Similar to DataGrip, Toad, and SQL Developer.

Utility

Libraries that manipulate the data or dataframe in some useful way

  • dtale - Create interactive table from Pandas dataframe.
  • nbdev - Create library and PyPi package from a Jupyter notebook.
  • pyp - Easily run Python at the shell! Magical, but never mysterious.
  • deon - An ethics checklist for data scientists.

Virtual Machine

*Virtual machines or VM resources

  • macos-virtualbox - Push-button installer of macOS Catalina, Mojave, and High Sierra guests in Virtualbox for Windows, Linux, and macOS.

Resources

Where to discover new Python libraries.

Useful for Classes

Podcasts

Twitter

Websites

Weekly

Contributing

Your contributions are always welcome!


If you have any question about this opinionated list, do not hesitate to contact me: [email protected] or open an issue on GitHub.