GreenplumPython is a Python library that enables the user to interact with database in a Pythonic way.
GreenplumPython provides a pandas-like DataFrame API that
- looks familiar and intuitive to Python users
- is powerful to do complex analytics, such as statistical analysis, with UDFs and UDAs
- encapsulates common best practices and avoids common pitfalls in Greenplum, compared to writing SQL directly
To install the latest development version, do
pip3 install --user git+https://github.com/greenplum-db/GreenplumPython
To install the latest released version, do
pip3 install --user greenplum-python
Note: The --user
option in an active virtual environment will install to the local user python location.
Since a user location doesn't make sense for a virtual environment, to install the GreenplumPython library,
just remove --user
from the above commands.
The documentation of GreenplumPython can be viewed at: