version 0.5.3 - Mykolaiv
Pyinterpolate is the Python library for spatial statistics. The package provides access to spatial statistics tools used in various studies. This package helps you interpolate spatial data with the Kriging technique.
If you’re:
- GIS expert,
- geologist,
- mining engineer,
- ecologist,
- public health specialist,
- data scientist.
Then this package may be useful for you. You could use it for:
- spatial interpolation and spatial prediction,
- alone or with machine learning libraries,
- for point and block datasets.
Pyinterpolate allows you to perform:
- Ordinary Kriging and Simple Kriging (spatial interpolation from points),
- Centroid-based Poisson Kriging of polygons (spatial interpolation from blocks and areas),
- Area-to-area and Area-to-point Poisson Kriging of Polygons (spatial interpolation and data deconvolution from areas to points).
- Inverse Distance Weighting.
- Semivariogram regularization and deconvolution.
- Semivariogram modeling and analysis.
The package has multiple spatial interpolation functions. The flow of analysis is usually the same for each method:
[1.] Read and prepare data.
from pyinterpolate import read_txt
point_data = read_txt('dem.txt')
[2.] Analyze data, calculate the experimental variogram.
from pyinterpolate import build_experimental_variogram
search_radius = 500
max_range = 40000
experimental_semivariogram = build_experimental_variogram(input_array=point_data,
step_size=search_radius,
max_range=max_range)
[3.] Data transformation, fit theoretical variogram.
from pyinterpolate import build_theoretical_variogram
semivar = build_theoretical_variogram(experimental_variogram=experimental_semivariogram,
model_name='spherical',
sill=400,
rang=20000,
nugget=0)
[4.] Interpolation.
from pyinterpolate import kriging
unknown_point = (20000, 65000)
prediction = kriging(observations=point_data,
theoretical_model=semivar,
points=[unknown_point],
how='ok',
no_neighbors=32)
[5.] Error and uncertainty analysis.
print(prediction) # [predicted, variance error, lon, lat]
>> [211.23, 0.89, 20000, 60000]
With pyinterpolate, we can retrieve the point support model from blocks. Example from Tick-borne Disease Detector study for European Space Agency - COVID-19 population at risk mapping. We did it with the Area-to-Point Poisson Kriging technique from the package. Countries worldwide aggregate disease data to protect the privacy of infected people. But this kind of representation introduces bias to the decision-making process. To overcome this bias, you may use Poisson Kriging. Block aggregates of COVID-19 infection rate are transformed into new point support semivariogram created from population density blocks. We get the population at risk map:
Beta (late) version: the structure will be in most cases stable, new releases will introduce new classes and functions instead of API changes.
Setup with conda: conda install -c conda-forge pyinterpolate
Setup with pip: pip install pyinterpolate
Detailed instructions on how to install the package are presented in the file SETUP.md. We pointed out there most common problems related to third-party packages.
You may follow those setup steps to create a conda environment with the package for your work:
[1.] Create conda environment with Python >= 3.8. Recommended is Python 3.10.
conda create -n [YOUR ENV NAME] -c conda-forge python=3.10 pyinterpolate
[2.] Activate environment.
conda activate [YOUR ENV NAME]
[3.] You are ready to use the package!
With Python>=3.8 and system libspatialindex_c.so
dependencies you may install package by simple command:
pip install pyinterpolate
A world of advice, you should always use Virtual Environment for the installation. You may consider using PipEnv too.
All tests are grouped in the test
directory. If you would like to contribute, then you won't avoid testing, but it is described step-by-step here: CONTRIBUTION.md
- Tick-Borne Disease Detector (Data Lions company) for the European Space Agency (2019-2020).
- B2C project related to the prediction of demand for specific flu medications (2020).
- B2G project related to the large-scale infrastructure maintenance (2020-2021).
- E-commerce service for reporting and analysis, building spatial / temporal profiles of customers (2022+).
- The external data augmentation for e-commerce services (2022+).
Join our community in Discord: Discord Server Pyinterpolate
PyInterpolate was created thanks to many resources and all of them are pointed here:
- Armstrong M., Basic Linear Geostatistics, Springer 1998,
- GIS Algorithms by Ningchuan Xiao: https://uk.sagepub.com/en-gb/eur/gis-algorithms/book241284
- Pardo-Iguzquiza E., VARFIT: a fortran-77 program for fitting variogram models by weighted least squares, Computers & Geosciences 25, 251-261, 1999,
- Goovaerts P., Kriging and Semivariogram Deconvolution in the Presence of Irregular Geographical Units, Mathematical Geology 40(1), 101-128, 2008
- Deutsch C.V., Correcting for Negative Weights in Ordinary Kriging, Computers & Geosciences Vol.22, No.7, pp. 765-773, 1996
Moliński, S., (2022). Pyinterpolate: Spatial interpolation in Python for point measurements and aggregated datasets. Journal of Open Source Software, 7(70), 2869, https://doi.org/10.21105/joss.02869
Core requirements and dependencies are:
- Python >= 3.8
- descartes
- geopandas
- matplotlib
- numpy
- tqdm
- pyproj
- scipy
- shapely
- fiona
- rtree
- prettytable
- pandas
- dask
- hdbscan
- pylibtiff
- pyarrow
You may check a specific version of requirements in the setup.cfg
file.
High level overview:
-
pyinterpolate
-
distance
- distance calculation, -
idw
- inverse distance weighting interpolation, -
io
- reads and prepares input spatial datasets, -
kriging
- Ordinary Kriging, Simple Kriging, Poisson Kriging: centroid based, area-to-area, area-to-point, -
pipelines
- a complex functions to smooth a block data, download sample data, compare different kriging techniques, and filter blocks, -
processing
- core data structures of the package:Blocks
andPointSupport
, and additional functions used for internal processes, -
variogram
- experimental variogram, theoretical variogram, variogram point cloud, semivariogram regularization & deconvolution, -
viz
- interpolation of smooth surfaces from points into rasters.
-
-
tutorials
- tutorials (Basic, Intermediate and Advanced).
Datasets and scripts to download spatial data from external API's are available in a dedicated package: pyinterpolate-datasets