Releases: scienxlab/redflag
Releases · scienxlab/redflag
v0.1.8
- Added Wasserstein distance comparisons for univariate and multivariate distributions. This works for either a
groups
array, or for multiple dataset splits if that's more convenient. - Improved
get_outliers()
, removing OneClassSVM method and adding EllipticEnvelope and Mahalanobis distance. - Added Mahalanobis distance outlier detection function to serve
get_outlier()
or be used on its own. Reproduces the resultszscore_outliers()
used to give for univariate data, so removed that. - Added
kde_peaks()
function to find peaks in a kernel density estimate. This also needed some other functions, includingfit_kde()
,get_kde()
,find_large_peaks()
, and the bandwidth estimators,bw_silverman()
andbw_scott()
. - Added
classes
argument to the class imbalance function, in case there are classes with no data, or to override the classes in the data. - Fixed a bug in the
feature_importances()
function. - Fixed a bug in the
is_continuous()
function. - Improved the
Using_redflag.ipynb
notebook. - Added
has_nans()
,has_monotonic()
, andhas_flat()
functions to detect interpolation issues. - Moved some more helper functions into utils, eg
iter_groups()
,ecdf()
,flatten()
,stdev_to_proportion()
andproportion_to_stdev()
. - Wrote a lot more tests, coverage is now at 95%.
Preview
A lot more tests
Still very rough and feature-light.
Initial release
Very early days!