ooi-data-groom

About

Data Groom is a framework for the scheduled execution of plugins that can be created to "groom" the data. The framework provides methods for selecting and inserting records into the Cassandra stream tables in response to updates to the partition_metadata records of another stream. In this way, it can be used to generate precomputed (or "groomed") virtual streams that are derived from another stream that contains the actual raw data.

Prerequisites

Create a conda virtual environment with the necessary packages:

conda create -n data_groom ion-functions cassandra-driver ooi-data apscheduler pyyaml psycopg2 pandas -c ooi -c conda-forge -y

Running

The script manage-data-groom allows for starting and stopping the Data Groom process for one plugin. The name of a config file is passed to this script and that config file should contain the name of the plugin to run. The plugin should be located the "plugins" directory.

./manage-data-groom start botpt_precompute
./manage-data-groom stop botpt_precompute

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
ooi_data_groom		ooi_data_groom
.gitignore		.gitignore
README.md		README.md
botpt_precompute.yml		botpt_precompute.yml
manage-data-groom		manage-data-groom

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ooi-data-groom

About

Prerequisites

Running

About

Releases

Packages

Contributors 2

Languages

oceanobservatories/ooi-data-groom

Folders and files

Latest commit

History

Repository files navigation

ooi-data-groom

About

Prerequisites

Running

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages