MMSD_trends

About this workflow

This project uses remake, a file dependency manager that ensures our analysis scripts run in a sensible order. From the remake documentation by richfitz:

You describe the beginning, intermediate and end points of your analysis, and how they flow together.

"targets" are the points in your analysis. They can be either files (data files for input; tables, plots, knitr reports for output) or they can be R objects (representing processed data, results, fitted models, etc).
"rules" are how the targets in your analysis relate together and are simply the names of R functions.
"dependencies" are the targets that need to already be made before a particular target can run (for example, a processed data set might depend on downloading a file; a plot might depend on a processed data set).
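As a rough illustration of how targets, rules, and dependencies fit together, the sketch below defines two plain R functions that could serve as rules. The function names, the column names, and the YAML shown in the comments are hypothetical and are not taken from this project's remake files.

```r
# Hypothetical rule: build a "processed_data" target from a raw data frame.
process_data <- function(raw) {
  # Drop rows with a missing value in the (assumed) `value` column.
  raw[!is.na(raw$value), , drop = FALSE]
}

# Hypothetical rule: build a "site_means" target from "processed_data".
summarize_sites <- function(processed) {
  aggregate(value ~ site, data = processed, FUN = mean)
}

# In a remake YAML file, each target names its rule, and the arguments of
# the rule are the target's dependencies. An illustrative fragment, assuming
# a "raw_data" target is defined elsewhere:
#
# targets:
#   processed_data:
#     command: process_data(raw_data)
#   site_means:
#     command: summarize_sites(processed_data)

# The same flow can be run by hand, without remake, on a toy data frame:
raw <- data.frame(site = c("A", "A", "B"), value = c(1, NA, 3))
site_means <- summarize_sites(process_data(raw))
```

Because rules are ordinary functions, they can be tested on small inputs like this before remake wires them into the full dependency chain.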

Setup

Configuring remake for this R project takes two steps.

Install remake:

devtools::install_github("richfitz/remake")

Install any packages this project requires that you don't already have:

remake::install_missing_packages()

Building the project

Build this project, or pieces of it, using remake.

library(remake)
# build the entire project:
make() 

# build only one stage of the project:
make(remake_file = "10_load_data.yml")

# build only one target of the project:
make("20_merge_data/doc/progress.csv")

Debugging in R

remake is "unapologetically R focussed", so you can debug a rule by inserting browser() calls directly into its function.
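A minimal sketch of what that can look like is shown below. The function name, its arguments, and the toy data are hypothetical; the interactive() guard is an added precaution so the function still runs unattended, not something remake requires.

```r
# Hypothetical rule function; the name and arguments are illustrative only.
merge_data <- function(samples, sites) {
  # Pause here in an interactive session so `samples` and `sites` can be
  # inspected at the prompt. Remove this line once debugging is done.
  if (interactive()) browser()
  merge(samples, sites, by = "site")
}

# Toy inputs for calling the rule by hand:
samples <- data.frame(site = c("A", "B"), conc = c(0.1, 0.2))
sites <- data.frame(site = c("A", "B"), lat = c(43, 44))
merged <- merge_data(samples, sites)
```

At the browser prompt, typing an object name prints it, `n` steps to the next line, and `c` continues execution.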

Want to do things the old-fashioned way? You can create the script that remake would execute if all targets were out of date:

remake::make_script()

To build a particular target outside the main workflow and capture its value, call remake::make directly:

merged_data <- remake::make("merged_data")

Starting fresh

As with make, you can start a "clean" build:

remake::make("clean")

Note that the above command deletes files and also gets rid of R objects. Alternatively, you can delete individual targets:

remake::delete("20_merge_data/doc/progress.csv")

What happens in a build

Subfolders named 'out' and 'log' exist within each numbered folder, and there are a few 'doc' subfolders here and there. On GitHub, these are empty except for README.md files. The README.md files serve as placeholders so that the directories can be versioned and don't need to be created by the project scripts. When you build the project, these folders become populated with data files, figures, etc. ('out'), and ancillary documentation ('doc').

R scripts

What's going on?

10_load_data

Raw data files are stored in a private S3 bucket. The function in this step assumes you have a "default" credential set up on your computer; it then downloads the files to the "1_get_raw_data/out" folder.

Dependency tree

To make a remake dependency diagram:

remake::diagram()

remake::diagram() takes the same arguments as remake::make(), so you can build a diagram for the whole project, a stage, or a single target.

Disclaimer

This software is in the public domain because it contains materials that originally came from the U.S. Geological Survey (USGS), an agency of the United States Department of Interior. For more information, see the official USGS copyright policy at https://www.usgs.gov/visual-id/credit_usgs.html#copyright

Although this software program has been used by the USGS, no warranty, expressed or implied, is made by the USGS or the U.S. Government as to the accuracy and functioning of the program and related program material nor shall the fact of distribution constitute any such warranty, and no responsibility is assumed by the USGS in connection therewith.

This software is provided "AS IS."
