Request to retain dedicated plot column for TMP L1 #169

peterregier · 2024-05-07T20:30:01Z

I just pulled L1 data and it appears the plot column included in prior releases has been removed, with the plot now encoded only in the filename. I recognize and appreciate trying to reduce redundancy and am happy to parse from filenames, but since plot is critical metadata that all TEMPEST analyses will need, I think it warrants a dedicated column for ease of use. I'm intending this comment to only apply to TEMPEST.

bpbond · 2024-05-07T20:54:49Z

Thanks @peterregier

Why does this "only apply to TEMPEST"? Plot is a crucial piece of information everywhere else too, right? What about Site? Should that have its own dedicated column? Curious about your thoughts here.

> for ease of use

That's definitely a priority, so I'm very open to this. Just would like to understand better why something like this seems onerous:

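library(dplyr)  # bind_rows()
library(readr)  # read_csv()

# Read in all the 2024 TEMPEST control plot data
f <- list.files("TMP_2024/", pattern = "TMP_C_", full.names = TRUE)
dat <- bind_rows(lapply(f, read_csv))
# Site and plot are constant for this subset, so assign them directly
dat$Site <- "TMP"
dat$Plot <- "C"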

bpbond · 2024-05-07T21:03:47Z

Just making a note, adding "Site" and "Plot" columns to one of the 2024 TEMPEST files (I tested TMP_C_20240301-20240331_L1_v1-0.csv) increased its size from 44 to 47.3 MB. Not bad.
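For anyone who wants to repeat this kind of check on their own files, here is a minimal sketch in base R. It is not how the numbers above were produced. `csv_overhead_pct` is a hypothetical helper, and the toy data frame with its `TIMESTAMP` and `value` columns is made up for illustration, so its overhead will not match a real L1 file.

```r
# Relative increase implied by the sizes reported above (44 MB -> 47.3 MB)
reported_increase_pct <- (47.3 - 44) / 44 * 100  # roughly 7.5

# Hypothetical helper: write a data frame to temporary CSV files with and
# without constant Site/Plot columns and return the size increase in percent.
csv_overhead_pct <- function(dat, site, plot) {
  without <- tempfile(fileext = ".csv")
  with_cols <- tempfile(fileext = ".csv")
  on.exit(unlink(c(without, with_cols)))
  write.csv(dat, without, row.names = FALSE)
  dat$Site <- rep(site, nrow(dat))
  dat$Plot <- rep(plot, nrow(dat))
  write.csv(dat, with_cols, row.names = FALSE)
  (file.size(with_cols) - file.size(without)) / file.size(without) * 100
}

# Toy data standing in for a real L1 file (column names are assumptions)
toy <- data.frame(TIMESTAMP = 1:1000, value = rnorm(1000))
csv_overhead_pct(toy, "TMP", "C")
```

The relative overhead depends on how wide each row already is, so narrow toy data will show a much larger percentage than a real file would.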

peterregier · 2024-05-07T21:15:12Z

@bpbond my instinct on "easier" comes from my solution of parsing filenames with stringr::str_extract(name, "(?<=_)[^_]+(?=_)") when reading all csvs from a given L1 folder (e.g. TMP_2023). Absolutely agree your solution is not onerous; I'm just hoping to keep the data as easy as possible for everyone to use. If folks don't have to parse strings or assign columns, that's one less barrier in my mind.

I think you're absolutely right, having transect location as an equivalent variable would make sense for synoptics! Adding site as a column makes sense to me too. I recognize wanting to keep things lightweight, just putting in my 2 cents on that balance.
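To make the filename-parsing route concrete, here is a rough sketch in base R. It assumes filenames follow the layout Site_Plot_DateRange_Level_Version.csv, which is inferred only from the one example filename in this thread (TMP_C_20240301-20240331_L1_v1-0.csv). `parse_l1_filename` and `read_l1_with_plot` are hypothetical names, not part of any existing package.

```r
# Hypothetical helper: pull Site and Plot out of an L1 filename.
# Assumes underscore-separated fields with Site first and Plot second.
parse_l1_filename <- function(path) {
  stem <- tools::file_path_sans_ext(basename(path))
  parts <- strsplit(stem, "_", fixed = TRUE)[[1]]
  c(Site = parts[1], Plot = parts[2])
}

# Hypothetical helper: read one file and attach Site and Plot as columns
read_l1_with_plot <- function(path) {
  fields <- parse_l1_filename(path)
  dat <- read.csv(path, stringsAsFactors = FALSE)
  dat$Site <- rep(fields[["Site"]], nrow(dat))
  dat$Plot <- rep(fields[["Plot"]], nrow(dat))
  dat
}

# Parsing alone does not need the file to exist
parse_l1_filename("TMP_2024/TMP_C_20240301-20240331_L1_v1-0.csv")
```

A whole folder could then be combined with something like do.call(rbind, lapply(f, read_l1_with_plot)), where f comes from list.files(..., full.names = TRUE). This is exactly the per-user boilerplate that a dedicated column would make unnecessary.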

bpbond · 2024-05-07T21:25:39Z

It does seem penny-wise, pound-foolish to save 10% in file sizes but force everyone to again and again parse filenames to re-create those Site and Plot columns.

Thoughts @selinalcheng @roylrich @wilsonsj100 ?

* Add metadata required to map GCReW met to GCW-W
* Remove GCW mappings as no met station or sonde
* Fix solar total/flux units for 15min data #160
* Clean up reset() function
* Add site and plot back in; see #169
* Remove option to remove input files

bpbond · 2024-05-08T00:57:50Z

Addressed in #167

wilsonsj100 · 2024-05-08T13:09:06Z

Sorry for the late input, but I am pro adding the zone back in! Thank you!

roylrich · 2024-05-08T13:18:01Z

agreed

peterregier assigned bpbond and stephpenn1 May 7, 2024

bpbond added a commit that referenced this issue May 7, 2024

Add site and plot back in; see #169

61af273

bpbond closed this as completed May 8, 2024
