Heatmap of observations #202

bpbond · 2024-08-02T16:53:49Z

@stephpenn1 something like this?

bpbond · 2024-08-02T16:57:05Z

Plotting the data by quarter seems better

stephpenn1 · 2024-08-02T17:40:24Z

Yes I like this - leaning towards quarterly but am going to do some exploring on my own with tick marks and will come back with thoughts!

stephpenn1 · 2024-08-02T18:07:22Z

If we go the monthly route (also using my google drive downloaded data i see very obviously that it did split the download into multiple zips):

Does the above graph provide extra clarity to use the monthly plotting? If it's still too confusing, I say we go with quarterly with more tick marks.

Now complicating it a step further, it would be really cool to have a waffle chart with data availability colored by the Instrument column :)

bpbond · 2024-08-02T18:28:33Z

Whoah, cool graph you made, although why all the missing data? 😕

it would be really cool to have a waffle chart with data availability colored by the Instrument column

Oh!

stephpenn1 · 2024-08-02T18:37:59Z

Missing data is because i only used one of the zip files what google drive downloaded (I thought the others were duplicates)

bpbond · 2024-08-02T18:40:36Z

Oh got it, thanks.

Are you going to try the waffle chart? Is that like what you did for the COSORE paper?

stephpenn1 · 2024-08-02T19:01:08Z

I'm trying it out but if it becomes too complicated we'll go with your existing code.

Currently, does this only count # of rows and not "look" into the files?
results$rows[i] <- length(readLines(fls[i])) - 1

bpbond · 2024-08-02T19:05:21Z

Exactly right!

stephpenn1 · 2024-08-02T20:14:09Z

fls <- list.files("~/Documents/data package/v1-1 beta/", pattern = "*.csv$", full.names = TRUE, recursive = TRUE)

library(tibble)
results <- list()

for(i in seq_along(fls)) {
    message(basename(fls[i]))
    
    # results$rows[i] <- length(readLines(fls[i])) - 1
    results[[basename(fls[i])]] <- readr::read_csv(fls[i]) %>% group_by(Site, Instrument, year(TIMESTAMP), month(TIMESTAMP)) %>% summarise(n = n())
}

bind_rows(results) %>% 
    rename(Year = `year(TIMESTAMP)`, Month = `month(TIMESTAMP)`) %>% 
    arrange(Site, Year, Month) -> r

r %>% group_by(Site, Instrument, Year, Month) %>% 
    summarise(n = sum(n)) %>% 
    arrange(Month, Instrument) %>% 
    mutate(data_present = ifelse(n > 0, "Yes", "No"), Month = month.abb[Month]) %>% 
    filter(Site == "MSM") %>% select(-n) %>% ggplot() + 
    geom_tile(aes(x = factor(Month, levels = month.abb), y = Instrument, fill = data_present), size = 01) + 
    facet_wrap(~Year) + 
    theme_minimal() + 
    theme(axis.text.x = element_text(angle = 45, vjust = 0.5, hjust=1)) + 
    labs(x = "Month") + 
    scale_fill_manual(values = "palegreen3")

bpbond · 2024-08-02T21:27:18Z

This will be useful for users but also for you/us. It seems like a great way to look for unexpected missing data streams, etc.

Simplify code and add comments

stephpenn1 · 2024-09-03T16:52:25Z

Good to merge once checks pass

Update cumulative-observations.R

fc109de

bpbond requested a review from stephpenn1 August 2, 2024 16:53

Quarterly

62a81ab

stephpenn1 and others added 8 commits August 27, 2024 17:39

Create availability_graph.R

83e5fec

Simplify code and add comments

7699a75

Update availability_graph.R

6e9af6d

Update availability_graph.R

2916976

Merge branch 'main' into heatmap-simplify

71c8827

Merge branch 'main' into heatmap-simplify

93d19bd

filter out tempest for now

4313f92

Merge pull request #206 from COMPASS-DOE/heatmap-simplify

1d036ea

Simplify code and add comments

stephpenn1 approved these changes Sep 3, 2024

View reviewed changes

Move avail graph code to code_examples

d71117d

stephpenn1 merged commit f7418f0 into main Sep 3, 2024
1 check passed

stephpenn1 deleted the heatmap branch September 3, 2024 17:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Heatmap of observations #202

Heatmap of observations #202

bpbond commented Aug 2, 2024

bpbond commented Aug 2, 2024 •

edited

Loading

stephpenn1 commented Aug 2, 2024

stephpenn1 commented Aug 2, 2024

bpbond commented Aug 2, 2024

stephpenn1 commented Aug 2, 2024

bpbond commented Aug 2, 2024

stephpenn1 commented Aug 2, 2024

bpbond commented Aug 2, 2024

stephpenn1 commented Aug 2, 2024

bpbond commented Aug 2, 2024

stephpenn1 commented Sep 3, 2024

Heatmap of observations #202

Heatmap of observations #202

Conversation

bpbond commented Aug 2, 2024

bpbond commented Aug 2, 2024 • edited Loading

stephpenn1 commented Aug 2, 2024

stephpenn1 commented Aug 2, 2024

bpbond commented Aug 2, 2024

stephpenn1 commented Aug 2, 2024

bpbond commented Aug 2, 2024

stephpenn1 commented Aug 2, 2024

bpbond commented Aug 2, 2024

stephpenn1 commented Aug 2, 2024

bpbond commented Aug 2, 2024

stephpenn1 commented Sep 3, 2024

bpbond commented Aug 2, 2024 •

edited

Loading