The goal is to use observational ocean data to explore the ocean. We begin with the Oceans Observatories Initiative producing data since 2014: From continental shelves to the deep ocean, from sea floor to surface, from seismology to marine mammal vocalizations to physical, chemical and biological sensors.
We begin in the ocean's photic zone, from 200 meters depth to the surface, the region of light. Regular observations include temperature, dissolved oxygen, salinity, particulate backscatter, chlorophyll concentration, dissolved organic matter fluorescence, nitrate concentration, pH, carbon dioxide concentration, spectral irradiance (7 wavelengths), spectrophotometry (2 forms x 83 channels), photosynthetically-available radiation and ocean current in three dimensions. These parameters are measured up to 18 times per day from 200 meters depth up to near-surface in centimeter-scale increments. Consequently we are, roughly speaking, trying to understand the upper ocean in both depth and time as an 18-dimensional space.
The interpretability of this 'starting point' synoptic dataset is itself daunting; but having come this far we are also interested in making the problem "worse" after a fashion:
- Does the near surface time series agree with satellite observations?
- Does it agree with predictive ocean models?
- Does it agree with independent observations, for example from gliders or drifters (such as ARGO)?
Questions applicable to all of the data include
- How to get data in a usable form?
- Can the data be trusted?
- How can machine learning methods be constructively applied to this multi-dimensional space?
This project begins from the rich time series data described above (again: emphasizing the upper water column / photic zone). The following examples demonstrate potential project directions. Alternative ideas are welcome of course!
- Extend water column observations to map plane data using complementary observations from MODIS
- Apply machine learning methods to N > 3 coincident instrument data streams
- Evaluate agreement between regional ocean model forecasts and in situ observation
- Characterize water column biochemical profiles in relation to physical drivers such as turbulence and temperature
- From real-time automated sensor data streams: Build a strategy for discrete sampling during maintenance cruises
- Build an analysis workflow for automated sensors, e.g. anomaly/thin-layer detection
- Use the data to differentiate influences of coastal upwilling versus terrigenous freshwater runoff
- Shallow profilers run a dozen or so instruments through the upper 200 meters of the water column nine times per day
- On the support platform at 200m depth an ADCP profiles current (200m to surface)
- ARGO floats: CTD and BGC
- ROMS ocean model
- MODIS sea surface data
- specific tasks you want to accomplish, project goals, or research questions you want to answer.
- Think about what outcomes or deliverables you'd like to create
- e.g. a series of tutorial notebooks demonstrating a use case
- e.g. a new python package
Briefly describe the data that will be used here (size, format, how to access).
Needed: How would you or others traditionally try to address this problem?
Needed: What new approaches would you like to try to implement to address your specific question(s) or application(s)?
- Overview
-
Files
.gitignore
Globally ignored files bygit
for the project.environment.yml
conda
environment description needed to run this project.README.md
Description of the project (see suggested headings below)
-
Folders
contributors
Each team member can create their own folder under contributors, within which they can work on their own scripts, notebooks, and other files. Having a dedicated folder for each person helps to prevent conflicts when merging with the main branch. This is a good place for team members to start off exploring data and methods for the project.notebooks
scripts
Helper utilities that are shared with the team should go in here.data
Low-volume datasets that serve as excerpts from the full OOI operational time range. The sub-folders help organize data by sensor/instrument/context/etcetera.images
Resource images, for example invoked by notebook markdown for inline inclusion in a narrative.figures
Distinct fromimages
: These are typically charts produced by notebook code execution.
-