-
Notifications
You must be signed in to change notification settings - Fork 4
Reassess sites-to-segs search radius #112
Comments
Thanks for discovering this issue! I'm supportive of reducing the search radius to 1 km (or shorter) and dropping sites that are further away than that. It would be helpful to know how many sites, particularly continuous, are dropped to potentially encourage modeling at finer NHD scale in the future. |
We're currently snapping points to segments by searching for the nearest segment within a radius of 0.1 degrees (~10 km). The plot below shows the distance between our SC sites (including discrete + NWIS observations) and the matched segment, showing that there are quite a few sites that are matched to reaches that are relatively far away (i.e., ~1360 sites out of 3449 total sites are > 1 km away from their matched segment). Most of these sites that don't get matched/are matched at long distances are discrete sites. However, there are some NWIS sites that are getting snapped to PRMS segments even though the sites probably aren't located on those segments. NWIS 01478950 (on Pike Creek and not the Christina River, which is shown in red) is an example of that: |
So I'd suggest we should reduce our search radius. In other projects such as temperature and now DO, we use a search radius of 500 m (e.g. see lines 56-61 in the 2wp-temp repo). If we reduce our search radius to 500 m, we lose 26 NWIS gages out of 125. I quickly checked the location of these 26 gages and most of them are located on streams not represented by the PRMS network. But there are also edge cases where larger rivers might not get snapped if they're wider than 500 m (e.g. issue USGS-R/delaware-model-prep#34 in delaware-model-prep). I noticed this with sites on the Delaware River (e.g. 01474703 and 01477050, which are ~620 and 630 m away from that segment, respectively. I'm leaning towards keeping the search radius at 500 m, but adding special handling to retain 01474703 and 01477050. Implementing this change would result in a loss of 24 NWIS gages and 9,048 NWIS observation-days (~5% of our total NWIS obs-days for SC). |
Thanks for the update. Using 500 m and special handling for points where the river width is large sounds good to me. River width is an attribute that could be used to determine which segments should receive special handling. |
I used Google Earth for a quick river width estimation. It seems like the main stem south of Trenton is > 500 m. It doesn't look like any of the tributaries are > 500 m. I can quickly grab all of the main stem segment numbers that could use > 500 m tolerance, but would not have a quick way to say what the tolerance should be for these reaches |
Thanks for looking into this (I haven't looked at |
I don't think it would introduce tributaries using |
Yeah, thanks for clarifying. I meant tributary samples that got matched to the mainstem. I don't expect that to be an issue if we're sticking with a 500 m radius, though. |
In USGS-R/drb-do-ml#36, we realized that the search radius being used to match sites to segments likely resulted in many headwater streams being matched to the mainstem rivers represented in the PRMS network. This issue includes the following steps:
The text was updated successfully, but these errors were encountered: