-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cleaning fishbase scinames / territories #2
Comments
@quir1869 Nice repo issue! I was wondering if my github notifications were set up properly, and they do! I'll respond to your bullets here.
A bit of background/orientation: The first step in running ARTIS (besides setting up the run environment) is to clean the raw data inputs. If you take a look at the root project, you will see a sequence of numbered Couple of Notes:
Thoughts on your specific question The SAU data I shared with you a while back may have been run with an older snapshot of I don't have an immediate fix for this, I'll keep an eye out to see if this species also shows up in my unmatched and no synonym list. You could also check out the I was playing around with the
I think this is an issue with territories. I don't know at what level But honestly, I think this is probably mostly stemming from the example SAU data you are working with. Once we have our next full model run, 🤞 hopefully you will have a cleaner and more clear dataset to work with. However, messy data is a great way to learn some data wrangling 🤠 . |
Scientific name cleaning
There are currently 93 ARTIS scientific names that are not being matched to fishbase or sealifebase. One name in the list,
litopenaeus vannamei
is an updated sciname for whiteleg shrimps, whereas fishbase/sealifebase lists whiteleg shrimps as penaeus vannamei. What process does the ARTIS model use to clean fb/slb scientific names, and can that be applied here to clean those names so that they can potentially match with the unmatched ARTIS species?eez zone cleaning
When using countrycode() to match the reported countries / regions in fishbase/sealife base to iso3c, there are unmatched territories. It would be great to try to determine the eez regions given these territories, but to include these territories as additional rows to keep the original data.
! Some values were not matched unambiguously: Adelaide I., Admiralty Is., Alaska, Amsterdam I., Andaman Is., Ascension I., Azores Is., Balleny Is., Bon. Eust. Saba, Br Antarctic Tr, Canary Is., Cargados Carajos, Caroline I., Central Afr. Rp, Chagos Is., Channel Is., Chatham Is., Clipperton I., Crozet Is., Desventuradas Is, Dominican Rp, Easter I., Elephant I., Europa I., French South Tr, Galapagos Is., Glorieuses Is., Hawaii, Heard McDon Is., Jan Mayen I., Johnston I., Juan de Nova I., Juan Fernández, Kerguelen Is., Kermadec Is., Kosovo, Kuril Is., Lord Howe I., Macquarie Is., Madeira Is., Marquesas Is., Micronesia, Midway Is., Neth Antilles, Ogasawara Is., Pac Is Trust Tr, Peter I I., Prince Edward Is, Revillagigedo A., Rodriguez I., Ryukyu Is., S. Georg. Sandw., Scott I., Socotra Arch., South Orkney Is., South Shetland, St Martin (FR), St Paul's Rocks, St Paul I., St Pierre Mique., Terre Adélie, Trind. M.Vaz Is., Tristan da Cunha, Tuamotu Is., UK Engld Wal, UK No Ireld, UK Scotland, Wake I., West Sahara
unmatched_scinames.txt
The text was updated successfully, but these errors were encountered: