Open CrisSevilleja opened 2 months ago
@CrisSevilleja I can't find the script anymore (it will be somewhere), but I'm quite sure I used the 90% or 95% quantiles. Because numbers for such abundant species are very high in peak season, the tails can still hold substantial numbers of records (though small in percentage). But in CZ, M. jurtina before 15 May will be doubtful; the question is whether you want to skip all M. jurtina in May (so also those before 15 May) or the other way around. I fear there is no easy fix.
PS all was restricted to months, so you have to choose to either include May or not.
@chrisvanswaay I wouldn't accept records of M. jurtina before 15 May in Czechia, but the tail at the end of the flight period is almost a month long: the rulesets are set up to end in mid-August, yet the species can be seen until mid-September.
I am posting this because other coordinators told me the rulesets did not include common species, like in Austria, and I noticed this in Spain as well. I am just wondering if a new check can be done to improve and include more of those species. We can involve more coordinators to check the flight periods of all their country species.
Ah, I thought it was restricted to periods and not months.
I'll try to find the script (there are so many, that I sometimes forget where I put them).
I noticed this issue with common species too. Hopefully Chris can update the rules as we don't want to be manually adjusting them?
Also, it is worth noting that these automated checks only add flags to records; they do not lead to an accepted/rejected status, as that is only set by the human verifiers. We could use these rules to automate the verification, but it's good to be confident that they work in all (or most) situations.
@DavidRoy Can you find out when I sent you the file with the flight periods, and what it was called? That would help me trace the script.
With the script it would be easy to change the quantiles and run it again.
it was captured by this issue https://github.com/BiologicalRecordsCentre/ABLE/issues/511 which also links to an earlier issue. There is some discussion on the approach and the file you supplied to us
Thanks, that helped, found the script. I used GBIF data for this, so if for some countries data is missing, then these species will be missing. Here is the table for M jurtina in Continental:
flight_period_M_jurtina_Continental.xlsx
The top rows:

| species | code | month | period | nrec | cumsum | tot | cumperc |
| -- | -- | -- | -- | -- | -- | -- | -- |
| Maniola jurtina | Continental | 7 | II | 35396 | 35396 | 153270 | 23,09388661 |
| Maniola jurtina | Continental | 7 | I | 30685 | 66081 | 153270 | 43,11411235 |
| Maniola jurtina | Continental | 6 | II | 25050 | 91131 | 153270 | 59,45781953 |
| Maniola jurtina | Continental | 8 | I | 22809 | 113940 | 153270 | 74,33940106 |
| Maniola jurtina | Continental | 1 | I | 11454 | 125394 | 153270 | 81,81248777 |
| Maniola jurtina | Continental | 6 | I | 11251 | 136645 | 153270 | 89,15312847 |
| Maniola jurtina | Continental | 8 | II | 11010 | 147655 | 153270 | 96,33653031 |
| Maniola jurtina | Continental | 5 | II | 3114 | 150769 | 153270 | 98,36823906 |
| Maniola jurtina | Continental | 9 | I | 1368 | 152137 | 153270 | 99,26078163 |
| Maniola jurtina | Continental | 5 | I | 616 | 152753 | 153270 | 99,66268676 |
| Maniola jurtina | Continental | 9 | II | 316 | 153069 | 153270 | 99,86885888 |
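To illustrate how the cutoff shapes the result, here is a minimal Python sketch of the cumulative-percentage selection, using only the rows posted above. It is not the original script: the function name `periods_within` and the rule "keep a half-month while its cumulative percentage stays at or below the cutoff" are assumptions, although that rule does give the same six rows as the flightperiod_95perc.csv excerpt quoted in this issue.

```python
# Top rows of the M. jurtina / Continental table: (month, period, nrec).
ROWS = [
    (7, "II", 35396), (7, "I", 30685), (6, "II", 25050), (8, "I", 22809),
    (1, "I", 11454), (6, "I", 11251), (8, "II", 11010), (5, "II", 3114),
    (9, "I", 1368), (5, "I", 616), (9, "II", 316),
]
TOTAL = 153270  # 'tot' column; it also counts rows not shown in the excerpt


def periods_within(rows, total, cutoff_percent):
    """Return (month, period, cumperc) for the half-months kept by the cutoff.

    Assumed rule (hypothetical, not taken from the original script): rank
    half-months by record count, accumulate, and stop at the first row whose
    cumulative percentage exceeds the cutoff.
    """
    kept = []
    cumsum = 0
    for month, period, nrec in sorted(rows, key=lambda r: r[2], reverse=True):
        cumsum += nrec
        cumperc = 100 * cumsum / total
        if cumperc > cutoff_percent:
            break
        kept.append((month, period, round(cumperc, 2)))
    return kept


for cutoff in (95, 99):
    kept = periods_within(ROWS, TOTAL, cutoff)
    print(cutoff, [(m, p) for m, p, _ in kept])
```

Under this rule a 95% cutoff stops before August II (96.3%), which is why the ruleset ends in mid-August. Raising the cutoff to 99% adds August II and May II but still leaves out September I (99.3%), so changing the quantile alone does not cover the whole Czech flight period.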
Hello,
The Czech coordinator wanted to run the validations in the verification system and realised that the rulesets for flagging butterflies are quite strict for some common species. He pointed out that Maniola jurtina and Coenonympha pamphilus are rejected in months when they usually fly. I checked the rulesets for flagging butterflies in issue #511 and found M. jurtina listed as flying between June and mid-August. I took this from the document flightperiod_95perc.csv:

| Species | BGR | Month | period | nrec |
| -- | -- | -- | -- | -- |
| Maniola jurtina | Continental | 7 | II | 35396 |
| Maniola jurtina | Continental | 7 | I | 30685 |
| Maniola jurtina | Continental | 6 | II | 25050 |
| Maniola jurtina | Continental | 8 | I | 22809 |
| Maniola jurtina | Continental | 1 | I | 11454 |
| Maniola jurtina | Continental | 6 | I | 11251 |
The flight period of M. jurtina in Czechia runs from ca. 20 May to ca. 15 September (see https://portal.nature.cz/w/druh-31746#/), and that of Coenonympha pamphilus from mid-April to mid-October (https://portal.nature.cz/w/druh-31751#/).
I think we can correct the flight periods of those two species in Czechia. Still, it would be best to check when common species such as Pyronia tithonus, Polyommatus thersites, Vanessa atalanta and Celastrina argiolus, among others, are rejected, and to correct them for all countries. Another option is to check which species are rejected most often in the verification system and determine which of them have a longer flight period than the rules allow.
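As a rough illustration of what a day-level check for Czechia could look like, here is a minimal Python sketch based on the flight periods quoted above. The names `FLIGHT_WINDOWS` and `outside_flight_period` are hypothetical, the dates are the approximate ones from this post, and "mid-April" and "mid-October" are assumed to mean the 15th. It is not how the current rulesets are implemented.

```python
from datetime import date

# Approximate Czech flight windows quoted in this issue, as
# ((start_month, start_day), (end_month, end_day)).
# "mid-April" / "mid-October" are taken as the 15th (an assumption).
FLIGHT_WINDOWS = {
    "Maniola jurtina": ((5, 20), (9, 15)),
    "Coenonympha pamphilus": ((4, 15), (10, 15)),
}


def outside_flight_period(species, record_date):
    """Return True if the record date falls outside the species' window.

    Only meant to raise a flag for a human verifier; it does not set an
    accepted/rejected status.
    """
    start, end = FLIGHT_WINDOWS[species]
    day_of_year = (record_date.month, record_date.day)
    return not (start <= day_of_year <= end)


print(outside_flight_period("Maniola jurtina", date(2024, 5, 10)))  # flagged
print(outside_flight_period("Maniola jurtina", date(2024, 9, 1)))   # not flagged
```

Comparing `(month, day)` tuples keeps the check independent of the year, which only works for windows that do not wrap around the new year.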