nc-minibbs / mbbs

A repository for the Mini-Bird Breeding Survey data
https://minibbs.us

duplicate data in eBird #37

Open ahhurlbert opened 1 year ago

ahhurlbert commented 1 year ago

There seem to be instances in which multiple checklists exist for the same route-year. For at least some of these, the cause may be that the original observer (often Will Cook) shared their checklist with the mbbs eBird account, so it sits alongside the checklist that was scraped from the website and manually added. In some cases, though, the values in the two checklists do not agree. Here are the examples from the mbbsorangenc account (the mbbsdurhamnc and mbbschathamnc checklists should be checked for similar duplicates):

(Note that a few route-years, especially those by Haven Wiley, look duplicated because for a while the protocol was to submit a separate nocturnal pre-survey checklist. These are not true duplicates: the pre-survey checklist typically starts at 5:10, whereas the actual survey checklist starts at 5:25 or 5:30.)
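A minimal sketch of how the check described above could be automated, in case it helps with the other two accounts. It is not code from the mbbs repository: the record fields (`id`, `route`, `year`, `start`, `counts`), the function names, and the 5:20 cutoff used to set aside nocturnal pre-survey checklists are all assumptions for illustration, and the sample records are made up.

```python
from collections import defaultdict
from datetime import time

# Hypothetical cutoff: checklists starting before this are treated as nocturnal
# pre-survey checklists (the thread mentions ~5:10 versus 5:25 or 5:30).
PRE_SURVEY_CUTOFF = time(5, 20)


def find_duplicate_route_years(checklists, cutoff=PRE_SURVEY_CUTOFF):
    """Return {(route, year): [checklist, ...]} for route-years that still have
    more than one checklist once pre-survey checklists are set aside."""
    groups = defaultdict(list)
    for c in checklists:
        if c["start"] < cutoff:
            continue  # likely a nocturnal pre-survey checklist, not a duplicate
        groups[(c["route"], c["year"])].append(c)
    return {key: group for key, group in groups.items() if len(group) > 1}


def counts_disagree(group):
    """True if the checklists in a group do not all report the same counts."""
    first = group[0]["counts"]
    return any(c["counts"] != first for c in group[1:])


# Made-up records, for illustration only.
checklists = [
    {"id": "A", "route": 1, "year": 2005, "start": time(5, 10), "counts": {"Wood Thrush": 1}},
    {"id": "B", "route": 1, "year": 2005, "start": time(5, 30), "counts": {"Wood Thrush": 4}},
    {"id": "C", "route": 2, "year": 2005, "start": time(5, 25), "counts": {"Wood Thrush": 3}},
    {"id": "D", "route": 2, "year": 2005, "start": time(5, 25), "counts": {"Wood Thrush": 2}},
]

duplicates = find_duplicate_route_years(checklists)
for (route, year), group in sorted(duplicates.items()):
    ids = [c["id"] for c in group]
    status = "counts disagree" if counts_disagree(group) else "counts agree"
    print(f"route {route}, {year}: {ids} ({status})")
```

With these sample records, route 1 is not flagged because one of its two checklists falls before the cutoff, while route 2 is flagged with disagreeing counts. A start-time cutoff is only a heuristic, so flagged and unflagged route-years would still need a manual look.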

ahhurlbert commented 1 year ago

To be clear, it seems that in our own analysis we ignore eBird data pre-2009 (or is it 2010?) and only use the old website-based data for that period. However, the following tasks would be useful

IJBG commented 1 year ago

Create a text document (Markdown) to keep in the repository that records the cleaning actions we've taken on eBird, e.g. "removed checklist x because the other checklist was confirmed to be more accurate". Checklists will be removed from eBird, but they will still be available from the previous eBird downloads kept on GitHub.