via @myrmoteras
The clean bucket has been on @deepreef's side, and I remember that a long time ago I spent some time helping to clean up your side when we were working towards integrating Hymenoptera Online (this was when I was living in Tehran). On our side we added over 1.2M bibliographic references to the dirty bucket, and we started BLR. Among other things, BLR is meant to allow uploading articles for which no PDF is available, so that every reference has an accessible digital copy. We now have a tool for bulk uploads, for example all the literature on drosophilids (ca. 16,000 articles). The tool also allows cleaning up the data, such as journal names. In this respect there is clearly an option to collaborate towards a clean bucket, so that BLR ultimately becomes part of the clean bucket by having its journal names, etc. all follow the clean bucket.
BLR is a repository for full-content articles, figures, treatments, and supplementary materials.
BLR holds the metadata as we upload it, so journal names vary to some degree: so far nobody has made an effort to clean them up, required it, or provided a reference list of journal names. Today this covers some 36,254 articles and 701 distinct journal names, which is not too much to clean up, if needed. Here is the list of journal and publisher names: http://tb.plazi.org/GgServer/dioStats/stats?outputFields=bib.source&groupingFields=bib.source&format=HTML
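To illustrate what cleaning up the 701 journal names against a reference list could look like, here is a minimal Python sketch. It is not how our tool works; the reference list, the `match_journal` helper and the 0.85 similarity cutoff are all assumptions for illustration, and the real list would be whatever the clean bucket provides.

```python
import difflib
import re

# Hypothetical excerpt of a clean-bucket reference list (assumption, not real data).
REFERENCE_JOURNALS = ["Zootaxa", "ZooKeys", "Journal of Hymenoptera Research"]


def normalize(name: str) -> str:
    """Lowercase, replace punctuation with spaces, collapse whitespace."""
    name = re.sub(r"[^\w\s]", " ", name.lower())
    return re.sub(r"\s+", " ", name).strip()


# Map each normalized reference name back to its canonical spelling.
_INDEX = {normalize(j): j for j in REFERENCE_JOURNALS}


def match_journal(raw: str, cutoff: float = 0.85):
    """Return the canonical journal name for a raw variant, or None if unsure."""
    key = normalize(raw)
    if key in _INDEX:  # exact match after normalization
        return _INDEX[key]
    # Otherwise fall back to the closest reference name above the cutoff.
    close = difflib.get_close_matches(key, list(_INDEX), n=1, cutoff=cutoff)
    return _INDEX[close[0]] if close else None
```

Names that come back as `None` would be left for manual review rather than guessed, which seems manageable for a list of this size.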
http://bibref.org currently has over 1.3M references, with new ones added daily via TreatmentBank.