NCEAS / metadig-engine

MetaDig Engine: multi-dialect metadata assessment engine

Abstract count incorrect in some cases #79

Closed gothub closed 7 years ago

gothub commented 7 years ago

The check check.abstract.100.words.1 calculates the abstract length incorrectly when the abstract contains <para> elements, which are valid TextType markup tags (see https://knb.ecoinformatics.org/#external//emlparser/docs/eml-2.1.1/eml-text.html#TextType). One example is https://arcticdata.io/catalog/#view/doi:10.18739/A2D107

Fix the check so that it calculates a correct word count for any valid TextType markup.

gothub commented 7 years ago

The value returned by the XPath selector included whitespace, which was being counted toward the abstract word length.

This problem has been fixed in commit ad95b2fdd0c4c81418a97135663541047d013b23.
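
For anyone hitting a similar problem, here is a minimal Python sketch of how whitespace around <para> elements can inflate a word count, and one way to avoid it. This is not the code from the commit above and not the check's actual implementation; the function names and the sample abstract are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: an EML-style abstract with <para> markup and
# the indentation/newlines a pretty-printed document typically carries.
SAMPLE = """<abstract>
    <para>Sea ice extent was measured   weekly.</para>
    <para>
        Data cover 2010 to 2015.
    </para>
</abstract>"""


def naive_word_count(abstract_xml: str) -> int:
    """Illustrative buggy count: splitting on a single space treats
    every run of whitespace as extra (empty or newline-only) 'words'."""
    root = ET.fromstring(abstract_xml)
    text = "".join(root.itertext())
    return len(text.split(" "))


def word_count(abstract_xml: str) -> int:
    """Whitespace-tolerant count: gather all text nodes under the
    abstract, then split on any run of whitespace."""
    root = ET.fromstring(abstract_xml)
    # Join with a space so words in adjacent elements never fuse together.
    text = " ".join(root.itertext())
    # str.split() with no argument collapses whitespace runs and
    # drops leading/trailing whitespace, so no empty tokens are counted.
    return len(text.split())


if __name__ == "__main__":
    print("naive:", naive_word_count(SAMPLE))
    print("whitespace-tolerant:", word_count(SAMPLE))
```

The sample contains 11 words, which is what `word_count` returns; the naive version reports a larger number because of the indentation and line breaks between elements.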