CINERGI / ontology_cleanup

Using OWL API to clean up ontologies

Wrong Annotation #21

Open karen-lo opened 8 years ago

karen-lo commented 8 years ago

Doc ID: SEN:0110

"This site has been setup by the Geological Fluid Dynamics group of the Institut de Physique Du Globe de Paris..."

"Group" is recognized as Material > Chemical, but in this case it seems to be a part of a name.

http://132.249.238.151:8080/cinergi-annotator/annotation/index?docId=SEN%3A0110&enhancedOnly=false

karen-lo commented 8 years ago

Doc ID: cinergi:cuahsi_his:lnf_hydro

"four sites were selected in the Hat Creek Basin of Lassen National Forest to study the hydrologic effects of two common fuel-reduction strategies, forest thinning and group selection."

"Group" is again recognized as Material > Chemical. This time it's part of a phrase.

http://132.249.238.151:8080/cinergi-annotator/annotation/index?docId=cinergi%3Acuahsi_his%3Alnf_hydro&enhancedOnly=false

karen-lo commented 8 years ago

Doc ID: 4f4e48aee4b07f02db52e30c

"Operator: LEWIS, RICHARD JR"

"Lewis A" was annotated as Material > Chemical.

http://132.249.238.151:8080/cinergi-annotator/annotation/index?docId=4f4e48aee4b07f02db52e30c&enhancedOnly=false

karen-lo commented 8 years ago

Doc ID: 4f4e47c7e4b07f02db4aaf70

"Operator: YOUNG, L C"

"Young" is recognized as Property > Measure.

http://132.249.238.151:8080/cinergi-annotator/annotation/index?docId=4f4e47c7e4b07f02db4aaf70&enhancedOnly=false

karen-lo commented 8 years ago

Many documents in the ScienceBase WAF source have annotated keywords that are actually names or proper nouns.

karen-lo commented 8 years ago

Doc ID: 0029AA385C604C32B4076DDC8600A64F

"For example, a major challenge facing the US industry today is that the sales contracts of independent producers have reached,"

"sales" is recognized as "Sale" and annotated as Material > Chemical.

http://132.249.238.151:8080/cinergi-annotator/annotation/index?docId=0029AA385C604C32B4076DDC8600A64F&enhancedOnly=false

karen-lo commented 8 years ago

Doc ID: 0029AA385C604C32B4076DDC8600A64F

"some of his company's plants have ''gone over the cliff, the world is not coming to an end.' With the imposition of severe cost-cutting strategies, he said, ''these plants remain profitable...'"

"plants" is recognized as "Plant" and annotated as Science Domain > Biology. In this case, it seems to be used as an economic term.

http://132.249.238.151:8080/cinergi-annotator/annotation/index?docId=0029AA385C604C32B4076DDC8600A64F&enhancedOnly=false

karen-lo commented 8 years ago

Doc ID: 0110DC91EA6E40FBA7D17B73248B7DCE

"A view of the slate bedrock in an underground placer mine."

"bedrock" is recognized as Bedrock and annotated as Realm > Geosphere.

http://132.249.238.151:8080/cinergi-annotator/annotation/index?docId=0110DC91EA6E40FBA7D17B73248B7DCE&enhancedOnly=false

dv: OK

karen-lo commented 8 years ago

Doc ID: gov.noaa.ncdc:C00824

"....including stations from the Climate Reference Network (CRN)."

"CRN" is recognized as "Crn" and annotated as Material > Chemical, which is incorrect in this case.

http://132.249.238.151:8080/cinergi-annotator/annotation/index?docId=gov.noaa.ncdc%3AC00824&enhancedOnly=false

karen-lo commented 8 years ago

Doc ID: 04F1402BF03A430C87005BFAAB75BC6C

"A clear trend emerges in the Klamath Falls/Olene Gap area; "

"Clearing" was recognized and annotated as Feature > Physiographic Feature. I could not find the word "clearing" in the abstract and only found "clear." In this case, it is used as an adjective and not a physiographic feature.

http://132.249.238.151:8080/cinergi-annotator/annotation/index?docId=04F1402BF03A430C87005BFAAB75BC6C&enhancedOnly=false

karen-lo commented 8 years ago

Doc ID: 08ce5b5e7bec3a90a50850008d2e201d

"The Environmental Data Explorer is the authoritative source for data sets used by UNEP and its partners in the Global Environment Outlook (GEO) report and other integrated environment assessments."

"Explorer" is recognized as "Explorer" and annotated as Equipment > Satellite, which is incorrect because here it refers to a data search tool.

http://132.249.238.151:8080/cinergi-annotator/annotation/index?docId=08ce5b5e7bec3a90a50850008d2e201d&enhancedOnly=false

karen-lo commented 8 years ago

Doc ID: epa_models:CAP88

"The complete set of dose and risk factors used in CAP88 is provided. "

"factors" is recognized as "Factor A" and annotated as Material > Chemical.

http://132.249.238.151:8080/cinergi-annotator/annotation/index?docId=epa_models%3ACAP88&enhancedOnly=false

karen-lo commented 8 years ago

Doc ID: epa_models:WATERSHEDSS

"a hypertext expert-systems-like user interface, the agricultural BMP data base, and the pollutant budget spreadsheet."

"data base" is recognized as "Base" and annotated as Material > Chemical.

http://132.249.238.151:8080/cinergi-annotator/annotation/index?docId=epa_models%3AWATERSHEDSS&enhancedOnly=false

karen-lo commented 8 years ago

Doc ID: OT.042013.26913.1

"NCALM Seed. PI: Hugo Gutierrez Jurado, New Mexico Tech."

"Tech" is recognized as "TeCH" and annotated as Material > Chemical.

http://132.249.238.151:8080/cinergi-annotator/annotation/index?docId=OT.042013.26913.1&enhancedOnly=false

valentinedwv commented 7 years ago

Doc ID: ecogeo:0017

Organization Unassigned. Existing KW is Organization.

http://132.249.238.150:8080/cinergi-annotator/annotation/index?docId=ecogeo%3A0017&enhancedOnly=false

karen-lo commented 7 years ago

After reprocessing:

Resolved:

No Longer Accessible: Doc ID: epa_models:CAP88, Doc ID: epa_models:WATERSHEDSS