biodiversitydata-se / SBDI4R

R package to search and access data made available through the Swedish biodiversity data infrastructure SBDI
https://biodiversitydata-se.github.io/SBDI4R/
GNU Affero General Public License v3.0

search by observation type #19

Open DeboraArlt opened 3 years ago

DeboraArlt commented 3 years ago

is part of #18

Users should be able to filter out observations labelled as "not rediscovered" or "non-natural occurrence". Provide this filtering option using the following data fields (Adb/Artportalen fields): IsNeverFoundObservation, IsNotRediscoveredObservation, IsNaturalOccurrence, IsPositiveObservation.

These fields need to be included in the SOLR index. The current bioatlas SOLR schema is here: https://github.com/bioatlas/ala-docker/blob/develop/solr7/mycores/biocache/conf/schema.xml
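
To make the intended filter concrete, here is a minimal sketch of the SOLR filter queries such an option could translate to. It assumes the four fields end up in the index under exactly these names and hold boolean values. Neither is confirmed yet, and the helper `build_filter_queries` is hypothetical, not part of SBDI4R.

```python
from urllib.parse import urlencode


def build_filter_queries(exclude_not_rediscovered=True, exclude_non_natural=True):
    """Return SOLR fq clauses that drop unwanted observation types.

    Assumption: the Artportalen fields are indexed as booleans under
    these exact names. Adjust if the index uses other names or values.
    """
    fqs = []
    if exclude_not_rediscovered:
        # Negative clause: exclude records flagged as not rediscovered.
        fqs.append("-IsNotRediscoveredObservation:true")
    if exclude_non_natural:
        # Negative clause: exclude records flagged as non-natural occurrences.
        fqs.append("-IsNaturalOccurrence:false")
    return fqs


# Combine a match-all query with one fq parameter per clause.
params = [("q", "*:*")] + [("fq", fq) for fq in build_filter_queries()]
query_string = urlencode(params)
print(query_string)
```

Using negative clauses keeps records where the field is missing, which seems the safer default as long as not all data providers supply these fields.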

DeboraArlt commented 3 years ago

comment by Manash:

The four terms mentioned are not standard Darwin Core terms (https://www.slu.se/en/subweb/swedish-lifewatch/guides/data-management/data-providers/slu-swedish-species-information-center/dataservices/speciesobservationservice/) and could not be found here: https://github.com/tdwg/dwc/tree/master/dist

However, if a source CSV contains additional columns that are not defined in the Darwin Core standard, they are automatically added to a JSON object stored in the Cassandra column:

miscProperties

To add them to the SOLR index, the following property needs to be set in the biocache-store config file:

additional.fields.to.index=myField1,myField2,myField3

or added to the file additionalFields.list if using the IndexLocalNode function.
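
As a rough illustration of the mechanism described above (this is not the actual biocache-store code), the sketch below separates non-standard columns from a record into a JSON string, the way miscProperties is described, and builds the property line for the four Artportalen fields. `KNOWN_DWC_TERMS` is a deliberately tiny, hypothetical subset of Darwin Core terms, and the sample row is made up.

```python
import json

# Hypothetical, tiny subset of Darwin Core terms, for illustration only.
KNOWN_DWC_TERMS = {"occurrenceID", "scientificName", "basisOfRecord"}

ARTPORTALEN_FIELDS = [
    "IsNeverFoundObservation",
    "IsNotRediscoveredObservation",
    "IsNaturalOccurrence",
    "IsPositiveObservation",
]


def split_record(row):
    """Split a CSV row (dict) into standard columns and a JSON string of the rest."""
    core = {k: v for k, v in row.items() if k in KNOWN_DWC_TERMS}
    extra = {k: v for k, v in row.items() if k not in KNOWN_DWC_TERMS}
    return core, json.dumps(extra, sort_keys=True)


def additional_fields_property(fields):
    """Build the config line that lists the extra fields to index."""
    return "additional.fields.to.index=" + ",".join(fields)


# Made-up example row with one non-standard column.
row = {
    "occurrenceID": "obs-1",
    "scientificName": "Parus major",
    "IsNaturalOccurrence": "true",
}
core, misc_properties = split_record(row)
property_line = additional_fields_property(ARTPORTALEN_FIELDS)
print(misc_properties)
print(property_line)
```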

aleruete commented 3 years ago

basisOfRecord is indexed and can be searched for

DeboraArlt commented 3 years ago

basisOfRecord does not contain the terms IsNeverFoundObservation, IsNotRediscoveredObservation, IsNaturalOccurrence, IsPositiveObservation. As Manash pointed out above, they are not standard DwC terms. I now understand that they are labels used for Artportalen data and appear as columns with those same names in the result tables created by Analysportalen. I don't know whether those columns are included in the data entering through the IPT, and hence whether they could be used for filtering.
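
One way to settle this would be to inspect the header of an occurrence file exported from the IPT. A minimal sketch, assuming the file is delimited text with the column names in its first row (the tab delimiter and the sample content below are assumptions, not real Artportalen data):

```python
import csv
import io

FIELDS = [
    "IsNeverFoundObservation",
    "IsNotRediscoveredObservation",
    "IsNaturalOccurrence",
    "IsPositiveObservation",
]


def missing_columns(text, delimiter="\t"):
    """Return the Artportalen fields that are absent from the header row."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    header = next(reader, [])  # empty input gives an empty header
    present = {name.strip() for name in header}
    return [field for field in FIELDS if field not in present]


# Made-up sample: only one of the four columns is present.
sample = "occurrenceID\tIsNaturalOccurrence\nobs-1\ttrue\n"
print(missing_columns(sample))
```

An empty list would mean all four columns arrive through the IPT and could be indexed as additional fields.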