gbhl / bhl-europe

Biodiversity Heritage Library Europe
http://www.bhl-europe.eu/
15 stars 2 forks source link

2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search) #75

Open janahoffmann opened 13 years ago

janahoffmann commented 13 years ago

Search by scientific name (genus, species, binominal, taxon), search in metadata/ OCR text

Precondition: uBio integration (TaxonFinder), save coordinates of names found in OCR in metadata (OLEF)

lobajuluwa commented 12 years ago

Dependency:- Pre-Ingest and taxon finder services - needs to define index in solr. Data has to come from Pre-Ingest and searching correct fields from OLEF. We can prepare function in portal i.e. so long as data can be driven thru.. then the portal can pick it up. (WK)

wkollernhm commented 12 years ago

This is now possible in our portal using sample data (spices books).

JiriFrank commented 12 years ago

2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search)

COR number: 2.1.2 Testing platform: http://bhl-test.nhm.ac.uk/portal/ Function: Advanced search

Description:

Search by scientific name (genus, species, binominal, taxon), search in metadata/ OCR text

„expand search“ function

@AnneSch @AntonioGVH @fwelter @grahamhrbge1670 @heimor @HenningScholz @JFTester @JiriFrank @LarissaS @RalfH

JiriFrank commented 12 years ago

2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search)

COR number: 2.1.2 Testing platform: http://bhl-test.nhm.ac.uk/portal/ Function: Advanced search

Description:

Search by scientific name (genus, species, binominal, taxon), search in metadata/ OCR text

„expand search“ function

@AntonioGVH @fwelter @LarissaS

HenningScholz commented 12 years ago

Same applies as before: what is not in the metadata can not be found. As TaxonFinder comes at Pre-Ingest, we don't have the taxonomic information ready now. So I think the infrastructure is there, but we cant see it working properly without the Pre-Ingest. Is that correct?


Von: JiriFrank [mailto:reply@reply.github.com] Gesendet: Di 17.01.2012 15:08 An: Scholz, Henning Betreff: Re: [bhle] 2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search) (#75)

2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search)

COR number: 2.1.2 Testing platform: http://bhl-test.nhm.ac.uk/portal/ Function: Advanced search

Description:

Search by scientific name (genus, species, binominal, taxon), search in metadata/ OCR text

"expand search" function

@AnneSch @AntonioGVH @fwelter @grahamhrbge1670 @heimor @HenningScholz @JFTester @JiriFrank @LarissaS @RalfH


Reply to this email directly or view it on GitHub: https://github.com/bhle/bhle/issues/75#issuecomment-3528255

RalfH commented 12 years ago

It works. But there are some open questions.

I searched for e.g. Thalictrum minus.

(1) First question: Do I need to tick the exact word /phrase box? Obviously not, but many users will have the same "problem".

(2) Well the result (Flora de Catalunya) is ok. The user will click on read book. That works. But then he or she has to type Thalictrum minus again. Same question as before: exact phrase and how to deal with it? The search results in the documents are a mixture of results referring to Thalictrum or minus, not to the binomial Thalictrum minus. Anyway, it would be nice that search results in the selected book would be displayed automatically. Currently, you have to type the scientific name again and the results are not really what you are looking for.

This may refer partly to other CORs.

-----Ursprüngliche Nachricht----- Von: JiriFrank [mailto:reply@reply.github.com] Gesendet: Dienstag, 17. Januar 2012 15:08 An: Hand, Ralf Betreff: Re: [bhle] 2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search) (#75)

2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search)

COR number: 2.1.2 Testing platform: http://bhl-test.nhm.ac.uk/portal/ Function: Advanced search

Description:

Search by scientific name (genus, species, binominal, taxon), search in metadata/ OCR text

"expand search" function

@AnneSch @AntonioGVH @fwelter @grahamhrbge1670 @heimor @HenningScholz @JFTester @JiriFrank @LarissaS @RalfH


Reply to this email directly or view it on GitHub: https://github.com/bhle/bhle/issues/75#issuecomment-3528255

HenningScholz commented 12 years ago

This is interesting, I did not got it to work using a different name that is present in another book, but the example used by Ralf is working. Would be interesting to know why. I assume, as the OLEF file is created manually, the Taxon Finder information is missing and thus it is not working for all books. This is also the reason why you need to search again in the content viewer without getting the same result, as the Taxon Finder results are not in the data yet that are included for a book. Would be interesting to know if it is really a data problem or a useability problem.

Henning


Dr. Henning Scholz Museum für Naturkunde | Leibniz Institute for Research on Evolution and Biodiversity at the Humboldt University Berlin Invalidenstraße 43 | D-10115 Berlin | Germany Tel.: +49-30-2093-8864 | Fax: +49-30-2093-8868 Email: henning.scholz@mfn-berlin.de http://www.naturkundemuseum-berlin.de/ / http://www.bivalve.de http://www.bhl-europe.eu / http://www.biodiversitylibrary.org/ BHL-Europe in Europeana: bit.ly/BHLE-Europeana

-----Ursprüngliche Nachricht----- Von: RalfH [mailto:reply@reply.github.com] Gesendet: Freitag, 20. Januar 2012 14:21 An: Scholz, Henning Betreff: Re: [bhle] 2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search) (#75)

It works. But there are some open questions.

I searched for e.g. Thalictrum minus.

(1) First question: Do I need to tick the exact word /phrase box? Obviously not, but many users will have the same "problem".

(2) Well the result (Flora de Catalunya) is ok. The user will click on read book. That works. But then he or she has to type Thalictrum minus again. Same question as before: exact phrase and how to deal with it? The search results in the documents are a mixture of results referring to Thalictrum or minus, not to the binomial Thalictrum minus. Anyway, it would be nice that search results in the selected book would be displayed automatically. Currently, you have to type the scientific name again and the results are not really what you are looking for.

This may refer partly to other CORs.

-----Ursprüngliche Nachricht----- Von: JiriFrank [mailto:reply@reply.github.com] Gesendet: Dienstag, 17. Januar 2012 15:08 An: Hand, Ralf Betreff: Re: [bhle] 2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search) (#75)

2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search)

COR number: 2.1.2 Testing platform: http://bhl-test.nhm.ac.uk/portal/ Function: Advanced search

Description:

Search by scientific name (genus, species, binominal, taxon), search in metadata/ OCR text

"expand search" function

@AnneSch @AntonioGVH @fwelter @grahamhrbge1670 @heimor @HenningScholz @JFTester @JiriFrank @LarissaS @RalfH


Reply to this email directly or view it on GitHub: https://github.com/bhle/bhle/issues/75#issuecomment-3528255


Reply to this email directly or view it on GitHub: https://github.com/bhle/bhle/issues/75#issuecomment-3583364

HenningScholz commented 12 years ago

I found another interesting thing: when I search for Thalictrum in simple search I get results. Thalictrum, however, is not in the facet list. When I pick Umbelliferae from the facet list and search for it again, I get no result in whatever field I'm looking for. If I pick another term from the facet list (Halme), I get results. Strange....


Dr. Henning Scholz Museum für Naturkunde | Leibniz Institute for Research on Evolution and Biodiversity at the Humboldt University Berlin Invalidenstraße 43 | D-10115 Berlin | Germany Tel.: +49-30-2093-8864 | Fax: +49-30-2093-8868 Email: henning.scholz@mfn-berlin.de http://www.naturkundemuseum-berlin.de/ / http://www.bivalve.de http://www.bhl-europe.eu / http://www.biodiversitylibrary.org/ BHL-Europe in Europeana: bit.ly/BHLE-Europeana

-----Ursprüngliche Nachricht----- Von: RalfH [mailto:reply@reply.github.com] Gesendet: Freitag, 20. Januar 2012 14:21 An: Scholz, Henning Betreff: Re: [bhle] 2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search) (#75)

It works. But there are some open questions.

I searched for e.g. Thalictrum minus.

(1) First question: Do I need to tick the exact word /phrase box? Obviously not, but many users will have the same "problem".

(2) Well the result (Flora de Catalunya) is ok. The user will click on read book. That works. But then he or she has to type Thalictrum minus again. Same question as before: exact phrase and how to deal with it? The search results in the documents are a mixture of results referring to Thalictrum or minus, not to the binomial Thalictrum minus. Anyway, it would be nice that search results in the selected book would be displayed automatically. Currently, you have to type the scientific name again and the results are not really what you are looking for.

This may refer partly to other CORs.

-----Ursprüngliche Nachricht----- Von: JiriFrank [mailto:reply@reply.github.com] Gesendet: Dienstag, 17. Januar 2012 15:08 An: Hand, Ralf Betreff: Re: [bhle] 2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search) (#75)

2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search)

COR number: 2.1.2 Testing platform: http://bhl-test.nhm.ac.uk/portal/ Function: Advanced search

Description:

Search by scientific name (genus, species, binominal, taxon), search in metadata/ OCR text

"expand search" function

@AnneSch @AntonioGVH @fwelter @grahamhrbge1670 @heimor @HenningScholz @JFTester @JiriFrank @LarissaS @RalfH


Reply to this email directly or view it on GitHub: https://github.com/bhle/bhle/issues/75#issuecomment-3528255


Reply to this email directly or view it on GitHub: https://github.com/bhle/bhle/issues/75#issuecomment-3583364

AntonioGVH commented 12 years ago

I am not sure this function works properly. I looked by A. muricatus, that I know is in the work 'Flora de Catalunya', but the advance search function only gave me Karsten Deutschen Flora.

By the way, I had to look 'Flora de Catalunya' in Google books, because the BHL-E web does not respond when I ask for the pdf.

Quoting JiriFrank:

2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search)

COR number: 2.1.2 Testing platform: http://bhl-test.nhm.ac.uk/portal/ Function: Advanced search

Description:

Search by scientific name (genus, species, binominal, taxon), search in metadata/ OCR text

„expand search“ function

@AntonioGVH @fwelter @LarissaS


Reply to this email directly or view it on GitHub: https://github.com/bhle/bhle/issues/75#issuecomment-3560969

AnneSch commented 12 years ago

Is working, but I got different results: When I type in "Cirsium palustre", I got some results, same as "Cirsium" (e.g., Billeder af Nordens flora). Instead, when I type in "Orchis mascula", I do not get any matches, but it is in the "New british flora" (It got found in the OCR when I look for it in the text).

-----Ursprüngliche Nachricht----- Von: JiriFrank [mailto:reply@reply.github.com] Gesendet: Dienstag, 17. Januar 2012 15:08 An: Scholz, Annemarie Betreff: Re: [bhle] 2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search) (#75)

2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search)

COR number: 2.1.2 Testing platform: http://bhl-test.nhm.ac.uk/portal/ Function: Advanced search

Description:

Search by scientific name (genus, species, binominal, taxon), search in metadata/ OCR text

"expand search" function

@AnneSch @AntonioGVH @fwelter @grahamhrbge1670 @heimor @HenningScholz @JFTester @JiriFrank @LarissaS @RalfH


Reply to this email directly or view it on GitHub: https://github.com/bhle/bhle/issues/75#issuecomment-3528255

JiriFrank commented 12 years ago

The feature seems to works in principle, but closer specification is needed. Also it will be important to test with more content.

Request from testing is add concrete location of Scientific name in content. Need to be solved. Issue #292

Will be tested again with more content from PI.

JiriFrank commented 12 years ago

In Facet list is displayed just genus name. Is the binominal name required for facet list? I will be helpful.

@chris-sleep @lobajuluwa @heimor

JiriFrank commented 12 years ago

Done and testable.

JiriFrank commented 12 years ago

2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search)

COR number: 2.1.2 Testing platform: http://bhl-test.nhm.ac.uk/portal/ Function: Simple search, Advanced search

Description:

Search by scientific name (genus, species, binominal, taxon), search in metadata/ OCR text

Check Scientific name in simple search and use Scientific name category for the search field in Advanced search.

Latin names are displayed in facet list in category Scientific name (showing all taxons included in books in result list).


Testers: @AnneSch @AntonioGVH @fwelter @grahamhrbge1670 @heimor @HenningScholz @JFTester @JiriFrank @LarissaS @RalfH

JiriFrank commented 12 years ago

2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search)

COR number: 2.1.2 Testing platform: http://bhl-test.nhm.ac.uk/portal/ Function: Simple search, Advanced search

Description:

Search by scientific name (genus, species, binominal, taxon), search in metadata/ OCR text

Check Scientific name in simple search and use Scientific name category for the search field in Advanced search.

Latin names are displayed in facet list in category Scientific name (showing all taxons included in books in result list).


@GregKenicer @GeoffHarper @KasiaGoral @MartinaMetzger @DanielFisher @NeilWoodcock @SaraPerzley @SaraCarlton @HannaKoivula @PaiviLipsanen

JiriFrank commented 12 years ago

2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search)

COR number: 2.1.2 Testing platform: http://bhl-test.nhm.ac.uk/portal/ Function: Simple search, Advanced search

Description:

Search by scientific name (genus, species, binominal, taxon), search in metadata/ OCR text

Check Scientific name in simple search and use Scientific name category for the search field in Advanced search.

Latin names are displayed in facet list in category Scientific name (showing all taxons included in books in result list).


@PaiviJaakkola @SiniKarki @TiinaOnttonen

HenningScholz commented 12 years ago

This seems to work in most cases, but not all, e.g. Aira caespitosa, gives no results also the facet list shows the name.


Von: JiriFrank [mailto:reply@reply.github.com] Gesendet: Do 12.04.2012 18:17 An: Scholz, Henning Betreff: Re: [bhle] 2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search) (#75)

2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search)

COR number: 2.1.2 Testing platform: http://bhl-test.nhm.ac.uk/portal/ Function: Simple search, Advanced search

Description:

Search by scientific name (genus, species, binominal, taxon), search in metadata/ OCR text

Check Scientific name in simple search and use Scientific name category for the search field in Advanced search.

Latin names are displayed in facet list in category Scientific name (showing all taxons included in books in result list).


Testers: @AnneSch @AntonioGVH @fwelter @grahamhrbge1670 @heimor @HenningScholz @JFTester @JiriFrank @LarissaS @RalfH


Reply to this email directly or view it on GitHub: https://github.com/bhle/bhle/issues/75#issuecomment-5094655

grahamtestrbge commented 12 years ago

This is working as far as I can see. Note that the uBio does not pick up all names on a page, but understand this is not crucial at the moment. Facet list working as expected.

Re-tested using "Hordeum vulgare" as test name. Search for "Hordeum vulgare" in Simple Search with Scientific name box checked : 1 book returned. Search for "Hordeum vulgare" in Advanced Search set to scientific name, no results. Something not quite right.

Opened book that was retrieved in Simple Search to locate references to "Hordeum vulgare" : search inside function not operating as expected, instead all search terms entered when set to search, put an arrow at Page 1 of the retrieved book, and this is not opening when clicked.

LarissaS commented 12 years ago

Sorry, it was more for this function: Now after email of Henning I see this arrow! This is really difficult to find, how you want to explain all users where to look for?! It works but not properly. For example, I open Vinson and in facet list I see a lot of scirntific names. I use Attus (13 results) and via advanced search I indeed get Vinson back, but it's not shown on what pages I can find Attus. So I search content again for Attus and get yellow point with white box where nothing is written and in the box left it's written no names found! So for me this function is not trustable!

-----Original Message----- From: JiriFrank [mailto:reply@reply.github.com] Sent: 11 апреля 2012 г. 16:37 To: Smirnova Larissa Subject: Re: [bhle] 2.1.2 - Search content by scientific (binominal, Latin) name (renamed, Advanced search) (#75)

Done and testable.


Reply to this email directly or view it on GitHub: https://github.com/bhle/bhle/issues/75#issuecomment-5070137


17/4/2012 - Filtered through antispam by ICT

KasiaGoral commented 12 years ago

Works fine.