flaxsearch / BioSolr

A project aiming "to significantly advance the state of the art with regard to indexing and querying biomedical data with freely available open source software"
Apache License 2.0
77 stars 26 forks source link

empty efu_uri? #8

Open theyaa opened 7 years ago

theyaa commented 7 years ago

If I use a local OWL file, and leave the value for the efu_uri empty. Will the plugin be able to parse the text in the document I send to properly classify the document according to the OWL ontology?

Best.

Theyaa

mattflax commented 7 years ago

No, that is not something this plugin can do. I suggest you look at the Zooma tool, which may be useful to you. See http://www.ebi.ac.uk/spot/zooma/ for more details.

Thanks.

dsmiley commented 7 years ago

Sorry this may be out of scope of this repo/issues here but related to Zooma... is the code behind that available? I was poking around the site but I didn't notice it.

I developed something called the SolrTextTagger which seems to be a little similar and I'm curious about what Zooma is doing internally -- how it works.

tburdett commented 7 years ago

Hi David, yes - the Zooma code is here: https://github.com/EBISPOT/zooma. I'd like to have a look at the SolrTextTagger in a bit more detail but my immediate reaction is that they're solving different problems in a manner that is potentially nicely complementary

dsmiley commented 7 years ago

Thanks. Wow there's a lot of code there; it will take some time to explore to find the parts similar to the SolrTextTagger -- which is scanning text to look for words in a dictionary, in a nutshell. I agree that it's likely complementary.