OpenSextant / Xponents

Geographic Place, Date/time, and Pattern entity extraction toolkit along with text extraction from unstructured data and GIS outputters.
Apache License 2.0
44 stars 7 forks source link

Improved connection with Solr 8.x and future "contributions" sections of Solr manual #57

Open mubaldino opened 4 years ago

mubaldino commented 4 years ago

Type of Feature:

Description of Feature I see the success of the Tagger handler (follow on of SolrTextTagger). Its great to see the geonames reference, etc. but the preservation of the "naive tagger" mention and not much more is a gaping hole to be filled. https://lucene.apache.org/solr/guide/8_6/the-tagger-handler.html#tagger-performance-tips

We can list a handful of successful NLP and other uses of the TextTagger. The main example from here is the use of it in our various OpenSextant (Xponents, Gate Toolbox, etc) implementations and the production ready packaging such as is here: https://hub.docker.com/r/mubaldino/opensextant

So what is needed is to understand how to register this interest with the Solr committers and what the sort of connection is between Solr and its users. Ideally, a "contributor" could be someone that contributes applications of Solr that are registered/vetted in a new part of the Community portion of the Solr site. https://lucene.apache.org/solr/community.html#how-to-contribute -- I can see how I can contribute to the Solr code base. I have no interest or time there. So I see the https://lucene.apache.org/solr/ home page missing a venue for its community to understand who is building on top of and applying Solr.

Model: See spaCy.io Universe (contributors are folded in directly with the project) https://spacy.io/universe . This home page has a different feel completely for how a dev community operates and highlights the broader sense of contributor.

I hate to see Solr fall behind,... but it is hard to be heard if you are not a committer.

Marc

dsmiley commented 4 years ago

Succinctly put, are you looking for the Solr website to have a list of all 3rd party Solr integrations?

Something very close to this exists: https://solr.cool As Solr's package manager starts accelerating in usage, it will become commonplace for people to add 3rd party packages, which isn't the case today. This is accelerating quickly as popular 1st party plugins like the DIH are moving to 3rd party. Solr.cool will very likely become the defacto place to register them, and it'll get lots of outbound links from the Solr Ref Guide, and maybe even the Solr website too.