crossminer / scava

https://eclipse.org/scava/
Eclipse Public License 2.0
18 stars 13 forks source link

Non-project related readers #287

Open creat89 opened 5 years ago

creat89 commented 5 years ago

Hello,

We, me and @Danny2097, have two readers that are not related to any project in particular:

These readers use York's Workflow to work but also tools developed for the analysis of text, such as plain text, code detector and indexing. In the case of Stack Overflow it is impossible (or very expensive) to use deltas, although a systematic running (every X days) could be used. So, we are wondering how to include them in the platform.

Both readers are based as well on builders in order to filter on demand the posts to retrieve form these information sources.

Any ideas or suggestions?

Although it might not be the best idea, I could create a metric that could manage the frequency in which these readers are called to run.

davidediruscio commented 5 years ago

I think we discussed yesterday this, right? The idea is to provide users with a web page to specify a string that will be used to query elasticsearch indexes.