natalink / mwe_noske

0 stars 0 forks source link

NoSke or|and KonText #9

Closed natalink closed 6 years ago

natalink commented 6 years ago

Guys, because there is a high probability that the scheme got changed for the PARSEME corpora, I suggest that @Ansa211 makes a page devoted to it (smth like https://ufal.mff.cuni.cz/lindat-kontext/parseme ) and reflect all the changes that may come there. You can put the links to the corpora locations there as well. (both KonText and NoSke) To the discussion should the data be hosted on KonText or Behrang's NoSke, I would argue for both.

Pro's for KonText:

Con's for KonText:

Pro's for NoSke:

Con's for NoSke:

CON for both (or maybe I'm mistaken, correct me): There is no way how to link to all the corpora at once. For NoSke, you need to change the corpora in the first query form. In KonText the corpora are stored under "Multiple corpora", but as far as I remember when I worked on KonText, I could not link to it, I could link only to "Monolingual" corplist.

languagerecipes commented 6 years ago

hi Natalia,

As said adding the data to noske is not a problem and there is no need for justification; I will add a footnote to the paper with a proper link address. From the paper, can you try to remove comments and unwanted information and try to finalize sections? Who is adding screenshots? Not much time left for submission :)

Thank you

Behrang

On Tue, Dec 19, 2017 at 4:25 AM, Natalia notifications@github.com wrote:

Guys, because there is a high probability that the scheme got changed for the PARSEME corpora, I suggest that @Ansa211 https://github.com/ansa211 makes a page devoted to it (smth like https://ufal.mff.cuni.cz/ lindat-kontext/parseme ) and reflect all the changes that may come there. You can put the links to the corpora locations there as well. (both KonText and NoSke) To the discussion should the data be hosted on KonText or Behrang's NoSke, I would argue for both.

Pro's for KonText:

  • KonText is well maintained, new features are added regularly
  • maybe some day we can have possibility to browse trees
  • there can be a link from LINDAT item in a repo to the query system

Con's for KonText:

  • user has to spend some time trying to login. If her/his university may have some issues with Shibboleth, it may be necessary to contact the Uni IT team, so at some point the user may give up:)
  • The feature Ansa mentioned at the paper - overlaps/sub-hits filter works in NoSke, but not in KonText, right? It's not a big con though, since I believe not many users will be so proficient in SkE platform to use it.

Pro's for NoSke:

  • the user will not have to be logged in to access such basic functions as Frequency or Collocations
  • overlaps/sub-hits filter thing

Con's for NoSke:

  • only Behrang maintains it, no new features are added
  • there is no such thing as a corpus list like in KonText, right? So you had to send us links to all the corpora

CON for both (or maybe I'm mistaken, correct me): There is no way how to link to all the corpora at once. For NoSke, you need to change the corpora in the first query form. In KonText the corpora are stored under "Multiple corpora", but as far as I remember when I worked on KonText, I could not link to it, I could link only to "Monolingual" corplist.

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/natalink/mwe_noske/issues/9, or mute the thread https://github.com/notifications/unsubscribe-auth/AHuwEzAjhxwxHmAWyTnDKRqZ7FTb0WhYks5tByzHgaJpZM4RGZF4 .

languagerecipes commented 6 years ago

this is not really relevant to the paper. the message is clear and you have already propose it in the earlier 2016 paper of yours; maybe you can say how this work is different from your 2016 paper instead?!

On Tue, Dec 19, 2017 at 4:25 AM, Natalia notifications@github.com wrote:

Guys, because there is a high probability that the scheme got changed for the PARSEME corpora, I suggest that @Ansa211 https://github.com/ansa211 makes a page devoted to it (smth like https://ufal.mff.cuni.cz/ lindat-kontext/parseme ) and reflect all the changes that may come there. You can put the links to the corpora locations there as well. (both KonText and NoSke) To the discussion should the data be hosted on KonText or Behrang's NoSke, I would argue for both.

Pro's for KonText:

  • KonText is well maintained, new features are added regularly
  • maybe some day we can have possibility to browse trees
  • there can be a link from LINDAT item in a repo to the query system

Con's for KonText:

  • user has to spend some time trying to login. If her/his university may have some issues with Shibboleth, it may be necessary to contact the Uni IT team, so at some point the user may give up:)
  • The feature Ansa mentioned at the paper - overlaps/sub-hits filter works in NoSke, but not in KonText, right? It's not a big con though, since I believe not many users will be so proficient in SkE platform to use it.

Pro's for NoSke:

  • the user will not have to be logged in to access such basic functions as Frequency or Collocations
  • overlaps/sub-hits filter thing

Con's for NoSke:

  • only Behrang maintains it, no new features are added
  • there is no such thing as a corpus list like in KonText, right? So you had to send us links to all the corpora

CON for both (or maybe I'm mistaken, correct me): There is no way how to link to all the corpora at once. For NoSke, you need to change the corpora in the first query form. In KonText the corpora are stored under "Multiple corpora", but as far as I remember when I worked on KonText, I could not link to it, I could link only to "Monolingual" corplist.

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/natalink/mwe_noske/issues/9, or mute the thread https://github.com/notifications/unsubscribe-auth/AHuwEzAjhxwxHmAWyTnDKRqZ7FTb0WhYks5tByzHgaJpZM4RGZF4 .