Schwittleymani / ECO

Electronic Chaos Oracle
https://schwittlick.net/eco
Apache License 2.0

Test and Research Question Classification #122

Open schwittlick opened 7 years ago

schwittlick commented 7 years ago

when the input is a question, the reply shouldn't begin by repeating the input.
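
One way this could be handled, as a minimal sketch only: strip a leading copy of the input from the generated reply before showing it. The function name `strip_echo` and the set of separator characters are assumptions for illustration, not part of ECO.

```python
def strip_echo(user_input: str, reply: str) -> str:
    """Remove a leading copy of the input from the reply, if present.

    Hypothetical helper: the comparison is case-insensitive and the
    separator characters stripped afterwards are an arbitrary choice.
    """
    prefix = user_input.strip()
    candidate = reply.lstrip()
    # Compare only the first len(prefix) characters of the reply.
    if prefix and candidate[:len(prefix)].lower() == prefix.lower():
        return candidate[len(prefix):].lstrip(" \t\n,.:;-")
    # The reply does not start with the input, so leave it untouched.
    return reply


cleaned = strip_echo("why is the sky blue?",
                     "Why is the sky blue? because of scattering")
```

Here `cleaned` keeps only the part of the reply that follows the echoed question. A caller would apply this only when the input was detected as a question.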

schwittlick commented 7 years ago

http://cogcomp.cs.illinois.edu/Data/QA/QC/
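
A rough sketch of how the labelled questions from that dataset could be read, assuming each line has the form `COARSE:fine question text` (for example `DESC:manner How did ...`). The function name `parse_qc_line` is hypothetical, and the line format should be checked against the downloaded files before relying on it.

```python
def parse_qc_line(line: str):
    """Split one labelled line into (coarse label, fine label, question).

    Assumes the layout 'COARSE:fine question text'; this is an
    assumption about the file format, not something ECO defines.
    """
    label, _, question = line.strip().partition(" ")
    coarse, _, fine = label.partition(":")
    return coarse, fine, question


sample = "DESC:manner How did serfdom develop in and then leave Russia ?"
coarse, fine, question = parse_qc_line(sample)
```

The coarse label could then be used to pick a reply seed per question type.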

schwittlick commented 7 years ago

some text parser that seems worth checking out. written in java though:

http://nlp.stanford.edu/software/lex-parser.shtml

here are python implementations of that (jython/jpype):

https://github.com/vpekar/stanford-parser-in-jython
http://projects.csail.mit.edu/spatial/Stanford_Parser

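schwittlick commented 7 years ago

def answer_seed(sentence):
    """Return the seed text for the reply, or None if no rule applies."""
    text = sentence.strip().lower()
    # Treat the input as a question if it ends with '?' or starts with how/why.
    if text.endswith("?") or text.startswith(("how", "why")):
        if text.startswith("how"):
            return "it is"
        if text.startswith("why"):
            return "because"
    return None

schwittlick commented 7 years ago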

some more question classification/detection happening here https://github.com/mrzl/ECO/issues/160

schwittlick commented 7 years ago

dataset containing 400,000 similar question pairs, i.e. 800,000 questions, that could be used for training a CNN to detect questions automatically: https://data.quora.com/First-Quora-Dataset-Release-Question-Pairs
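
A minimal sketch of flattening the question pairs into a single list of questions, assuming the release is a tab-separated file with `question1` and `question2` columns. The column names, the `collect_questions` helper, and the two sample rows below are assumptions made up for illustration, not real rows from the dataset.

```python
import csv
import io

# Tiny in-memory stand-in for the real file; the rows are invented.
SAMPLE_TSV = (
    "id\tqid1\tqid2\tquestion1\tquestion2\tis_duplicate\n"
    "0\t1\t2\tHow do I learn Python?\tWhat is the best way to learn Python?\t1\n"
    "1\t3\t4\tWhy is the sky blue?\tHow do planes fly?\t0\n"
)


def collect_questions(tsv_file):
    """Return every question from both columns of each pair, in order."""
    reader = csv.DictReader(tsv_file, delimiter="\t")
    questions = []
    for row in reader:
        questions.append(row["question1"])
        questions.append(row["question2"])
    return questions


questions = collect_questions(io.StringIO(SAMPLE_TSV))
```

Each pair contributes two questions, which is where the 800,000 figure comes from. Since the file only contains questions, training a detector would also need non-question sentences from some other source.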