ProjetPP / PPP-QuestionParsing-Grammatical

Question Parsing module for the PPP using a grammatical approch
GNU Affero General Public License v3.0
33 stars 11 forks source link

“What is 802.11?” #113

Closed progval closed 9 years ago

progval commented 9 years ago

parsed as (?,definition,802.11)

http://askplatyp.us/?lang=en&q=What+is+802.11%3F

Ezibenroc commented 9 years ago

The problem come from the Stanford parser. This is even worse for “What is P=NP?”. We could try to train it on such cases. An other solution would be to do some preprocessing: if there is a part of the sentence with numbers or symbols like =, then handle it as if it was within quotations.

yhamoudi commented 9 years ago

(Partially) Fixed in https://github.com/ProjetPP/PPP-QuestionParsing-Grammatical/tree/reverse_predicates (but the problem comes from the stanford parser)

Possible solution for such cases: apply the algorithm used with quotations:

If someone wants to implement this, please do it in branch reverse_predicates in file preprocessingMerge.py.

yhamoudi commented 9 years ago

See https://github.com/ProjetPP/PPP-QuestionParsing-Grammatical/issues/118

Ezibenroc commented 9 years ago

+1 for this solution. But this does not concern this branch. You should do one pull request per problem to avoid huge modifications.

yhamoudi commented 9 years ago

But this does not concern this branch. You should do one pull request per

I will remove the advice once the branch will be merged. It's just to avoid that someone implement the solution in master before, otherwise there will be a big conflict.