uhh-lt / cam

The Comparative Argument Machine
http://ltdemos.informatik.uni-hamburg.de/cam/
MIT License

try to generate candidates for comparison #86

Closed alexanderpanchenko closed 4 years ago

alexanderpanchenko commented 6 years ago

Given a word like 'python', generate a list of comparison candidates, as in Google's 'python vs ...' autocomplete.

  1. Get all sentences containing the target word (python).

  2. Classify them (first word = python, second word = last / first noun in the sentence, text = input sentence), OR classify them iterating over all nouns in the sentence (first word = python, second word = i-th noun in the sentence, text = input sentence).

  3. Rank the nouns found in the sentences by the total number of comparative sentences (with a high threshold). No normalization is needed; just take the raw sentence counts.
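The three steps could be sketched roughly as follows; everything here (sentence retrieval, noun extraction, and the comparative classifier) is a hypothetical stub standing in for the actual CAM components, so only the shape of the pipeline is shown:

```python
from collections import Counter

def generate_candidates(target, sentences, extract_nouns, is_comparative, min_count=2):
    """Rank comparison candidates for `target` by raw comparative-sentence counts.

    sentences      -- sentences already known to contain `target` (step 1)
    extract_nouns  -- stand-in for a POS tagger returning the nouns of a sentence
    is_comparative -- stand-in for the pairwise classifier from step 2
    min_count      -- the "high threshold" from step 3
    """
    counts = Counter()
    for sentence in sentences:
        for noun in extract_nouns(sentence):
            if noun != target and is_comparative(target, noun, sentence):
                counts[noun] += 1
    # Step 3: rank by raw sentence counts; no normalization, per the issue description.
    return [(n, c) for n, c in counts.most_common() if c >= min_count]

# Toy demonstration with stubbed components:
sents = ["python vs perl speed", "python is better than perl", "python vs java"]
nouns = lambda s: [w for w in s.split() if w in {"perl", "java", "speed"}]
comparative = lambda t, n, s: "vs" in s or "better" in s
print(generate_candidates("python", sents, nouns, comparative, min_count=1))
```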

alexanderpanchenko commented 6 years ago

A caching mechanism is important so that we do not re-compute everything from scratch every time.

mschildw commented 6 years ago

I now tried a different approach:

  1. I query sentences with the following query: "text:(\<object> AND vs)", where \<object> is, for example, "python".

  2. I take the nouns (NN) where the following pattern matches: ( (vs|vs.) candidate | candidate (vs|vs.) ). These two steps alone deliver quite good results for comparison candidates (for python): [('perl', 40), ('java', 23), ('ruby', 22), ('php', 19), ('boa', 16), ('alligator', 15), ('julia', 14), ('net', 9), ('c++', 6), ('visual', 5), ('javascript', 4), ('gatoroid', 2), ('crocodile', 2), ('ruby ruby', 2), ('matlab gc', 2), ('brython', 2), ('cat', 2), ('lua', 2), ('qml', 2), ('jython', 1), ('lisp', 1), ('arc', 1), ('tiger', 1), ('rhinoscript', 1), ("print 'weave", 1), ('matlab/eeglab', 1), ('node', 1), ('python programs', 1), ('aqueon', 1), ('africanized honeybee', 1), ('gator', 1), ('gql', 1), ('profiling pypy', 1), ('scheme', 1), ('alligator watch', 1), ('deer', 1), ('octave', 1), ('nspr', 1), ('stones', 1), ('jlizard', 1), ('thinking upside down ruby', 1), ('ruby deathmatch', 1), ('kruger', 1), ('ruby performance', 1), ('cockatoo photos', 1), ('python-novaclient', 1), ('prothon', 1), ('film boa', 1), ('cython', 1), ('sas', 1), ("print 'f2py", 1), ('pycuda', 1)]
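For illustration, the pattern in step 2 could be approximated with a plain regular expression over the retrieved sentences. This stands in for the actual Elasticsearch query and NN tagging; the token pattern here is an assumption, not the CAM code:

```python
import re
from collections import Counter

def vs_candidates(target, sentences):
    """Count candidates matching '<candidate> vs <target>' or '<target> vs <candidate>'."""
    counts = Counter()
    # The character class is a rough stand-in for "noun-like token" (covers c++, node.js, ...).
    pattern = re.compile(
        r"(\w[\w+#./-]*)\s+vs\.?\s+{0}\b|{0}\s+vs\.?\s+(\w[\w+#./-]*)".format(re.escape(target)),
        re.IGNORECASE,
    )
    for s in sentences:
        for m in pattern.finditer(s):
            candidate = (m.group(1) or m.group(2)).lower()
            counts[candidate] += 1
    return counts.most_common()

print(vs_candidates("python", [
    "Python vs Perl: which is faster?",
    "I benchmarked perl vs python yesterday.",
    "python vs. ruby performance",
]))
```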

  3. I still need to filter out some candidates like "kruger". Do you think common hypernyms (WordNet) could be helpful for this? Are there standard functions to get common hypernyms for words? I only found one for synsets...

  4. Another approach could be to query the sentences containing the object and a candidate and count the sentences classified as "BETTER" or "WORSE", but that is very costly.

What do you think about the first 2 steps?

mschildw commented 6 years ago

WordNet seems not to be useful for the python example. After filtering out all candidates that do not share a common hypernym with python, only these are left: [('java', 23), ('ruby', 22), ('boa', 16), ('alligator', 15), ('net', 9), ('cat', 2), ('crocodile', 2), ('sas', 1), ('tiger', 1), ('lisp', 1), ('arc', 1), ('node', 1), ('stones', 1), ('octave', 1), ('deer', 1), ('gator', 1), ('scheme', 1)]

For many candidates there was no hypernym at all (e.g. perl, lua, c++), as can also be seen here: http://wordnetweb.princeton.edu/perl/webwn
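On the "standard functions" question: NLTK does expose common hypernyms only on the synset level (`wordnet.synsets(word)` plus `Synset.common_hypernyms(...)`), so for words one has to iterate over the synsets of both words and union the results. To keep this sketch dependency-free, a tiny hand-built hypernym map stands in for WordNet here; the filtering logic is the same:

```python
# Toy stand-in for WordNet: maps a word to the set of all its hypernyms.
# (With NLTK one would collect these from wn.synsets(w) via the hypernym closure
# or Synset.common_hypernyms; many tech terms simply have no entry, as noted above.)
HYPERNYMS = {
    "python":    {"snake", "reptile", "animal"},
    "boa":       {"snake", "reptile", "animal"},
    "alligator": {"reptile", "animal"},
    "kruger":    set(),   # no usable entry -> no hypernyms
    "perl":      set(),   # missing from WordNet entirely
}

def shares_hypernym(word, target, hypernyms=HYPERNYMS):
    """True if `word` and `target` have at least one hypernym in common."""
    return bool(hypernyms.get(word, set()) & hypernyms.get(target, set()))

candidates = ["boa", "alligator", "kruger", "perl"]
kept = [c for c in candidates if shares_hypernym(c, "python")]
print(kept)  # -> ['boa', 'alligator']
```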

alexanderpanchenko commented 6 years ago

I would expect that WordNet is not useful: its coverage is quite limited.

However, distributional models can be useful.

Here you will find the word similarities (AKA the distributional thesaurus, JoBimText) computed exactly from our corpus:

http://ltdata1.informatik.uni-hamburg.de/depcc/distributional-models/dependency_lemz-true_cooc-false_mxln-110_semf-true_sign-LMI_wpf-1000_fpw-1000_minw-5_minf-5_minwf-2_minsign-0.0_nnn-200/SimPruned/

You can try to use them to generate more candidates.
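Assuming the DT files expand to tab-separated word1/word2/similarity triples (the exact format should be checked against the download), using them to prune candidates in step 3 could look like this:

```python
import csv
import io

# A few fake DT lines in the assumed word1<TAB>word2<TAB>score format.
dt_data = "python\tperl\t0.81\npython\truby\t0.74\npython\tkruger\t0.02\n"

def load_neighbours(fh, min_sim=0.1):
    """Read similarity triples into {word1: {word2, ...}}, dropping weak pairs."""
    neighbours = {}
    for w1, w2, sim in csv.reader(fh, delimiter="\t"):
        if float(sim) >= min_sim:
            neighbours.setdefault(w1, set()).add(w2)
    return neighbours

dt = load_neighbours(io.StringIO(dt_data))
candidates = ["perl", "kruger", "ruby"]
filtered = [c for c in candidates if c in dt.get("python", set())]
print(filtered)  # -> ['perl', 'ruby']
```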

On Oct 29, 2018, at 1:48 PM, Matthias Schildwächter notifications@github.com wrote:

> WordNet seems not to be useful for the python example. After filtering out all candidates that do not share a common hypernym with python, only these are left:
>
> [('java', 13), ('alligator', 4), ('ruby', 4), ('scheme', 1), ('crocodile', 1), ('tiger', 1), ('cat', 1), ('gator', 1), ('kc', 1), ('deer', 1)]
>
> For many candidates there was no hypernym at all (e.g. perl, lua, c++), as can also be seen here: http://wordnetweb.princeton.edu/perl/webwn

mschildw commented 6 years ago

For the second filter approach the following comparison candidates were selected: ['lisp', 'lua', 'scheme', 'perl', 'visual', 'jython', 'net', 'cython', 'ruby', 'java', 'javascript', 'php', 'node', 'boa', 'julia', 'alligator', 'qml', 'python programs', 'cat', 'deer', 'crocodile', 'octave', 'tiger', 'arc', 'sas', 'gator', 'aqueon', 'prothon', 'ruby ruby', 'stones', 'brython', 'ruby performance', 'gql', 'nspr', 'pycuda']

They are sorted by the number of comparative sentences found for python and the candidate. If only candidates with more than 40 comparative sentences are shown, probably the best ones get presented: ['lisp', 'lua', 'scheme', 'perl', 'visual', 'jython', 'net', 'cython', 'ruby', 'java', 'javascript', 'php', 'node', 'boa', 'julia']

Comparing with Google: (screenshot of Google's "python vs" autocomplete suggestions)

Only r, c++, matlab and go are not found, so 60% are covered, and in addition some more candidates are found which could also be interesting.

alexanderpanchenko commented 6 years ago

I find this approach very interesting. It would be really great to show more examples of these…

Use the DT JoBimText to filter in step 3 (see my other mail for details).

On Oct 29, 2018, at 11:43 AM, Matthias Schildwächter notifications@github.com wrote:

> I now tried a different approach:
>
> I query sentences with the following query: "text:(\<object> AND vs)", where \<object> is, for example, "python".
>
> I take the nouns (NN) where the following pattern matches: ( (vs|vs.) candidate | candidate (vs|vs.) ). These two steps alone deliver quite good results for comparison candidates (for python): [('perl', 22), ('java', 15), ('php', 13), ('ruby', 9), ('alligator', 7), ('c', 6), ('lua', 4), ('r', 3), ('julia', 2), ('c++', 2), ('haskell', 2), ('crocodile', 1), ('tiger', 1), ('cat', 1), ('deer', 1), ('kruger', 1), ('gator', 1), ('qml', 1), ('ptrace', 1), ('jlizard', 1), ('visual', 1), ('dog', 1), ('kc', 1), ('scheme', 1), ('javascript', 1)]
>
> I still need to filter out some candidates like "kruger". Do you think common hypernyms (WordNet) could be helpful for this? Are there standard functions to get common hypernyms for words? I only found one for synsets...
>
> Another approach could be to query the sentences containing the object and a candidate and count the sentences classified as "BETTER" or "WORSE", but that is very costly.
>
> What do you think about the first 2 steps?

alexanderpanchenko commented 6 years ago

Looks quite good. Actually I think that it is already worth deploying it (to see how it works more realistically and to be able to play with it…)

On Oct 29, 2018, at 5:06 PM, Matthias Schildwächter notifications@github.com wrote:

> They are sorted by the number of comparative sentences found for python and the candidate. If only candidates with more than 40 comparative sentences are shown, probably the best…

mschildw commented 6 years ago

Thanks for the hint with JoBimText, I hope it is easy to figure out how to use it.

About deploying: at the moment it is not really operating in real time; it takes about 15 seconds to process steps 1 and 2. The filtering (step 3) using the BoW classifier feature set takes minutes.

Maybe it is something we have to do beforehand: taking seed words from different domains and searching for their comparison candidates, then continuing with the candidates found, and so on. We could then save the results to a DB or to the file system.
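Such a precomputation could be cached very simply, e.g. as a JSON file keyed by seed word; the names here are illustrative, not the actual CAM code:

```python
import json
import os
import tempfile

def cached_candidates(seed, compute, cache_path):
    """Return candidates for `seed`, running the expensive pipeline only once."""
    cache = {}
    if os.path.exists(cache_path):
        with open(cache_path) as fh:
            cache = json.load(fh)
    if seed not in cache:
        cache[seed] = compute(seed)          # the ~15s+ steps 1-3 would go here
        with open(cache_path, "w") as fh:
            json.dump(cache, fh)
    return cache[seed]

path = os.path.join(tempfile.mkdtemp(), "candidates.json")
calls = []
compute = lambda s: (calls.append(s) or ["perl", "ruby", "java"])
first = cached_candidates("python", compute, path)
second = cached_candidates("python", compute, path)  # served from the cache
print(first == second, len(calls))  # -> True 1
```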

alexanderpanchenko commented 6 years ago

On Oct 29, 2018, at 5:29 PM, Matthias Schildwächter notifications@github.com wrote:

> Thanks for the hint with JoBimText, I hope it is easy to figure out how to use it.

All the files are big, but you can trim them considerably by sorting all the values by their scores, keeping some 20% of the top entries, and removing the remaining 80% of the word pairs.

> About deploying: at the moment it is not really operating in real time; it takes about 15 seconds to process steps 1 and 2. The filtering (step 3) using the BoW classifier feature set takes minutes.

Ok, maybe later then.

> Maybe it is something we have to do beforehand: taking seed words from different domains and searching for their comparison candidates, then continuing with the candidates found, and so on. We could then save the results to a DB or to the file system.
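The trimming suggested above (sort by score, keep only the top fraction of word pairs) could be sketched as follows, again assuming (word1, word2, score) triples:

```python
def prune_top_fraction(triples, keep=0.2):
    """Keep the highest-scoring `keep` fraction of (word1, word2, score) triples."""
    ranked = sorted(triples, key=lambda t: t[2], reverse=True)
    return ranked[: max(1, int(len(ranked) * keep))]

triples = [("python", "perl", 0.81), ("python", "ruby", 0.74),
           ("python", "kruger", 0.02), ("python", "stone", 0.01),
           ("python", "boa", 0.66)]
print(prune_top_fraction(triples, keep=0.4))
# -> [('python', 'perl', 0.81), ('python', 'ruby', 0.74)]
```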

mschildw commented 6 years ago

Do you maybe have an example of how to use the JoBimText DT? That would be great, since that is the last part of the thesis I need to write about (and create content to write about), and I would like to send you the first draft including this part at the end of this week.

I have to set up a local database to use it, right? The trim operation can be achieved using this: http://ltmaggie.informatik.uni-hamburg.de/jobimtext/documentation/pruning/ , right?

alexanderpanchenko commented 6 years ago

No, in your case just download the files I gave the link to (a bunch of archives). You will get a huge set of triples word1:word2:similarity. I would index them using Elasticsearch and use them at stage 3. The JoBimText model includes many more parts that you do not need; the part you need is called the DT.

Sent from my iPhone

On 29. Oct 2018, at 17:50, Matthias Schildwächter notifications@github.com wrote:

> Do you maybe have an example of how to use the JoBimText DT? That would be great, since that is the last part of the thesis I need to write about (and create content to write about), and I would like to send you the first draft including this part at the end of this week.
>
> I have to set up a local database to use it, right? The trim operation can be achieved using this: http://ltmaggie.informatik.uni-hamburg.de/jobimtext/documentation/pruning/ , right?

mschildw commented 6 years ago

Alright, thank you very much for the clarification. I thought I would have to understand how to set up and use JoBimText now. I will have a look at how well that works for filtering the candidates, thanks!