sgsinclair / Voyant

GNU General Public License v3.0
207 stars 53 forks source link

Concept Maps and Semantic Linkage #452

Open ghost opened 4 years ago

ghost commented 4 years ago

REF: https://github.com/sgsinclair/VoyantServer/issues/33

My initial github issue was in the wrong repo. I would like to continue the discussion here. @sgsinclair Where in the code do you recommend I start to explore?

I would like to be able to right click on the link between two word bubbles and perform a semantic lookup. image

Multiple changes required. I will start to list these first. 1.) right mouse click handler 2.) semantic interop function: returns a string, requires two strings as input variables

What do you think?

Thank you, Rob

sgsinclair commented 4 years ago

Thanks Rob. What semantic lookup did you have in mind? WordNet? Adding the right-click and menu is fairly trivial, but I'm not sure what call you have in mind in the back-end or exactly what kind of data you'd want return. What would be an example for "text" and "analysis"?

ghost commented 4 years ago

WordNet is one possibility. We could diverge out the hypernym_paths() of "text" synsets and "analysis" synsets and seek intersection of convergence. I have been exploring NLTK with the similarity measure of wup_similarity, but this metric is not powerful enough to converge accurately. @sgsinclair What are your thoughts? Can you point me towards some journal DOI's with respect to polysemy and ontological approaches to semantic relations?

found 26 of 100
         Synset('criticism.n.03') : a written evaluation of a work of literature | 0.8461538461538463
         Synset('lipogram.n.01') : a text that excludes a particular letter or particular letters of the alphabet | 0.8
         Synset('column.n.08') : a page or text that is vertically divided | 0.8
         Synset('installment.n.03') : a part of a published serial | 0.8
         Synset('back_matter.n.01') : written matter following the main text of a book | 0.7857142857142857
         Synset('writing.n.02') : the work of a writer; anything expressed in letters of the alphabet (especially when considered from the point of view of style and effect) | 0.8333333333333334
         Synset('lyric.n.01') : the text of a popular song or musical-comedy number | 0.8
         Synset('soft_copy.n.01') : (computer science) matter that is in a form that a computer can store or display it on a computer screen | 0.7857142857142857
         Synset('explication_de_texte.n.01') : a method of literary criticism that analyzes details of a text in order to reveal its structure and meaning | 0.7857142857142857
         Synset('cookie.n.03') : a short line of text that a web site puts on your computer's hard drive when you access the web site | 0.8
         Synset('matter.n.06') : written works (especially in books or magazines) | 0.8461538461538463
         Synset('letter.n.01') : a written message addressed to a person or organization | 0.8
         Synset('draft.n.05') : any of the various versions in the development of a written work | 0.8
         Synset('addendum.n.01') : textual matter that is added onto a publication; usually at the end | 0.7857142857142857
         Synset('hard_copy.n.01') : (computer science) matter that is held in a computer and is typed or printed on paper | 0.7857142857142857
         Synset('stanza.n.01') : a fixed number of lines of verse forming a unit of a poem | 0.8
         Synset('line.n.05') : text consisting of a row of words written across a page or computer screen | 0.8
         Synset('typescript.n.01') : typewritten matter especially a typewritten copy of a manuscript | 0.7857142857142857
         Synset('recitation.n.01') : written matter that is recited from memory | 0.7857142857142857
         Synset('textual_criticism.n.01') : comparison of a particular text with related materials in order to establish authenticity | 0.7857142857142857
         Synset('new_criticism.n.01') : literary criticism based on close analysis of the text | 0.7857142857142857
         Synset('review.n.02') : an essay or article that gives a critical evaluation (as of a book or play) | 0.7857142857142857
         Synset('dictation.n.03') : matter that has been dictated and transcribed; a dictated passage | 0.7857142857142857
         Synset('front_matter.n.01') : written matter preceding the main text of a book | 0.7857142857142857
         Synset('copy.n.03') : matter to be printed; exclusive of graphical materials | 0.8
         Synset('electronic_text.n.01') : text that is in a form that computer can store or display on a computer screen | 0.8