PathwayCommons / factoid

A project to capture biological pathway data from academic papers
https://biofactoid.org
MIT License

Text generation #957

Open maxkfranz opened 3 years ago

maxkfranz commented 3 years ago

Generation of text from the model may be interesting for:

Potential technologies:

jvwong commented 3 years ago

While doing some mockups for #937, I was thinking about a document-level redesign of the Explorer and came up with the below.

The point is:

  1. there's more than one place to draw raw information from
  2. the information we use should emphasize our niche (interactions) rather than regurgitating the article, which actually hurts search and discoverability

maxkfranz commented 3 years ago

Re. templating:

A activates B via X in CONTEXT.

A short template summary could be shown in search results:

The interaction text is the most salient thing to include, but it's not that interesting because it's out of context. By including an author's one-line context, the search results could be more compelling.
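
The "A activates B via X in CONTEXT." template could be filled in roughly as follows. This is a minimal sketch in Python for illustration only; `Interaction`, its fields, and `render_interaction` are hypothetical names, not factoid's actual model or API.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Interaction:
    """Hypothetical stand-in for one interaction in the model."""
    source: str                      # A
    target: str                      # B
    verb: str = "interacts with"     # e.g. "activates", "inhibits"
    mechanism: Optional[str] = None  # X, e.g. "phosphorylation"


def render_interaction(ix: Interaction, context: Optional[str] = None) -> str:
    """Fill 'A <verb> B via X in CONTEXT.', skipping any part that is missing."""
    parts = [ix.source, ix.verb, ix.target]
    if ix.mechanism:
        parts += ["via", ix.mechanism]
    if context:
        parts += ["in", context]
    return " ".join(parts) + "."


print(render_interaction(Interaction("A", "B", "activates", "X"), "CONTEXT"))
```

Dropping the optional parts keeps the sentence readable when an author has not supplied a mechanism or a one-line context.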

maxkfranz commented 3 years ago

For later: Most articles contain one or two interactions, so they fit the template well. Nice and short. Outliers contain many interactions. Eventually, it may be nice to include only the most salient interaction or two in the text summary, rather than umpteen interactions.
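
Capping the summary at the most salient interaction or two might look something like the sketch below (Python, for illustration only). How saliency would be scored is not decided here, so the numeric score, `summarize`, and the "(+N more)" suffix are all assumptions.

```python
from typing import List, Tuple


def summarize(scored_sentences: List[Tuple[str, float]], limit: int = 2) -> str:
    """Keep the `limit` highest-scoring sentences and note how many were left out.

    Each item is (sentence, saliency); the saliency score is hypothetical.
    """
    # sorted() is stable, so sentences with equal scores keep their input order.
    ranked = sorted(scored_sentences, key=lambda pair: pair[1], reverse=True)
    top = [sentence for sentence, _ in ranked[:limit]]
    remaining = len(ranked) - len(top)
    text = " ".join(top)
    if remaining > 0:
        text += f" (+{remaining} more)"
    return text


print(summarize([
    ("A activates B.", 0.9),
    ("C inhibits D.", 0.5),
    ("E binds F.", 0.7),
]))
```

Articles with one or two interactions pass through unchanged; only the outliers get truncated.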

Alternatively, we could prompt the user to make the context a one-line, all-inclusive summary:

jvwong commented 2 years ago

Related user feedback:

Is “USP1 interacts with FANCD2 via deubiquitination” equivalent to “USP1 deubiquitinates FANCD2”? My first impression was that the former statement would mean USP1 only interacts with deubiquitinated FANCD2.
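
If the two statements are meant to be equivalent, one way to avoid the ambiguity would be to generate the direct verb form whenever one is known, and fall back to "interacts with ... via ..." otherwise. A rough sketch in Python, for illustration only; `DIRECT_VERBS` and `describe` are hypothetical and the mapping is not factoid's actual vocabulary.

```python
from typing import Optional

# Hypothetical mechanism-to-verb mapping; entries are illustrative only.
DIRECT_VERBS = {
    "deubiquitination": "deubiquitinates",
    "ubiquitination": "ubiquitinates",
    "phosphorylation": "phosphorylates",
    "dephosphorylation": "dephosphorylates",
}


def describe(source: str, target: str, mechanism: Optional[str] = None) -> str:
    """Prefer 'A <verb>s B' when the mechanism has a direct verb form."""
    verb = DIRECT_VERBS.get(mechanism) if mechanism else None
    if verb:
        return f"{source} {verb} {target}."
    if mechanism:
        # No direct verb known: fall back to the generic phrasing.
        return f"{source} interacts with {target} via {mechanism}."
    return f"{source} interacts with {target}."


print(describe("USP1", "FANCD2", "deubiquitination"))
```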

jvwong commented 9 months ago

See https://github.com/PathwayCommons/factoid/discussions/1195