ericleasemorgan / reader

Distant Reader, a tool for using & understanding a corpus
GNU General Public License v2.0

public carrels #177

Closed by ericleasemorgan 3 years ago

ericleasemorgan commented 3 years ago

Alas, the study carrels in the public interface do not seem to show any keywords. Is this by design?

dbrower commented 3 years ago

It is by oversight. I didn’t notice public carrels have different metadata than private carrels.

dbrower commented 3 years ago

@ericleasemorgan moved this ticket to where all the other tickets are.

dbrower commented 3 years ago

I added keywords to the harvest, but there is a problem with the uniqueness of keywords. The original code I modeled mine after (reader-carrel/bin/table.sh) pulls the 5 most frequently used words out of the carrel database. The issue is that many words may be tied at the same (maximal) number of occurrences, in which case 5 of the tied words are selected. But which 5 are selected is not consistent: sometimes it gives a different selection.

For example, the carrel "eb-02" has 53 words that occur 10 times (the maximum for this carrel). Only 5 of these are selected to be the "keywords", and these 5 are selected seemingly at random.

Other than this, I think keywords are done.
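
One possible way to make the tied selection repeatable is to add a second sort key, so that words with equal counts are always returned in the same order. The sketch below is only an illustration under assumptions: the table name `keywords` and its single `keyword` column are hypothetical and are not taken from the actual carrel database schema or from table.sh. It uses an in-memory SQLite database with toy data.

```python
import sqlite3


def top_keywords(conn, limit=5):
    """Return the `limit` most frequent keywords, breaking ties alphabetically."""
    rows = conn.execute(
        """
        SELECT keyword, COUNT(*) AS n
        FROM keywords                 -- hypothetical table name
        GROUP BY keyword
        ORDER BY n DESC, keyword ASC  -- second key makes tied rows deterministic
        LIMIT ?
        """,
        (limit,),
    ).fetchall()
    return [keyword for keyword, _ in rows]


# Toy data: six words tied at two occurrences each, plus one rarer word.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE keywords (keyword TEXT)")
words = ["zeta", "alpha", "delta", "beta", "gamma", "epsilon"] * 2 + ["omega"]
conn.executemany("INSERT INTO keywords VALUES (?)", [(w,) for w in words])

print(top_keywords(conn))
```

Without the secondary `keyword ASC` key, SQL leaves the order of tied rows unspecified, which would be consistent with the seemingly random selection described above. Alphabetical order is an arbitrary tie-breaker; any stable key would serve the same purpose.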