dbs / ris2web

Given a RIS file, generate a web site that displays the citations.
GNU General Public License v3.0
7 stars 2 forks source link

Tags in upper case #13

Open LBST opened 9 years ago

LBST commented 9 years ago

The tags retrieved from Ebsco when downloading records are often are in upper case. Look at the tags for this title: Agnes Macphail and Canadian Working Women. dm

LBST commented 9 years ago

On further reflection, maybe we should not display the "keywords" field since it gives the impression there is a thesaurus of some kind - and that's not the case, at least, not yet. dm

dbs commented 9 years ago

Hmm. I think it's important to display the keywords because we use them in the search index.

If someone searches for "Labor union members" and they land at http://labourstudies.ca/citation/1601, the general thinking on the information retrieval side of things is that they will be confused if they don't see those words anywhere in the record.

Building a thesaurus from this would eventually be rather nice :)

LBST commented 9 years ago

Okay, let's leave it as is.... Is there an automated way to normalize the words in upper case, i.e., capitalize only the first letter of the word; the upper case shouts for the reader's attention even though it is not warranted.

dm

On Mon, Jun 1, 2015 at 12:40 PM, Dan Scott notifications@github.com wrote:

Hmm. I think it's important to display the keywords because we use them in the search index.

If someone searches for "Labor union members" and they land at http://labourstudies.ca/citation/1601, the general thinking on the information retrieval side of things is that they will be confused if they don't see those words anywhere in the record.

Building a thesaurus from this would eventually be rather nice :)

— Reply to this email directly or view it on GitHub https://github.com/dbs/ris2web/issues/13#issuecomment-107630435.

dbs commented 8 years ago

We actually nuked keywords display as part of issue #25 (yay overlapping issues). It's easy to restore the display of the keywords if so desired. It would be a tiny bit harder to only initial-capitalize words using the |capitalize filter, but it gets much harder to do so selectively if, for example, there are words like 'CAWLS' that would be silly to convert to initial caps...

LBST commented 8 years ago

We need to turn our minds to what a thesaurus might look like. A good start would be to sort out the tags for acronyms and proper names - for example CAW (Canadian Autoworkers) which is now Unifor. Could you do an A to Z download of all the tags to see what we have and identify obvious corrections and merges (capital letter issue) for the Index. I am also wondering if it is feasible to create an authority file for the Zotero index tags, so that we have clean data from the get go.

dm

On Wed, Jan 13, 2016 at 2:16 PM, Dan Scott notifications@github.com wrote:

We actually nuked keywords display as part of issue #25 https://github.com/dbs/ris2web/issues/25 (yay overlapping issues). It's easy to restore the display of the keywords if so desired. It would be a tiny bit harder to only initial-capitalize words using the |capitalize filter, but it gets much harder to do so selectively if, for example, there are words like 'CAWLS' that would be silly to convert to initial caps...

— Reply to this email directly or view it on GitHub https://github.com/dbs/ris2web/issues/13#issuecomment-171402722.