ThinkUpLLC / ThinkUp

ThinkUp gives you insights into your social networking activity on Twitter, Facebook, Instagram, and beyond.
http://thinkup.com
GNU General Public License v3.0
3.3k stars 676 forks source link

New Insight: New dictionary-word count #2057

Closed adampash closed 10 years ago

adampash commented 10 years ago

One-liner

New dictionary-word counts.

Full explainer

Periodically, the Oxford English Dictionary adds new words to its dictionary. This happens quarterly for online version of the OED. For this general insight, we should keep an eye on the new list when it comes out, cherry pick the most interesting word(s), and define an insight that counts the # of times the user has previously used the word(s) in posts on social media.

Audience for the insight

Works on Twitter, Facebook, and Instagram. Ideally this insight would aggregate them all into one insight.

How often this insight runs

We'll run this insight one time per user for a month each quarter when we choose our new set of words. Say OED releases their update on April 15th, and we push the insight with a new set of words by the 20th, the insight can run for any users who don't have it from April 20th to May 20th.

In that case the logic would be something like: if it's before May 20th and the user does not have an oed-wordlist-april-2014 baseline, then run it. Seems like "just added" in the headline makes sense for a dictionary update that happened 1 to 2 months ago.

if %total_words == 1

E.g.:

if %total_words > 1

E.g.:

In both, the text "just been added"/"just added" should link either to the announcement post from OED's blog or to the definition on OED.

Criteria and logic

The words TU should check for in this first run are:

baller binge-watch clickbait cray dox FML hot mess humblebrag ICYMI mansplain neckbeard side-eye SMH subtweet YOLO

This insight needs to do a word search across all of the user's tweets/posts/comments. (Basically anything the user has written herself on any platform.)

Yes. It should not count text of shares/retweets.

No baseline.

If this is a multiple-word insight (i.e., we've chosen more than one new dictionary word to highlight), the %word_list should only include words that the user has used. So potentially a multi-word version of this could collapse to a single-word version if the user has only used one of the words.

The count must be > 0 for at least 1 word.

Included elements

I know it might be a little overkill to hero image this one, but it could be fun — and I think we need more hero images! Here are a couple of options/ideas:

neologism This is just a screenshot of the "neologism" definition in the OS X Dictionary.

dictionary Generic pic of a dictionary

adampash commented 10 years ago

@anildash Does the OED quarterly update make sense to you as the peg for this? I mean, it's the online dictionary, which people can turn their nose up at in terms of how meaningful the addition of a word is, but also, it's not like anyone uses the dead-tree OED.

ginatrapani commented 10 years ago

Looking good to me so far. Questions:

adampash commented 10 years ago

What's the emphasis? Looks like we listed both medium and high here.

Ah, I forgot this one used the new template so I added the insight by the headline, too. With the sense of how emphasis susses out now, I'd say high.

What exact words should we cherry-pick for this insight the first time it runs?

I think we should pick at most 5. Here's the latest list. My gut says:

Here are the words I think are most prime from the latest additions:

We could either search against all of them and then use at most 5, or we could put 'em to a vote.

Should we use the new headline format (ie, without punctuation, standalone, etc)?

Ah yeah, I updated but didn't kill the punctuation. Beyond that, I like the simple "before "%most_used_word" went legit" — which I know isn't 100% inline with the style guide, but I feel like it's close, and a good exception. But if you don't like it, we can go with something more like:

When would this run the first time?

Depends. Editorially, it would be great to run it next week if we want to use the latest list, but I don't know if that's too tight. Idea is basically that we'd give it a new list of words every quarter when they release the new list.

Should we develop it so it's easy to swap in another array of words on next OED update?

Definitely!

Any special requirements around that? Word list, and.... link to update news? Anything else?

Nope, I think that's it. Then we'd just have to put it on the calendar so we remember to get those words. Normally they do it quarterly, and release mid-month. We could potentially reach out to them, too, and ask them for a heads up beforehand.

ginatrapani commented 10 years ago

Looks great. I love your word list, no vote needed.

My last question is about scheduling logic... if they release this update once a quarter, I think we can get away with running this insight for a month. Say the update comes out on April 15th, and we push the insight by the 20th, I think it could run for any users who don't have it from April 20th to May 20th.

In that case the logic would be something like: if it's before May 20th and the user does not have an oed-wordlist-april-2014 baseline, then run it. Seems like "just added" makes sense for a dictionary update that happened 1 to 2 months ago. Sound reasonable?

adampash commented 10 years ago

I think we can get away with running this insight for a month.

Ha, I think I need to shake some old habits — I'm thinking too much like we're a daily editorial joint. Of course that one month window is fine. That sounds very reasonable.