pnewall closed 3 years ago
It looks as though there's a problem with the apostrophe ("a bachelor degree in *" works ok). I will look into it.
Actually, I can't get the string to work directly in the Google ngram viewer either: https://books.google.com/ngrams/graph?content=a+bachelor%27s+degree+in+%2A Can you please post the URL of your successful query?
Actually, I think I know what's happening. The ngram viewer tokenises "bachelor's" to "bachelor 's" (two tokens), which takes the ngram length over 5. It must check the length before tokenising, so the error is confusing. Try "bachelor 's degree in *" (without the "a").
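To make the token count concrete, here is a rough sketch in Python. It is not the package's code or Google's; the `tokenise` and `fits` helpers and the regex are assumptions that only mimic the behaviour described above (the possessive 's split off as its own token, the wildcard counted as a token, and a limit of 5 tokens).

```python
import re

MAX_NGRAM_TOKENS = 5  # longest ngram the viewer accepts, per the discussion above


def tokenise(phrase):
    """Approximate the viewer's tokenisation (an assumption, not Google's rules):
    detach a possessive 's from the preceding word, then split on whitespace."""
    spaced = re.sub(r"(\w)'s\b", r"\1 's", phrase)
    return spaced.split()


def fits(phrase):
    """True if the phrase is within the 5-token limit after tokenisation."""
    return len(tokenise(phrase)) <= MAX_NGRAM_TOKENS


for query in ["a bachelor degree in *",
              "a bachelor's degree in *",
              "bachelor 's degree in *"]:
    tokens = tokenise(query)
    print(len(tokens), tokens, "ok" if fits(query) else "too long")
```

Under these assumptions the first and third queries come to 5 tokens each, while "a bachelor's degree in *" comes to 6, which matches what we see.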
Thanks Sean - looks like you're right
  Year                  Phrase    Frequency   Corpus                Parent
1 1950 bachelor 's degree in a 5.060552e-09 eng_2019 bachelor 's degree in
2 1951 bachelor 's degree in a 4.616514e-09 eng_2019 bachelor 's degree in
3 1952 bachelor 's degree in a 4.642724e-09 eng_2019 bachelor 's degree in
4 1953 bachelor 's degree in a 4.591766e-09 eng_2019 bachelor 's degree in
5 1954 bachelor 's degree in a 5.609661e-09 eng_2019 bachelor 's degree in
6 1955 bachelor 's degree in a 5.323854e-09 eng_2019 bachelor 's degree in
plus output from plot
Excellent!
Hi,
In 1.5.x, the syntax below worked just fine for me
where ip is a string so that ging could be (say) "a bachelor's degree in *".
If I go to Google & the ngram viewer, this string will return results.
But from R, with 1.7.x of the package, I'm getting the warning
The characters +, -, *, / require parentheses to be interpreted as a composition.
I have since tried "(a bachelor's degree in) *", "((a bachelor's degree in) *)" & others, but without success.