trinker / qdapDictionaries

Word lists used by the qdap package.

Spurious Characters in adverb, preposition and weak.words files #3

Open sonofeft opened 8 years ago

sonofeft commented 8 years ago

In poking around, I found some "?" characters and a ">" character embedded in words. Note that I converted the rda files to csv files. (Hopefully that conversion is not the problem.)

Note also that the R command write.csv(NAMES_LIST, file="NAMES_LIST.csv") failed for the NAMES_LIST.rda file, although it succeeded for every other rda file.

Illegal Char "?" A?RIALLY adverb.csv
Illegal Char "?" CO?PERATIVELY adverb.csv
Illegal Char "?" CO?RDINATELY adverb.csv
Illegal Char "?" DOS-?-DOS adverb.csv
Illegal Char "?" ENZO?TICALLY adverb.csv
Illegal Char "?" EPIZO?TICALLY adverb.csv
Illegal Char "?" MODERNIS?TICALLY adverb.csv
Illegal Char "?" NA?VELY adverb.csv
Illegal Char "?" PAV? adverb.csv
Illegal Char "?" R?NTGENOGRAPHICALLY adverb.csv
Illegal Char "?" UNNA?VELY adverb.csv
Illegal Char "?" MALGR? preposition.csv
Illegal Char ">" MYSELF> weak.words.csv
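For anyone who wants to reproduce a report like the one above, here is a minimal Python sketch of such a scan. It is not the script that produced the listing. The allowed character set (ASCII letters, hyphen, apostrophe, space) and the names ALLOWED and find_illegal_chars are assumptions made for illustration, and the word list is hard-coded rather than read from the exported csv files.

```python
import string

# Assumption: a "legal" entry uses only ASCII letters plus a few joiners.
ALLOWED = set(string.ascii_letters + "-' ")


def find_illegal_chars(words, source):
    """Return one (char, word, source) tuple per disallowed character."""
    hits = []
    for word in words:
        for ch in word:
            if ch not in ALLOWED:
                hits.append((ch, word, source))
    return hits


# Hypothetical sample; a real run would load the words from each csv file.
sample = ["NAIVELY", "NA?VELY", "MYSELF>"]
for ch, word, source in find_illegal_chars(sample, "sample.csv"):
    print(f'Illegal Char "{ch}" {word} {source}')
```

A word with several bad characters is reported once per character, which matches how DOS-?-DOS and PAV? each appear once in the listing. If the dictionaries legitimately contain digits or periods, the allowed set would need to be widened.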

trinker commented 8 years ago

Thank you for the feedback. These were special characters not handled by R that I missed. I think I will either remove these observations or replace the characters with ASCII equivalents where appropriate.
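One way the replacement with ASCII characters could be done is sketched below in Python. This is not how the package was actually fixed. It assumes the original entries are still available with their accented letters intact (for example NAÏVELY or MALGRÉ), since a "?" that is already in the data cannot be mapped back to a letter. The helper name to_ascii is hypothetical.

```python
import unicodedata


def to_ascii(word):
    """Strip diacritics by decomposing each character and dropping non-ASCII."""
    # NFKD splits an accented letter into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFKD", word)
    # Encoding with errors="ignore" drops the marks and anything else non-ASCII.
    return decomposed.encode("ascii", "ignore").decode("ascii")


# Assumed original spellings, used here only as examples.
for word in ["NAÏVELY", "MALGRÉ", "PAVÉ"]:
    print(word, "->", to_ascii(word))
```

Characters with no ASCII decomposition are silently dropped by this approach, so the output should be reviewed by hand. A stray symbol such as the ">" in MYSELF> is already ASCII and would need a separate cleanup step.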