paulrougieux / FAOSTATpackage

The full FAOSTAT package including build

R CMD check: checking data for non-ASCII characters found 179 marked UTF-8 strings #3

Open paulrougieux opened 4 years ago

paulrougieux commented 4 years ago
* checking data for non-ASCII characters ... NOTE
  Note: found 179 marked UTF-8 strings

Here are the characters that are causing problems in each data file:

> tools:::showNonASCIIfile("data/FAOcountryProfile.RData")
6: <80>
11: X<80>
13: <80>
17: <80>
106: Ant<c3><a1>rtida
110: Islas Caim<c3><a1>n
116: Kasajst<c3><a1>n
124: Pa<c3><ad>ses Bajos
128: Sud<c3><a1>frica
130: Turkmenist<c3><a1>n
> tools:::showNonASCIIfile("data/FAOmetaTable.RData")
349: <fa>2
350: <fa>3
351: <fa><95>
352: <fa><96>
353: <fa><97>
354: <fa><fa>
355: <fb>^
358: <ba>
359: <bb>
> tools:::showNonASCIIfile("data/FAOregionProfile.RData")
6: <80>
11: X<80>
13: <80>
17: <80>
108: Ant<c3><a1>rtida
112: Islas Caim<c3><a1>n
118: Kasajst<c3><a1>n
126: Pa<c3><ad>ses Bajos
130: Sud<c3><a1>frica
132: Turkmenist<c3><a1>n

paulrougieux commented 4 years ago

Checking one vector at a time is easier to understand:

> load("data/FAOregionProfile.RData")
> tools::showNonASCII(FAOregionProfile$OFFICIAL_FAO_NAME)
2: <c3><85>land Islands
66: the Republic of C<c3><b4>te d'Ivoire
69: Cura<c3><a7>ao
205: R<c3><a9>union
209: Saint Barth<c3><a9>lemy
> FAOregionProfile$OFFICIAL_FAO_NAME[c(2,66, 69, 205, 209)]
[1] "Åland Islands"                 "the Republic of Côte d'Ivoire" "Curaçao"                       "Réunion"                       "Saint Barthélemy"
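The same check can be run over every column at once. The sketch below counts the strings that R has marked as UTF-8 in each character or factor column of a data frame, which is roughly what the "marked UTF-8 strings" note refers to. The helper name `count_marked_utf8` and the small `demo` data frame are hypothetical and only stand in for the real package data.

```r
# Hypothetical helper: count strings marked as UTF-8 in each character or
# factor column of a data frame. Other column types count as 0.
count_marked_utf8 <- function(df) {
  vapply(df, function(col) {
    if (is.factor(col)) col <- levels(col)   # factors store their text in levels
    if (!is.character(col)) return(0L)
    sum(Encoding(col) == "UTF-8")            # pure-ASCII strings are "unknown"
  }, integer(1))
}

# Stand-in data (an assumption, not the real FAOregionProfile)
demo <- data.frame(
  name = c("R\u00e9union", "Cura\u00e7ao", "France"),
  code = c(1L, 2L, 3L),
  stringsAsFactors = FALSE
)

counts <- count_marked_utf8(demo)
print(counts)
```

With the real data, something like `count_marked_utf8(FAOregionProfile)` after the `load()` call above should show which columns contribute to the total reported by R CMD check.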