MichaelChirico / r-bugs

A ⚠️read-only⚠️mirror of https://bugs.r-project.org/
20 stars 0 forks source link

[BUGZILLA #14891] Unreadable messages - package.skeleton() - ko_KR.utf8 #4480

Closed MichaelChirico closed 4 years ago

MichaelChirico commented 4 years ago

## Dear R Development Core Team,

## I am reporting the unreadable messages when 'package.skeleton()' is used.

## Demonstration: ## First, I am showing the correct messages on the locale of 'en_US.utf8'.

chel@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>-laptop∼$ R --slave foo <- function(height, width){ return(height*width) } package.skeleton(name="foo", force=TRUE) Creating directories ... Creating DESCRIPTION ... Creating NAMESPACE ... Creating Read-and-delete-me ... Saving functions and data ... Making help files ... Done. Further steps are described in './foo/Read-and-delete-me'.

## Now, I am changing the locale to "ko_KR.utf8"

Sys.setlocale("LC_MESSAGES", "ko_KR.utf8") [1] "ko_KR.utf8"

package.skeleton(name="foo", force=TRUE) ?????? ??????... DESCRIPTION ? ??? ... NAMESPACE ? ??? ... Read-and-delete-me ? ???... ???? ???? ??? ... ??? ???? ??? ?... ??. ???? ??? './foo/Read-and-delete-me' ?? ?? ? ????

## Here is the session information

sessionInfo() R version 2.15.0 (2012-03-30) Platform: i486-pc-linux-gnu (32-bit)

locale: [1] LC_CTYPE=en_US.utf8 LC_NUMERIC=C
[3] LC_TIME=en_US.utf8 LC_COLLATE=en_US.utf8
[5] LC_MONETARY=en_US.utf8 LC_MESSAGES=ko_KR.utf8
[7] LC_PAPER=C LC_NAME=C
[9] LC_ADDRESS=C LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.utf8 LC_IDENTIFICATION=C

attached base packages: [1] stats graphics grDevices utils datasets methods base

## Please feel free to contact me anytime via <gnustats@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>> or ## <chl948@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>> if you have any questions.

Chel Hee Lee


METADATA

MichaelChirico commented 4 years ago

## This is the additional report regarding the identical issue with ## package.skeleton() on the Windows 7.

## Probelm Description: ## 1. Since the color of printed messages is 'white', nothing seems to be ## printed in the console ## 2. When the locale is changed to 'English' from 'Korean', it turns out that ## the printed messages are broken. ## ## Tested on: ## Windows 7 Home Edition K ## R-2.15.0 ## ## Please see the following history

sessionInfo()

R version 2.15.0 (2012-03-30) Platform: i386-pc-mingw32/i386 (32-bit)

locale: [1] LC_COLLATE=Korean_Korea.949 LC_CTYPE=Korean_Korea.949
[3] LC_MONETARY=Korean_Korea.949 LC_NUMERIC=C
[5] LC_TIME=Korean_Korea.949

attached base packages: [1] stats graphics grDevices utils datasets methods base

foo <- function(height, width){

+ return(height*width) + }

package.skeleton(name="foo", force=TRUE)

(Nothing to be shown here; however, there are messages)

## Now, I am changing the locale. Then, the messages, which were not shown ## before, are suddenly came out with unreadable texts as the below:

package.skeleton(name="foo", force=TRUE)

ëë í ë¦¬ë¤ì ìì±ì¤ìëë¤... DESCRIPTION ì ìì±ì¤ ... NAMESPACE 를 ìì±ì¤ ... Read-and-delete-me 를 ìì±ì¤... í¨ìë¤ê³¼ ë°ì´í°ë¥¼ ì ì¥ì¤ ... ëìë§ íì¼ë¤ì ë§ëë ì¤... ìë£. ì¶ê°ì ì¸ ì¤ëªì './foo/Read-and-delete-me' ìì ì°¾ì ì ììµëë¤

Sys.setlocale("LC_ALL", "English")

[1] "LC_COLLATE=English_United States.1252;LC_CTYPE=English_United States.1252;LC_MONETARY=English_United States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252"

## Should I report this issue separately?

## Do you need the screen capture?

## Please feel free to contact me anytime via <gnustats@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>> or ## <chl948@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>> if you have any questions.

Chel Hee Lee


METADATA

MichaelChirico commented 4 years ago

On 12-04-22 2:17 PM, r-bugs@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>-project.org wrote:

https://bugs.r-project.org/bugzilla3/show_bug.cgi?id=14891

Chel Hee Lee<gnustats@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>>  changed:

What    |Removed                     |Added
----------------------------------------------------------------------------
CC|                            |gnustats@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>

--- Comment #1 from Chel Hee Lee<gnustats@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>>  2012-04-22 14:17:09 EDT
---
## This is the additional report regarding the identical issue with
## package.skeleton() on the Windows 7.
##
## Probelm Description:
## 1. Since the color of printed messages is 'white', nothing seems to be
## printed in the console
## 2. When the locale is changed to 'English' from 'Korean', it turns out
that
## the printed messages are broken.
##
## Tested on:
## Windows 7 Home Edition K
## R-2.15.0
##
## Please see the following history

> sessionInfo()
R version 2.15.0 (2012-03-30)
Platform: i386-pc-mingw32/i386 (32-bit)

locale:
[1] LC_COLLATE=Korean_Korea.949  LC_CTYPE=Korean_Korea.949
[3] LC_MONETARY=Korean_Korea.949 LC_NUMERIC=C
[5] LC_TIME=Korean_Korea.949

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base

> foo<- function(height, width){
+     return(height*width)
+ }
> package.skeleton(name="foo", force=TRUE)

(Nothing to be shown here; however, there are messages)

>

## Now, I am changing the locale. Then, the messages, which were not shown
## before, are suddenly came out with unreadable texts as the below:

> package.skeleton(name="foo", force=TRUE)
디ë 䒠 리들ì„ ìƒÂ성중입니다...
DESCRIPTION ì„ ìƒÂ성중 ...
NAMESPACE 를 ìƒÂ성중 ...
Read-and-delete-me 를 ìƒÂ성중...
함수들과 ë°ì´터를 ì €ìž¥ì¤‘ ...
ë„움맠파ì¼들ì„ 만드는 중...
완료.
추가ì Âì¸ 설명ì€ './foo/Read-and-delete-me'
ì—Â서 ì°¾ì„ 수
있습니다
> Sys.setlocale("LC_ALL", "English")
[1] "LC_COLLATE=English_United States.1252;LC_CTYPE=English_United
States.1252;LC_MONETARY=English_United
States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252"
>

## Should I report this issue separately?
##
## Do you need the screen capture?
##
## Please feel free to contact me anytime via<gnustats@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>>  or
##<chl948@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>>  if you have any questions.

This is sufficient information, and it looks like the same bug. The messages appear to be displayed in the wrong encoding.

I don't think I'll be able to fix this because I can't reproduce it; we need someone who uses a system that has the problem to track it down.

Duncan Murdoch


METADATA

MichaelChirico commented 4 years ago

Yes, I understand. I will do make the better Korean .po files first. Thank you, SIR! Please feel free to contact me any times if you need any helps which are related to Korean.

Chel Hee Lee


METADATA

MichaelChirico commented 4 years ago

I cannot reproduce this in current R-patched.


METADATA

MichaelChirico commented 4 years ago

## Dear professor Brian Ripley,

## The problem still remains. I hope that the following report is useful.

## Procedure: ## 1. Demonstration in English (Unresolved) ## 2. Demonstration in Korean (Unresolved)

## Testing environment: ## Ubuntu 12.04 LTS (released on April 26, 2012).

## Patch information:

gnustats@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>∼/Desktop/r215/bin$ ./R --version R version 2.15.0 Patched (2012-04-30 r59291) -- "Easter Beagle" Copyright (C) 2012 The R Foundation for Statistical Computing ISBN 3-900051-07-0 Platform: i686-pc-linux-gnu (32-bit)

---------------------------------------------------------------- ## Part 1. in English - Showing the locale of platform ---------------------------------------------------------------- gnustats@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>∼/Desktop/r215/bin$ locale LANG=en_US.UTF-8 LANGUAGE=ko:en LC_CTYPE="en_US.UTF-8" LC_NUMERIC=en_US.UTF-8 LC_TIME=en_US.UTF-8 LC_COLLATE="en_US.UTF-8" LC_MONETARY=en_US.UTF-8 LC_MESSAGES="en_US.UTF-8" LC_PAPER=en_US.UTF-8 LC_NAME=en_US.UTF-8 LC_ADDRESS=en_US.UTF-8 LC_TELEPHONE=en_US.UTF-8 LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=en_US.UTF-8 LC_ALL=

---------------------------------------------------------------- ## Part 1. in English - Showing the locale within R system ---------------------------------------------------------------- gnustats@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>∼/Desktop/r215/bin$ ./R --slave

sessionInfo() R version 2.15.0 Patched (2012-04-30 r59291) Platform: i686-pc-linux-gnu (32-bit)

locale: [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
[3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
[5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
[7] LC_PAPER=C LC_NAME=C
[9] LC_ADDRESS=C LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C

attached base packages: [1] stats graphics grDevices utils datasets methods base

foo <- function(h,w){ return(h*w) }

setwd(∼/Desktop")

package.skeleton(name="foo", force=TRUE) ?????? ??????... DESCRIPTION ? ??? ... NAMESPACE ? ??? ... Read-and-delete-me ? ???... ???? ???? ??? ... ??? ???? ??? ?... ??. ???? ??? './foo/Read-and-delete-me' ?? ?? ? ????

q()

---------------------------------------------------------------- ## Part 2. in Korean - Showing the locale of platform ---------------------------------------------------------------- gnustats@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>∼/Desktop/r215/bin$ export LANG=ko_KR.UTF-8 gnustats@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>∼/Desktop/r215/bin$ locale LANG=ko_KR.UTF-8 LANGUAGE=ko:en LC_CTYPE="ko_KR.UTF-8" LC_NUMERIC=en_US.UTF-8 LC_TIME=en_US.UTF-8 LC_COLLATE="ko_KR.UTF-8" LC_MONETARY=en_US.UTF-8 LC_MESSAGES="ko_KR.UTF-8" LC_PAPER=en_US.UTF-8 LC_NAME=en_US.UTF-8 LC_ADDRESS=en_US.UTF-8 LC_TELEPHONE=en_US.UTF-8 LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=en_US.UTF-8 LC_ALL=

---------------------------------------------------------------- ## Part 2. in Korean - Showing the locale within R system ---------------------------------------------------------------- gnustats@<::CENSORED -- SEE ORIGINAL ON BUGZILLA::>∼/Desktop/r215/bin$ ./R --slave sessionInfo() R version 2.15.0 Patched (2012-04-30 r59291) Platform: i686-pc-linux-gnu (32-bit)

locale: [1] LC_CTYPE=ko_KR.UTF-8 LC_NUMERIC=C
[3] LC_TIME=en_US.UTF-8 LC_COLLATE=ko_KR.UTF-8
[5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=ko_KR.UTF-8
[7] LC_PAPER=C LC_NAME=C
[9] LC_ADDRESS=C LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C

attached base packages: [1] stats graphics grDevices utils datasets methods base

setwd(∼/Desktop") foo <- function(h,w){ return(h*w) } package.skeleton(name="foo", force=TRUE) ?????? ??????... DESCRIPTION ? ??? ... NAMESPACE ? ??? ... Read-and-delete-me ? ???... ???? ???? ??? ... ??? ???? ??? ?... ??. ???? ??? './foo/Read-and-delete-me' ?? ?? ? ????

q() ----------------------------------------------------------------

## Comment: ## I will find another computers tomorrow to test


METADATA

MichaelChirico commented 4 years ago

We cannot reproduce this on Linux. We could reproduce it on Windows in a CJK locale.

The issue is that package.skeleton sets LC_CTYPE=C to be able to check for portable file names. Then messaages have to be ouput for that locale. For me Linux uses English messages as it cannot represent the Korean ones: on Windows it seems iconv does not actually report that they cannot be translated and so they are translated and then represented by question marks.

Resolved by never translating these messages.


METADATA

MichaelChirico commented 4 years ago

I very appreciate for your great concerns with my report.


METADATA