interscript / interscript-ruby

Interoperable script conversion systems (ISCS) with the `interscript` gem
Other
11 stars 30 forks source link

Implement GOST 16876-71:1983 Russian (`gost-rus-cyrl-latn-16876-71-1983`) #294

Closed ronaldtse closed 4 years ago

ronaldtse commented 4 years ago

I was able to find GOST 1983 here: 7th_UNCSGN_econf.91_3_Add1.html.zip

We should Implement GOST 16876-71:1983 Russian (gost-rus-cyrl-latn-16876-71-1983)

Screen Shot 2020-05-28 at 9 55 37 AM Screen Shot 2020-05-28 at 9 55 48 AM Screen Shot 2020-05-28 at 9 59 02 AM Screen Shot 2020-05-28 at 9 57 52 AM

Originally posted by @ronaldtse in https://github.com/interscript/interscript/issues/76#issuecomment-635052191

CAMOBAP commented 4 years ago

@ronaldtse I have found original GOST document here http://vsegost.com/Catalog/45/45002.shtml

I have noted that for some letters it allow to select different results: for example: Щ -> Šč or

If I will compare attached screenshots with http://vsegost.com/Catalog/45/45002.shtml it looks inconsistent, because something's it select the first variant (without brackets) or second (within brackets)

@ronaldtse what is your opinion should we follow https://unstats.un.org/unsd/geoinfo/UNGEGN/docs/7th-uncsgn-docs/econf/7th_UNCSGN_econf.91_3_Add1.pdf or http://vsegost.com/Catalog/45/45002.shtml ?

If you ask me if we need real GOST 16876-71 we should check http://vsegost.com/Catalog/45/45002.shtml

P.S. How do we really know that GOST 1983 and GOST 16876-71 refer to the same document?

ronaldtse commented 4 years ago

P.S. How do we really know that GOST 1983 and GOST 16876-71 refer to the same document?

It was stated in one of the UNGEGN documents.

I think GOST 16876-71 provides two systems, the first has no choice, the second allows choice of the rules.

Have you found any discrepancy between https://unstats.un.org/unsd/geoinfo/UNGEGN/docs/7th-uncsgn-docs/econf/7th_UNCSGN_econf.91_3_Add1.pdf and http://vsegost.com/Catalog/45/45002.shtml ?

CAMOBAP commented 4 years ago

Have you found any discrepancy between

Formally no

But to give you a more complete picture about GOST 16876-71:

Let me confirm again that formally this isn't an issue

CAMOBAP commented 4 years ago

@ronaldtse I have found relation GOST 1983 and GOST 16876-71

16 April 1982 - the year when this GOST was officially introduced 13 May 1983 - contains an updated table and according to it it 100% equal to the one which attached it top comment

Official document https://rosreestr.ru/upload/documenty/doc_169.doc

13 May 1983 edition eliminate pluralism of translation, so staring from it there is exists only one way to 'convert' characters

CAMOBAP commented 4 years ago

PR is ready for review https://github.com/interscript/interscript/pull/296

CAMOBAP commented 4 years ago

Done

ronaldtse commented 4 years ago

Thanks @CAMOBAP !