Translation won't work if string contains & codes.

pySilver commented 9 years ago

Extraction & futher translation of the string like this one, works fine:

This is somethign &amp; else

But for string like the following one extraction would work, but translation would not:

This is a string with &rarr; an arrow!

Workaround for now would be using special symbols directly in a source code:

This is a string with → an arrow!

So clearly there is a problem with &xxx; symbols.

gabegorelick commented 9 years ago

@pySilver How exactly is translation not working? Are you using the translate directive, filter, or the JS API? If possible, can you post some code?

pySilver commented 9 years ago

@gabegorelick I'm using translate directive

This p would work (both extraction and translation)

<p translate>
  Configure your website by navigating to <br> <strong>Ad Tools → Configure Contextual Ads</strong>.
</p>

this p translation woul be extracted but translation won't be applied even if it exists

<p translate>
  Configure your website by navigating to <br> <strong>Ad Tools &rarr; Configure Contextual Ads</strong>.
</p>

the only difference here is html entity → vs →

rubenv commented 9 years ago

Yup, known problem. I strongly recommend using UTF-8 instead of escaped entities for now. Every computer on the planet supports it nowadays.

gabegorelick commented 9 years ago

@rubenv What's the source of this bug? It seems very strange to me.

rubenv commented 9 years ago

I have no idea. There's an easy way to work around it (which is also much nicer to translators), so we never bothered looking into it. Just use UTF-8 :-).

zeckson commented 9 years ago

I found out that some angular template compilers convert '&', '>', '<', etc. in '&', '&gt', '&lt' ant this converted string will be passed to angular-gettext, which will be failed to match, since strings are collected on original file. In my case there was gulp html2js plugin. So there are 3 possible solutions to workaround this problem:

Fix this symbols in original templates
Do not convert these symbols in your template compiler
Fetch angular-gettext from compiled files (which is not possible by the way)

michi88 commented 9 years ago

I think this is a bug and should work: <h1 translate>me & you</h1>. It get's extracted but not translated.

But... if I add <h1 translate>me & you</h1> and also extract and translate then they both work. :)

simon04 commented 8 years ago

The "use UTF-8 character instead" workaround is unhandy if the character is not visible or not distinguishable from another character. For instance, U+00AD (soft hyphen) has no visual representation, but permits hyphenation in HTML.

I used the following workaround:

<span ng-init="shy = '\u00ad'">
<span translate>foo{{shy}}bar</span>
</span>

Carl-David commented 8 years ago

Special characters like & inside a label cannot be translated in IE/Edge because here, the DOM is used to return the label key, which will return it encoded as &. Is there any workaround for this?

adambullmer commented 6 years ago

Ran across this caveat myself today. The problem, as I noticed it, is that when compiling strings, the text as it was written is passed to the template.pot html entities and all.

The resulting .po file also has those entities as originally authored.

In use, the translate directive uses javascript to get the html of the element it was applied to, and sends it to the compile function for processing. At this time, the html entities have been converted into their plain text counter part. Then the lookup for a translation string fails as the msgid is different in the compiled translations and what is being used at that time.

I prayed to google for possible answers on how, with javascript, to get raw contents of an element without interpreting special characters, and it seems this is a limitation of how javascript is able to access this information.

I think a possible workaround may be to manually call the gettextCatalog.getString() method with your string in code in your controller. But that sounds truly awful.

A possible resolution this package could do is interpret these html entities when compiling strings to prepare the msgid to have the literal character in it, which sounds moderately better, but still not ideal as it limits compatibility of the characters you can use.

rubenv / angular-gettext

Translation won't work if string contains & codes. #139