lvapeab / m4loc

Automatically exported from code.google.com/p/m4loc
GNU Lesser General Public License v3.0

Retain semantically relevant stand-alone tags in tag removal/reinsertion process #45

GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
Unlike paired formatting tags, stand-alone tags typically serve two functions:
1. As placeholders for a named entity (and as such having semantic significance)
2. As an isolated formatting tag spanning two or more segments (less common)

In case 1, the stand-alone tag should be passed through the decoder 
rather than removed and reinserted by the tag removal/reinsertion process.

Example:
Firefox is a good browser.
<x id="1"/> is a good browser.
is a good browser .
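
To make the difference concrete, here is a rough Python sketch of the two behaviours. It is not M4Loc code: the function names, the `__PH<id>__` token format, and the regular expression for the tag shape are all assumptions made for illustration only.

```python
import re

# Assumed tag shape: XLIFF-style stand-alone tags such as <x id="1"/>.
STANDALONE_TAG = re.compile(r'<x\s+id="(\d+)"\s*/>')


def strip_tags(segment):
    """Remove stand-alone tags before decoding (the behaviour described above).

    Returns the cleaned text and the list of (character offset, tag) pairs
    that a later step would have to reinsert.
    """
    removed = [(m.start(), m.group(0)) for m in STANDALONE_TAG.finditer(segment)]
    text = STANDALONE_TAG.sub("", segment)
    # Collapse the whitespace left behind by the removed tags.
    return " ".join(text.split()), removed


def mask_tags(segment):
    """Replace each stand-alone tag with an opaque token (hypothetical format)
    so that the decoder still sees something in the entity's position."""
    mapping = {}

    def repl(match):
        token = f"__PH{match.group(1)}__"
        mapping[token] = match.group(0)
        return token

    return STANDALONE_TAG.sub(repl, segment), mapping


def unmask_tags(translation, mapping):
    """Restore the original tags in the decoder output."""
    for token, tag in mapping.items():
        translation = translation.replace(token, tag)
    return translation


if __name__ == "__main__":
    source = '<x id="1"/> is a good browser.'
    print(strip_tags(source))
    masked, mapping = mask_tags(source)
    print(masked)
    print(unmask_tags(masked, mapping))
```

With stripping, the decoder receives a sentence that has lost its subject; with masking, the subject position is still occupied, provided the decoder is set up to copy the token through untranslated.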

Original issue reported on code.google.com by Achi...@gmail.com on 5 Sep 2013 at 8:35

GoogleCodeExporter commented 9 years ago
Unfortunately, at the point where M4Loc receives the stand-alone tag, there is no 
information or meta-information indicating whether the original tag serves 
function 1 or function 2.

For file formats that contain many isolated tags serving function 1 
(for example, .rc files), the alternatives are to use the tag-fixed option 
(-o t) or the placeholder functionality available in Moses v2.1 or higher 
(http://www.statmt.org/moses/?n=Moses.AdvancedFeatures#ntoc65).

Original comment by Achi...@gmail.com on 8 Sep 2014 at 8:16