sanyaade-speechtools / delphi-museum-project

Automatically exported from code.google.com/p/delphi-museum-project
0 stars 0 forks source link

Looks like the reduce handling for numbers to "#" is not working. #124

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
Current specification of reduce rules in PAHMA MetadataConfig.xml tries to
simplify some numbers to '#', and this appears not to be working. Need to
work out the correct syntax (between XML, Java, and regexp) to make this work.

Original issue reported on code.google.com by LudicrousResearcher@gmail.com on 4 May 2009 at 4:05

GoogleCodeExporter commented 9 years ago
More details on patterns failing:
"$: .05." : Object 25266
"$200." (and similar variants) : Object 81127, 128, 129, and many more
"$.50." : 120566
"$15.00" : 172301 (note other prices are obfuscated). 

May be a function of the source, not the patterns. Need to think about global
obfuscation (reduce) rules.

Original comment by LudicrousResearcher@gmail.com on 4 Jun 2009 at 10:35

GoogleCodeExporter commented 9 years ago
Likely that it is a function of missing rules on a bunch of sources.
Best bet is to have some replace and elide rules specified on the overall
DBMetadataReader.
These are then passed in to each DBSourceInfo, which (shallow) copies them to 
the
local list. 
Change textRules from an arraylist of Pairs to a new class that maintains the 
list of
pairs, and also does the config scan, and the add functions. Make textRules be 
an
instance of such a class. Move the apply there as well - just pass in a string 
and it
applies all the rules and returns the result.

Original comment by LudicrousResearcher@gmail.com on 10 Jun 2009 at 7:37

GoogleCodeExporter commented 9 years ago
Pushed out to next release.

Original comment by LudicrousResearcher@gmail.com on 23 Jun 2009 at 5:36

GoogleCodeExporter commented 9 years ago
Re-prioritizing for next release.

Original comment by LudicrousResearcher@gmail.com on 23 Jun 2009 at 6:02

GoogleCodeExporter commented 9 years ago

Original comment by michaelb...@gmail.com on 3 Jul 2009 at 7:23

GoogleCodeExporter commented 9 years ago

Original comment by michaelb...@gmail.com on 3 Jul 2009 at 7:41