k4rthikr / unimrcp

Automatically exported from code.google.com/p/unimrcp
0 stars 0 forks source link

prosody-volume and prosody-rate as numbers&relative changes as well as labels #18

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
Please provide the detailed description of the feature being requested and
outline the use cases.

I am entering information only for prosody-volume but it applies also for
prosody-rate.

-------------------------------------------------------

MRCP v2 IETF draft spec. version 17 (
http://tools.ietf.org/html/draft-ietf-speechsc-mrcpv2-17 ) says that:

8.4.7. Prosody-Parameters

   This set of headers defines the prosody of the speech.

   prosody-parameter   =   "Prosody-" prosody-param-name ":"

                           prosody-param-value CRLF

   prosody-param-name is any one of the attribute names under the
   prosody element specified in W3C's Speech Synthesis Markup Language

And W3C's SSML says that ( http://www.w3.org/TR/speech-synthesis/#S3.2.4 )

volume: the volume for the contained text in the range 0.0 to 100.0 (higher
values are louder and specifying a value of zero is equivalent to
specifying "silent"). Legal values are: number, a relative change or
"silent", "x-soft", "soft", "medium", "loud", "x-loud", or "default". The
volume scale is linear amplitude. The default is 100.0. Labels "silent"
through "x-loud" represent a sequence of monotonically non-decreasing
volume levels.

UniMRCP currently supports those values to be given as labels but not
numbers or relative changes.

--------------------------------------------

What version of the product are you using? On what operating system?

UniMRCP 0.4.0 - Windows XP Professional SP3

Original issue reported on code.google.com by bayrambo...@gmail.com on 6 Mar 2009 at 10:23

GoogleCodeExporter commented 9 years ago

Original comment by achalo...@gmail.com on 6 Mar 2009 at 10:48

GoogleCodeExporter commented 9 years ago
I have made the changes for prosody rate and volume.

w3c ssml says

prosody-rate can be either a relative change or a label. and relative change 
for rate
is a number and number is a simple positive floating point value without
exponentials. but relative change of volume is a number preceded by "+" or "-".

prosody-volume can be either a numeric value, which is a number, a relative 
change,
which is a number preceded by "+" or "-", or a label.

I am pretty sure I did everything right about rate_parse and volume_parse 
functions,
but I am not sure about generate functions.

Please check and feel free to offer any corrections.

See what a "number" means http://www.w3.org/TR/speech-synthesis/#number_values
See what a "relative change" means 
http://www.w3.org/TR/speech-synthesis/#relative_values

Original comment by bayrambo...@gmail.com on 10 Mar 2009 at 7:30

Attachments:

GoogleCodeExporter commented 9 years ago
Thanks. I'll look into this hopefully tonight.

Original comment by achalo...@gmail.com on 10 Mar 2009 at 8:35

GoogleCodeExporter commented 9 years ago
I've just committed your modifications to trunk (r843).

I made only a few modifications over your version (just some minor formatting
changes). Overall it looks exactly the way I expected. I tested both parse and
generation and both just work for all of the cases.
The only concern what the default case should be in if -> else if -> else chain 
in
parser and generator routine. Should we have else (default) case at all? Anyway 
this
is not going to be an issue just some thoughts.

BTW, I mainly tested it with tester application (tests/mrcptest). There are v1 
and v2
test messages. You can easily put any message in v1 or v2 dir and run mrcptest 
to
process (parse -> generate).

Original comment by achalo...@gmail.com on 10 Mar 2009 at 8:24

GoogleCodeExporter commented 9 years ago

Original comment by achalo...@gmail.com on 20 May 2009 at 6:41