Paralinguistic tags and speaking styles are only honored by voices that explicitly support them. For Chinese (zh-CN), support is voice- and style-specific, and not all Dragon HD / Dragon HD Flash voices implement paralinguistics or the same style set.
From the language support tables:
- `zh-CN-Xiaochen:DragonHDFlashLatestNeural` lists supported styles (`cheerful`, `debating`, `empathetic`, `live-commercial`, `poetry-reading`, `sad`, `sorry`) and explicitly notes "Roles: Not supported".
- Other zh-CN voices list different style sets, and only some support roles (for example, `zh-CN-XiaomoNeural` and `zh-CN-YunyeNeural` support roles like `Boy`, `Girl`, and `YoungAdultFemale`).
- The documentation for SSML styles and roles shows that paralinguistics and roles are controlled via the `mstts:express-as` element and its `role` attribute, not by inserting bracketed tokens like `[laughter]` in the plain text.
In the provided SSML, the text uses literal tokens like `[laughter]` instead of SSML elements. For Azure Speech, paralinguistic effects are not triggered by bracketed words; they require supported SSML constructs (for example, `mstts:express-as` with a supported `style` value, or `role` where the voice supports roles). Since `zh-CN-Xiaochen:DragonHDLatestNeural` is not listed with paralinguistic support in the tables and the text does not use SSML paralinguistic elements, the synthesized audio will be identical regardless of the bracketed markers.
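As a quick way to spot this situation before sending a request, the sketch below scans the text content of an SSML document for literal bracketed markers. It is only an illustration: the helper name `find_bracketed_tokens` and the sample SSML are hypothetical (the original SSML from the question is not reproduced here), and it uses only the Python standard library, not the Azure Speech SDK.

```python
import re
import xml.etree.ElementTree as ET

# Matches literal markers such as [laughter]: brackets around a run of
# non-space, non-bracket characters.
BRACKET_TOKEN = re.compile(r"\[[^\[\]\s]+\]")


def find_bracketed_tokens(ssml: str) -> list[str]:
    """Return literal [token] markers found in the spoken text of an SSML string.

    Hypothetical helper for illustration; not part of any Azure SDK.
    """
    root = ET.fromstring(ssml)          # raises ParseError if the SSML is malformed
    text = "".join(root.itertext())     # all text nodes, ignoring tags and attributes
    return BRACKET_TOKEN.findall(text)


# Hypothetical sample resembling the case described above.
sample = (
    '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="zh-CN">'
    '<voice name="zh-CN-Xiaochen:DragonHDLatestNeural">你好 [laughter] 今天天气不错。</voice>'
    "</speak>"
)

print(find_bracketed_tokens(sample))
```

Any marker this reports is plain text inside the `voice` element, which is the pattern this answer describes as having no effect on the output.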
To get audible differences:
- Use a zh-CN voice that lists the desired styles or roles in the language support table (for example, `zh-CN-XiaomoNeural`, `zh-CN-YunyeNeural`, or other zh-CN voices with rich style sets).
- Apply styles via `mstts:express-as` with supported `style` values (such as `cheerful`, `sad`, or `serious`) and, where available, `role` attributes as documented in the SSML customization page.
- Do not rely on bracketed tokens like `[laughter]` in the text; instead, use the documented SSML mechanisms for styles, paralinguistics, and roles.
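The steps above can be sketched as a small Python helper that assembles an SSML string with `mstts:express-as`. This is a minimal sketch under stated assumptions: `build_express_as_ssml` is a hypothetical helper (not an Azure SDK function), the voice, style, and role values are examples that should be checked against the language support table for the voice you actually use, and the namespace URIs follow the form shown in the Azure SSML documentation.

```python
from typing import Optional
from xml.sax.saxutils import escape, quoteattr
import xml.etree.ElementTree as ET


def build_express_as_ssml(
    voice: str,
    text: str,
    style: str,
    role: Optional[str] = None,
    lang: str = "zh-CN",
) -> str:
    """Build an SSML document that wraps `text` in mstts:express-as.

    Hypothetical helper for illustration. It does not verify that the
    voice supports the given style or role; check the support tables.
    """
    # Only emit the role attribute when a role is requested.
    role_attr = f" role={quoteattr(role)}" if role else ""
    return (
        '<speak version="1.0" '
        'xmlns="http://www.w3.org/2001/10/synthesis" '
        'xmlns:mstts="https://www.w3.org/2001/mstts" '
        f"xml:lang={quoteattr(lang)}>"
        f"<voice name={quoteattr(voice)}>"
        f"<mstts:express-as style={quoteattr(style)}{role_attr}>"
        f"{escape(text)}"  # escape &, <, > so the text cannot break the markup
        "</mstts:express-as></voice></speak>"
    )


# Example values taken from the voices and styles mentioned above.
ssml = build_express_as_ssml(
    "zh-CN-XiaomoNeural", "今天真是个好日子!", style="cheerful", role="Girl"
)
ET.fromstring(ssml)  # sanity check: raises ParseError if the SSML is not well-formed
print(ssml)
```

The resulting string can then be passed to whichever synthesis call you already use for SSML input. Building the markup in one place keeps the style and role choices explicit, rather than embedded in the spoken text.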
If a specific Dragon HD voice does not list paralinguistic or role support in the language support tables, style and paralinguistic tags will not change the output for that voice.
References: