Documentos de Académico
Documentos de Profesional
Documentos de Cultura
5.0
The proposed method was mainly designed for spectrum adapta- supervised unsupervised
4.5
tion. Table 1 confirms that the performance of unsupervised adapta- 95% confidence interval
tion is comparable to that of supervised adaptation no matter which 4.0
(4) In S1-M and U1-M, without any explicit mapping rules, the 3.0
Mandarin adaptation data was directly associated with PDFs of the 2.5
English average voice models by prior phonetic knowledge and in an 2.0
ML-based data-driven manner, respectively. This could be regarded 1.5
as an automatic, more precise, mapping process. So S1-M and U1-M 4.7 3.1 2.7 2.6 3.0 1.8 2.5 2.5 2.9
1.0
could be slightly better than S1-D and U1-D in terms of spectrum. vocoder intra *1-D *1-T *1-M
(5) Unfortunately, the great prosody distinction between En-
glish and Mandarin meant F0 adaptation was not nearly as effective. Fig. 3. Naturalness score (speaker Z)