Bibliography

Next: About this document ... Up: Unit Selection and Emotional Previous: Acknowledgements

Bibliography

1: M. Beutnagel, A. Conkie, J. Schroeter, Y. Stylianou, and A. Syrdal,
``The AT&T Next-Gen TTS system,''
in Joint Meeting of ASA, EAA, and DAGA, Berlin, Germany, 1999, pp. 18-24.
2: A. Hunt and A. Black,
``Unit selection in a concatenative speech synthesis system using a large speech database,''
in ICASSP-96, Atlanta, Georgia, 1996, vol. 1, pp. 373-376.
3: E. Klabbers and R. Veldhuis,
``On the reduction of concatenation artefacts in diphone synthesis,''
in ICSLP98, Sydney, Australia., 1998, pp. 1983-1986.
4: A. Black,
``Perfect synthesis for all of the people all of the time,''
in IEEE 2002 Workshop on Speech Synthesis, Santa Monica, CA., 2002.
5: A. Black and K. Lenzo,
``Limited domain synthesis,''
in ICSLP2000, Beijing, China., 2000, vol. II, pp. 411-414.
6: A. Black and K. Lenzo,
``Optimal data selection for unit selection synthesis,''
in 4th ESCA Workshop on Speech Synthesis, Scotland., 2001.
7: A. Black, P. Taylor, and R. Caley,
``The Festival speech synthesis system,''
http://festvox.org/festival, 1998.
8: R. Sproat, A. Hunt, M. Ostendorf, P. Taylor, A. Black, K. Lenzo, and M. Edgington,
``SABLE: A standard for TTS markup,''
in International Conference on Spoken Language Processing, Sydney, Australia, 1998.
9: M. Hart,
``Project Gutenberg,''
http://promo.net/pg/, 2000.
10: A. Black and P. Taylor,
``Automatically clustering similar units for unit selection in speech synthesis,''
in Eurospeech97, Rhodes, Greece, 1997, vol. 2, pp. 601-604.
11: M. Ostendorf, P. Price, and S. Shattuck-Hufnagel,
``The Boston University Radio News Corpus,''
Tech. Rep. ECS-95-001, Electrical, Computer and Systems Engineering Department, Boston University, Boston, MA, 1995.
12: A. Black, R. Brown, R. Frederking, R. Singh, J. Moody, and E. Steinbrecher,
``Tongues: Rapid development of a speech-to-speech translation system,''
in HLT2002, San Diego, California, 2002, pp. 2051-2054.
13: K. Lenzo and A. Black,
``Customized synthesis: blending and tiering,''
in AVIOS2002, San Jose, CA., 2002.
14: H. Kawai and M. Tsuzaki,
``A study of time-dependent voice quality variation in a large-scale single speaker speech corpus used for speech synthesis.,''
in IEEE 2002 Workshop on Speech Synthesis, Santa Monica, CA., 2002.
15: N. Campbell,
``Towards a grammar of spoken language: Incorporating paralinguistic information,''
in ICSLP2002, Denver, CO., 2002.
16: S. Pan,
Learning Intonation Rules for Concept-to-Speech Generation,
Ph.D. thesis, Columbia University, 1998.
17: K. Silverman, M. Beckman, J. Pitrelli, M. Ostendorf, C. Wightman, P. Price, J. Pierrehumbert, and J. Hirschberg,
``ToBI: a standard for labelling English prosody.,''
in Proceedings of ICSLP92, 1992, vol. 2, pp. 867-870.
18: F. Malfrere, T. Dutoit, and P. Mertens,
``Automatic prosody generation using suprasegmental unit selection.,''
in Proc. ESCA Workshop on Speech Synthesis, Australia., 1998, pp. 323-327.

Alan W Black 2003-09-07