Scat singing generation using a versatile speech manipulation system, STRAIGHT. Kawahara, H. & Katayose, H. The Journal of the Acoustical Society of America, 109(5):2425–2426, May 2001.
A set of procedures to generate scat singing by manipulating a small set of seed voices using a speech manipulation system called STRAIGHT [Kawahara et al., Speech Commun. 27, 187–207 (1999)] is proposed. F0 adaptive spectral smoothing based on a second‐order cardinal spline combined with an F0 extractor based on a fixed‐point analysis from filter center frequencies to the output instantaneous frequency enables the STRAIGHT system to generate a highly natural manipulated singing sound. Group delay manipulation for generating the excitation source signal introduces new control flexibility in source characteristics. F0 trajectories are generated using a dynamical model based on F0 feed‐forward control regulated by auditorily mediated feedback [Kawahara et al., Vocal Fold Physiology, edited by H. Fletcher and P. Davis, pp. 263–278 (1996)]. Effects of the spectral interpolation function and interactions between F0 and the spectral envelope are also discussed and demonstrations of a scat chorus are presented. [Work supported by CREST and MEXT, Japan.]
@article{ kawahara_scat_2001,
  title = {Scat singing generation using a versatile speech manipulation system, {STRAIGHT}},
  volume = {109},
  issn = {0001-4966},
  url = {http://scitation.aip.org/content/asa/journal/jasa/109/5/10.1121/1.4744588},
  doi = {10.1121/1.4744588},
  abstract = {A set of procedures to generate scat singing by manipulating a small set of seed voices using a speech manipulation system called {STRAIGHT} [Kawahara et al., Speech Commun. 27, 187–207 (1999)] is proposed. F0 adaptive spectral smoothing based on a second‐order cardinal spline combined with an F0 extractor based on a fixed‐point analysis from filter center frequencies to the output instantaneous frequency enables the {STRAIGHT} system to generate a highly natural manipulated singing sound. Group delay manipulation for generating the excitation source signal introduces new control flexibility in source characteristics. F0 trajectories are generated using a dynamical model based on F0 feed‐forward control regulated by auditorily mediated feedback [Kawahara et al., Vocal Fold Physiology, edited by H. Fletcher and P. Davis, pp. 263–278 (1996)]. Effects of the spectral interpolation function and interactions between F0 and the spectral envelope are also discussed and demonstrations of a scat chorus are presented. [Work supported by {CREST} and {MEXT}, Japan.]},
  number = {5},
  urldate = {2014-08-07},
  journal = {The Journal of the Acoustical Society of America},
  author = {Kawahara, Hideki and Katayose, Haruhiro},
  month = {May},
  year = {2001},
  keywords = {Acoustic analysis, Acoustic spectrum analyzers, Acoustics, Singing, Speech},
  pages = {2425--2426}
}
