Thinking deeply to make better speech

Published: 21 February 2017

Abstract

More work is needed to make synthesized speech more natural, easier to understand, and more pleasant to hear.

References

  1. Van den Oord, A., Dieleman, S., Zen, H., Simonyan, K., Vinyals, O., Graves, A., and Kalchbrenner, N. WaveNet: A Generative Model for Raw Audio, ArXiv, Cornell University Library, 2016. http://arxiv.org/pdf/1609.03499
  2. King, S., and Karaiskos, V. The Blizzard Challenge 2016, Blizzard Challenge Workshop, Sept. 2016, Cupertino, CA. http://www.festvox.org/blizzard/bc2016/blizzard2016_overview_paper.pdf
  3. Arnela, M., Dabbaghchian, S., Blandin, R., Guasch, O., Engwall, O., Van Hirtum, A., and Pelorson, X. Influence of vocal tract geometry simplifications on the numerical simulation of vowel sounds, Journal of the Acoustical Society of America, 140, 2016.
  4. Deng, L., Li, J., Huang, J-T., Yao, K., Yu, D., Seide, F., Seltzer, M., Zweig, G., He, X., Williams, J., Gong, Y., and Acero, A. Recent advances in deep learning for speech research at Microsoft, IEEE International Conference on Acoustics, Speech and Signal Processing, 2013. http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=6639345
  5. King, S. Using Speech Synthesis to Give Everyone Their Own Voice. https://www.youtube.com/watch?v=xzLpxcpo-E

      • Published in

        Communications of the ACM, Volume 60, Issue 3
        March 2017
        89 pages
        ISSN:0001-0782
        EISSN:1557-7317
        DOI:10.1145/3055102
        • Editor: Moshe Y. Vardi

        Copyright © 2017 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States
