DOI: 10.1145/3290607.3310422
research-article

Siri, Echo and Performance: You have to Suffer Darling

Published: 02 May 2019

ABSTRACT

Don't ignore this because it's about speech technology. VUIs (voice user interfaces) won a best paper award at CHI 2018. Did that get your attention? Good. Siri, Ivona, Google Home, and most speech synthesis systems have voices that are based on imitating a neutral citation style of speech and making it sound natural. But in the real world, darling, people have to act, to perform! In this paper we will talk about speech synthesis as performance, why the uncanny valley is a bankrupt concept, and how academics can escape from studying corporate speech technology as if it's been bestowed by God.

CHI'19 Extended Abstracts, May 4-9, 2019, Glasgow, Scotland, UK

    • Published in

      CHI EA '19: Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems
      May 2019
      3673 pages
      ISBN: 9781450359719
      DOI: 10.1145/3290607

      Copyright © 2019 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Acceptance Rates

      Overall acceptance rate: 6,164 of 23,696 submissions, 26%
