skip to main content
10.1145/3125739.3132592acmconferencesArticle/Chapter ViewAbstractPublication PageshaiConference Proceedingsconference-collections
research-article

A Graphical Digital Personal Assistant that Grounds and Learns Autonomously

Published:27 October 2017Publication History

ABSTRACT

We present a speech-driven digital personal assistant that is robust despite little or no training data and autonomously improves as it interacts with users. The system is able to establish and build common ground between itself and users by signaling understanding and by learning a mapping via interaction between the words that users actually speak and the system actions. We evaluated our system with real users and found an overall positive response. We further show through objective measures that autonomous learning improves performance in a simple itinerary filling task.

References

  1. Gregory Aist, James Allen, Ellen Campana, Lucian Galescu, Carlos Gallo, Scott Stoness, Mary Swift, and Michael Tanenhaus. 2006. Software architectures for incremental understanding of human speech. In Proceedings of CSLP. 1922-1925.Google ScholarGoogle Scholar
  2. Gregory Aist, James Allen, Ellen Campana, Carlos Gomez Gallo, Scott Stoness, and Mary Swift. 2007. Incremental understanding in human-computer dialogue and experimental evidence for advantages over nonincremental methods. In Pragmatics, Vol. 1. Trento, Italy, 149--154.Google ScholarGoogle Scholar
  3. Layla El Asri, Romain Laroche, Olivier Pietquin, and Hatim Khouzaimi. 2014. NASTIA:Negotiating Appointment Setting Interface. In Proceedings of LREC. 266--271.Google ScholarGoogle Scholar
  4. Joyce Y Chai, Lanbo She, Rui Fang, Spencer Ottarson, Cody Littley, Changsong Liu, and Kenneth Hanson. 2014. Collaborative effort towards common ground in situated human-robot dialogue. In Proceedings of the 2014 ACM/IEEE international conference on Human-robot interaction. Bielefeld, Germany, 33--40.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Herbert H. Clark and Edward F. Schaefer. 1989. Contributing to discourse. Cognitive Science 13, 2 (1989), 259--294. Google ScholarGoogle ScholarCross RefCross Ref
  6. Nina Dethlefs, Helen Hastie, Heriberto Cuayáhuitl, Yanchao Yu, Verena Rieser, and Oliver Lemon. 2016. Information density and overlap in spoken dialogue. Computer Speech and Language 37 (2016), 82--97. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Jens Edlund, Joakim Gustafson, Mattias Heldner, and Anna Hjalmarsson. 2008. Towards human-like spoken dialogue systems. Speech Communication 50, 8--9 (2008), 630--645.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Julian Hough and David Schlangen. 2017. A Model of Continuous Intention Grounding for HRI. In Proceedings of The Role of Intentions in Human-Robot Interaction Workshop.Google ScholarGoogle Scholar
  9. Casey Kennington and David Schlangen. 2016. Supporting Spoken Assistant Systems with a Graphical User Interface that Signals Incremental Understanding and Prediction State. In Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue. Association for Computational Linguistics, Los Angeles, 242--251. Google ScholarGoogle ScholarCross RefCross Ref
  10. Casey Kennington and David Schlangen. 2017. A Simple Generative Model of Incremental Reference Resolution in Situated Dialogue. Computer Speech & Language (2017).Google ScholarGoogle Scholar
  11. Geert-Jan M Kruijff. 2012. There is no common ground in human-robot interaction. In Proceedings of SemDial.Google ScholarGoogle Scholar
  12. Pierre Lison. 2015. A hybrid approach to dialogue management based on probabilistic rules. Computer Speech and Language 34, 1 (2015), 232--255. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Chansong Lui, Rui Fang, and Joyce Yue Chai. 2012. Towards Mediating Shared Perceptual Basis in Situated Dialogue. In Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue. Association for Computational Linguistics, Seoul, South Korea, 140--149.Google ScholarGoogle Scholar
  14. Raveesh Meena, Gabriel Skantze, and Joakim Gustafson. 2014. Data-driven models for timing feedback responses in a Map Task dialogue system. In Computer Speech and Language, Vol. 28. Association for Computational Linguistics, Metz, France, 903--922. Google ScholarGoogle ScholarCross RefCross Ref
  15. David Schlangen and Gabriel Skantze. 2011. A General, Abstract Model of Incremental Dialogue Processing. In Dialogue & Discourse, Vol. 2. 83--111. Google ScholarGoogle ScholarCross RefCross Ref
  16. Gabriel Skantze and Anna Hjalmarsson. 1991. Towards Incremental Speech Production in Dialogue Systems. In Word Journal Of The International Linguistic Association. Tokyo, Japan, 1--8.Google ScholarGoogle Scholar
  17. Gabriel Skantze and David Schlangen. 2009. Incremental dialogue processing in a micro-domain. Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics on EACL 09 April (2009), 745--753.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Michael J. Spivey, Michael K. Tanenhaus, Kathleen M. Eberhard, and Julie C. Sedivy. 2002. Eye movements and spoken language comprehension:Effects of visual context on syntactic ambiguity resolution. Cognitive Psychology 45, 4 (2002), 447--481. Google ScholarGoogle ScholarCross RefCross Ref
  19. Michael Tanenhaus, Michael Spivey-Knowlton, Kathleen Eberhard, and Julie Sedivy. 1995. Integration of visual and linguistic information in spoken language comprehension. Science (New York, N.Y.) 268, 5217 (1995), 1632--1634. Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. A Graphical Digital Personal Assistant that Grounds and Learns Autonomously

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      HAI '17: Proceedings of the 5th International Conference on Human Agent Interaction
      October 2017
      550 pages
      ISBN:9781450351133
      DOI:10.1145/3125739

      Copyright © 2017 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 27 October 2017

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate121of404submissions,30%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader