skip to main content
10.5555/1558109.1558138guideproceedingsArticle/Chapter ViewAbstractPublication PagesaamasConference Proceedingsconference-collections
research-article
Free Access

Improved approximation of interactive dynamic influence diagrams using discriminative model updates

Published:10 May 2009Publication History

ABSTRACT

Interactive dynamic influence diagrams (I-DIDs) are graphical models for sequential decision making in uncertain settings shared by other agents. Algorithms for solving I-DIDs face the challenge of an exponentially growing space of candidate models ascribed to other agents, over time. We formalize the concept of a minimal model set, which facilitates qualitative comparisons between different approximation techniques. We then present a new approximation technique that minimizes the space of candidate models by discriminating between model updates. We empirically demonstrate that our approach improves significantly in performance on the previous clustering based approximation technique.

References

  1. P. Doshi, Y. Zeng, and Q. Chen. Graphical models for interactive pomdps: representations and solutions. JAAMAS, DOI:10.1007/s10458-008-9064-7, 2008. Google ScholarGoogle ScholarCross RefCross Ref
  2. Y. Gal and A. Pfeffer. A language for modeling agent's decision-making processes in games. In AAMAS, pages 265--272, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. P. Gmytrasiewicz and P. Doshi. A framework for sequential planning in multiagent settings. JAIR, 24:49--79, 2005. Google ScholarGoogle ScholarCross RefCross Ref
  4. D. Koller and B. Milch. Multi-agent IDs for representing and solving games. In IJCAI, pages 1027--1034, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. Pineau, G. Gordon, and S. Thrun. Anytime point-based value iteration for large pomdps. JAIR, 27:335--380, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. Pynadath and S. Marsella. Minimal mental models. In AAAI, pages 1038--1044, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. B. Rathnas., P. Doshi, and P. J. Gmytrasiewicz. Exact solutions to interactive pomdps using behavioral equivalence. In AAMAS, pages 1025--1032, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. Seuken and S. Zilberstein. Improved memory bounded dynamic programming for decentralized pomdps. In UAI, pages 2009--2015, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. R. Smallwood and E. Sondik. The optimal control of partially observable markov decision processes over a finite horizon. OR, 21:1071--1088, 1973.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. D. Suryadi and P. Gmytrasiewicz. Learning models of other agents using IDs. In UM, pages 223--232, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. J. A. Tatman and R. D. Shachter. Dynamic programming and influence diagrams. IEEE Trans. SMC, 20(2):365--379, 1990.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Improved approximation of interactive dynamic influence diagrams using discriminative model updates

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image Guide Proceedings
      AAMAS '09: Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
      May 2009
      730 pages
      ISBN:9780981738178

      Publisher

      International Foundation for Autonomous Agents and Multiagent Systems

      Richland, SC

      Publication History

      • Published: 10 May 2009

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,155of5,036submissions,23%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader