skip to main content
research-article

Iterative Training of Dynamic Skills Inspired by Human Coaching Techniques

Published:29 December 2014Publication History
Skip Abstract Section

Abstract

Inspired by how humans learn dynamic motor skills through a progressive process of coaching and practices, we introduce an intuitive and interactive framework for developing dynamic controllers. The user only needs to provide a primitive initial controller and high-level, human-readable instructions as if s/he is coaching a human trainee, while the character has the ability to interpret the abstract instructions, accumulate the knowledge from the coach, and improve its skill iteratively. We introduce “control rigs” as an intermediate layer of control module to facilitate the mapping between high-level instructions and low-level control variables. Control rigs also utilize the human coach's knowledge to reduce the search space for control optimization. In addition, we develop a new sampling-based optimization method, Covariance Matrix Adaptation with Classification (CMA-C), to efficiently compute-control rig parameters. Based on the observation of human ability to “learn from failure”, CMA-C utilizes the failed simulation trials to approximate an infeasible region in the space of control rig parameters, resulting a faster convergence for the CMA optimization. We demonstrate the design process of complex dynamic controllers using our framework, including precision jumps, turnaround jumps, monkey vaults, drop-and-rolls, and wall-backflips.

Skip Supplemental Material Section

Supplemental Material

References

  1. M. A. Borno, M. De Lasa, and A. Hertzmann. 2013. Trajectory optimization for full-body movements with complex contacts. IEEE Trans. Vis. Comput. Graph. 19, 8, 1405--1414. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. S. Coros, P. Beaudoin, and M. Van De Panne. 2010. Generalized biped walking control. ACM Trans. Graph. 29, 130:1--130:9. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Coros, A. Karpathy, B. Jones, L. Reveret, and M. Van De Panne. 2011. Locomotion skills for simulated quadrupeds. ACM Trans. Graph. 30, 4. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Da Silva, F. Durand, and J. Popovic. 2009. Linear bellman combination for control of character animation. ACM Trans. Graph 28, 3. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. P. Faloutsos, M. Van De Panne, and D. Terzopoulos. 2001. Composable controllers for physics-based character animation. In Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH'01). 251--260. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. C. Fang and N. S. Pollard. 2003. Efficient synthesis of physically valid human motion. ACM Trans. Graph. 22, 3, 417--426. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Ha, Y. Ye, and C. K. Liu. 2012. Falling and landing motion control for character animation. ACM Trans. Graph 31, 6, 155. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. N. Hansen and S. Kern. 2004. Evaluating the cma evolution strategy on multimodal test functions. In Proceedings of the 8th International Conference on Parallel Problem Solving from Nature (PPSN'04). Lecture Notes in Computer Science, vol. 3242, Springer, 282--291.Google ScholarGoogle ScholarCross RefCross Ref
  9. J. K. Hodgins, W. L. Wooten, D. C. Brogan, and J. F. O'Brien. 1995. Animating human athletics. In Proceedings of the 22nd Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH'95). 71--78. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M. D. Lasa, I. Mordatch, and A. Hertzmann. 2010. Feature-based locomotion controllers. ACM Trans. Graph. 29, 4. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Y. Lee, S. Kim, and J. Lee. 2010. Data-driven biped control. ACM Trans. Graph. 29, 4. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. C. K. Liu, and Z. Popovic. 2002. Synthesis of complex dynamic character motion from simple animations. ACM Trans. Graph. 21, 3. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. L. Liu, K. Yin, M. Van De Panne, and B. Guo. 2012. Terrain runner: Control, parameterization, composition, and planning for highly dynamic motions. ACM Trans. Graph 31, 6. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. L. Liu, K. Yin, M. Van De Panne, T. Shao, and W. Xu. 2010. Sampling-based contact-rich motion control. ACM Trans. Graph 29, 4. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. I. Mordatch, M. De Lasa, and A. Hertzmann. 2010. Robust physics-based locomotion using low-dimensional planning. ACM Trans. Graph. 29, 71:1--71:8. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. U. Muico, J. Popovic, and Z. Popovic. 2011. Composite control of physically simulated characters. ACM Trans. Graph 30, 3. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Z. Popovic, and A. Witkin. 1999. Physically based motion transformation. In Proceedings of the 26th Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH'99). 11--20. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Rtql8. 2012. http://bitbucket.org/karenliu/rtql8.Google ScholarGoogle Scholar
  19. A. Safonova, J. K. Hodgins, and N. S. Pollard. 2004. Synthesizing physically realistic human motion in low-dimensional, behavior-specific spaces. ACM Trans. Graph. 23, 3, 514--521. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. K. W. Sok, M. Kim, and J. Lee. 2007. Simulating biped behaviors from human motion data. ACM Trans. Graph 26, 3. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. C. Sunada, D. Argaez, S. Dubowsky, and C. Mavroidis. 1994. A coordinated jacobian transpose control for mobile multi-limbed robotic systems. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA'94). 1910--1915.Google ScholarGoogle Scholar
  22. J. M. Wang, D. J. Fleet, and A. Hertzmann. 2009. Optimizing walking controllers. ACM Trans. Graph 28, 5. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. M. Wang, D. J. Fleet, and A. Hertzmann. 2010. Optimizing walking controllers for uncertain inputs and environments. ACM Trans. Graph 29, 4. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. J. M. Wang, S. R. Hammer, S. L. Delp, and V. Koltun. 2012. Optimizing locomotion controllers using biologically-based actuators and objectives. ACM Trans. Graph 31, 4. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. W. L. Wooten. 1998. Simulation of leaping, tumbling, landing, and balancing humans. https://smartech.gatech.edu/bitstream/handle/1853/3466/98-21.pdf. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. J.-C. Wu and Z. Popovic. 2010. Terrain-adaptive bipedal locomotion control. ACM Trans. Graph. 29, 72:1--72:10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. K. Yin, S. Coros, P. Beaudoin, and M. Van De Panne. 2008. Continuation methods for adapting simulated skills. ACM Trans. Graph 27, 3. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. K. Yin, K. Loken, and M. Van De Panne. 2007. Simbicon: Simple biped locomotion control. ACM Trans. Graph. 26, 3. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Iterative Training of Dynamic Skills Inspired by Human Coaching Techniques

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Graphics
      ACM Transactions on Graphics  Volume 34, Issue 1
      November 2014
      153 pages
      ISSN:0730-0301
      EISSN:1557-7368
      DOI:10.1145/2702692
      Issue’s Table of Contents

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 29 December 2014
      • Accepted: 1 April 2014
      • Revised: 1 February 2014
      • Received: 1 July 2013
      Published in tog Volume 34, Issue 1

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader