skip to main content
10.1145/3303772.3303794acmotherconferencesArticle/Chapter ViewAbstractPublication PageslakConference Proceedingsconference-collections
research-article

Transfer Learning using Representation Learning in Massive Open Online Courses

Authors Info & Claims
Published:04 March 2019Publication History

ABSTRACT

In a Massive Open Online Course (MOOC), predictive models of student behavior can support multiple aspects of learning, including instructor feedback and timely intervention. Ongoing courses, when the student outcomes are yet unknown, must rely on models trained from the historical data of previously offered courses. It is possible to transfer models, but they often have poor prediction performance. One reason is features that inadequately represent predictive attributes common to both courses. We present an automated transductive transfer learning approach that addresses this issue. It relies on problem-agnostic, temporal organization of the MOOC clickstream data, where, for each student, for multiple courses, a set of specific MOOC event types is expressed for each time unit. It consists of two alternative transfer methods based on representation learning with auto-encoders: a passive approach using transductive principal component analysis and an active approach that uses a correlation alignment loss term. With these methods, we investigate the transferability of dropout prediction across similar and dissimilar MOOCs and compare with known methods. Results show improved model transferability and suggest that the methods are capable of automatically learning a feature representation that expresses common predictive characteristics of MOOCs.

References

  1. Andrew Arnold, Ramesh Nallapati, and William W Cohen. 2007. A Comparative Study of Methods for Transductive Transfer Learning. In ICDM Workshops. ICDM, Pittsburgh, PA, USA, 77--82. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Ryan Shaun Baker and Paul Salvador Inventado. 2014. Educational data mining and learning analytics. In Learning analytics. Springer, New York, NY, USA, 61--75.Google ScholarGoogle Scholar
  3. Shai Ben-David, John Blitzer, Koby Crammer, Alex Kulesza, Fernando Pereira, and Jennifer Wortman Vaughan. 2010. A theory of learning from different domains. Machine learning 79, 1-2 (2010), 151--175. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Sebastien Boyer and Kalyan Veeramachaneni. 2015. Transfer Learning for Predictive Models in Massive Open Online Courses. In Artificial Intelligence in Education, Cristina Conati, Neil Heffernan, Antonija Mitrovic, and M. Felisa Verdejo (Eds.). Springer International Publishing, Cambridge, MA, USA, 54--63.Google ScholarGoogle Scholar
  5. Christopher Brooks, Craig Thompson, and Stephanie Teasley. 2015. A time series interaction analysis method for building predictive models of learners using log data. In Proceedings of the fifth international conference on learning analytics and knowledge. ACM, 126--135. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Concepción Burgos, María L. Campanario, David de la Peña, Juan A. Lara, David Lizcano, and María A. Martínez. 2018. Data mining for modeling students' performance: A tutoring action plan to prevent academic dropout. Computers & Electrical Engineering 66 (2018), 541--556. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Devendra Singh Chaplot, Eunhee Rhim, and Jihie Kim. 2015. Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks.. In AIED Workshops (CEUR Workshop Proceedings), Jesus Boticario and Kasia Muldner (Eds.), Vol. 1432. CEUR-WS.org, Seoul, South Korea.Google ScholarGoogle Scholar
  8. Josh Gardner and Christopher Brooks. 2018. Student success prediction in MOOCs. User Modeling and User-Adapted Interaction 28, 2 (2018), 127--203. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Josh Gardner, Christopher Brooks, Juan Miguel Andres, and Ryan Baker. 2018. Replicating MOOC Predictive Models at Scale. In Proceedings of the Fifth Annual ACM Conference on Learning at Scale (L@S '18). ACM, New York, NY, USA, 1:1--1:10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Jiayuan Huang, Alexander J. Smola, Arthur Gretton, Karsten M. Borgwardt, and Bernhard Scholkopf. 2006. Correcting Sample Selection Bias by Unlabeled Data. In Proceedings of the 19th International Conference on Neural Information Processing Systems (NIPS'06). MIT Press, Cambridge, MA, USA, 601--608. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Xin J. Hunt, Ilknur Kaynar Kabul, and Jorge Silva. 2017. Transfer Learning for Education Data. KDD Workshop (2017).Google ScholarGoogle Scholar
  12. Ian T Jolliffe. 1986. Principal component analysis and factor analysis. In Principal component analysis. Springer, Kent, England, UK, 115--128.Google ScholarGoogle Scholar
  13. Marius Kloft, Felix Stiehler, Zhilin Zheng, and Niels Pinkwart. 2014. Predicting MOOC dropout over weeks using machine learning methods. In Proceedings of the EMNLP 2014 Workshop on Analysis of Large Scale Social Interaction in MOOCs. Association for Computational Linguistics, Doha, Qatar, 60--65.Google ScholarGoogle ScholarCross RefCross Ref
  14. Christopher Vu Le, Zachary A. Pardos, Samuel D. Meyer, and Rachel Thorp. 2018. Communication at Scale in a MOOC Using Predictive Engagement Analytics. In Artificial Intelligence in Education - 19th International Conference, AIED 2018, London, UK, June 27-30, 2018, Proceedings, Part I. IJAIED, Berkeley, CA, USA, 239--252.Google ScholarGoogle Scholar
  15. Grégoire Mesnil, Yann Dauphin, Xavier Glorot, Salah Rifai, Yoshua Bengio, Ian Goodfellow, Erick Lavoie, Xavier Muller, Guillaume Desjardins, David Warde-Farley, Pascal Vincent, Aaron Courville, and James Bergstra. 2012. Unsupervised and Transfer Learning Challenge: a Deep Learning Approach. In Proceedings of ICML Workshop on Unsupervised and Transfer Learning (Proceedings of Machine Learning Research), Isabelle Guyon, Gideon Dror, Vincent Lemaire, Graham Taylor, and Daniel Silver (Eds.), Vol. 27. PMLR, Bellevue, Washington, USA, 97--110. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Sinno Jialin Pan and Qiang Yang. 2010. A survey on transfer learning. IEEE Transactions on knowledge and data engineering 22, 10 (2010), 1345--1359. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Baochen Sun, Jiashi Feng, and Kate Saenko. 2016. Return of frustratingly easy domain adaptation. In AAAI, Vol. 6. Palo Alto, CA, USA, 8 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Baochen Sun and Kate Saenko. 2016. Deep coral: Correlation alignment for deep domain adaptation. In European Conference on Computer Vision. Springer, Springer International Publishing, Cambridge, MA, USA, 443--450.Google ScholarGoogle ScholarCross RefCross Ref
  19. Kalyan Veeramachaneni, Una-May O'Reilly, and Colin Taylor. 2014. Towards Feature Engineering at Scale for Data from Massive Open Online Courses. CoRR abs/1407.5238 (2014).Google ScholarGoogle Scholar

Index Terms

  1. Transfer Learning using Representation Learning in Massive Open Online Courses

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          LAK19: Proceedings of the 9th International Conference on Learning Analytics & Knowledge
          March 2019
          565 pages
          ISBN:9781450362566
          DOI:10.1145/3303772

          Copyright © 2019 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 4 March 2019

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited

          Acceptance Rates

          Overall Acceptance Rate236of782submissions,30%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader