research-article

Finding Similar Exercises in Online Education Systems

Authors:
Qi Liu

University of Science and Technology of China, Hefei, China

University of Science and Technology of China, Hefei, China
View Profile

,
Zai Huang

University of Science and Technology of China, Hefei, China

University of Science and Technology of China, Hefei, China
View Profile

,
Zhenya Huang

University of Science and Technology of China, Hefei, China

University of Science and Technology of China, Hefei, China
View Profile

,
Chuanren Liu

Drexel University, Philadelphia, USA

Drexel University, Philadelphia, USA
View Profile

,
Enhong Chen

University of Science and Technology of China, Hefei, China

University of Science and Technology of China, Hefei, China
View Profile

,
Yu Su

Anhui University, Hefei, China

Anhui University, Hefei, China
View Profile

,
Guoping Hu

iFLYTEK Research, Hefei, China

iFLYTEK Research, Hefei, China
View Profile

KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data MiningJuly 2018Pages 1821–1830https://doi.org/10.1145/3219819.3219960

Published:19 July 2018Publication History

KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pages 1821–1830

ABSTRACT

In online education systems, finding similar exercises is a fundamental task of many applications, such as exercise retrieval and student modeling. Several approaches have been proposed for this task by simply using the specific textual content (e.g. the same knowledge concepts or the similar words) in exercises. However, the problem of how to systematically exploit the rich semantic information embedded in multiple heterogenous data (e.g. texts and images) to precisely retrieve similar exercises remains pretty much open. To this end, in this paper, we develop a novel Multimodal Attention-based Neural Network (MANN) framework for finding similar exercises in large-scale online education systems by learning a unified semantic representation from the heterogenous data. In MANN, given exercises with texts, images and knowledge concepts, we first apply a convolutional neural network to extract image representations and use an embedding layer for representing concepts. Then, we design an attention-based long short-term memory network to learn a unified semantic representation of each exercise in a multimodal way. Here, two attention strategies are proposed to capture the associations of texts and images, texts and knowledge concepts, respectively. Moreover, with a Similarity Attention, the similar parts in each exercise pair are also measured. Finally, we develop a pairwise training strategy for returning similar exercises. Extensive experimental results on real-world data clearly validate the effectiveness and the interpretation power of MANN.

Supplemental Material

liu_finding_education.mp4

mp4

331.4 MB

Download

References

Mart'ın Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, et almbox. . 2016. Tensorflow: Large-scale machine learning on heterogeneous distributed systems. arXiv preprint arXiv:1603.04467 (2016).Google ScholarDigital Library
Hicham HAGE Esma AIMEUR . 2005. Exam question recommender system. Artificial Intelligence in Education: Supporting Learning Through Intelligent and Socially Informed Technology Vol. 125 (2005), 249. Google ScholarDigital Library
Yue Cao, Mingsheng Long, Jianmin Wang, Qiang Yang, and Philip S Yu . 2016. Deep visual-semantic hashing for cross-modal retrieval Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1445--1454. Google ScholarDigital Library
Yuying Chen, Qi Liu, Zhenya Huang, Le Wu, Enhong Chen, Runze Wu, Yu Su, and Guoping Hu . 2017. Tracking Knowledge Proficiency of Students with Educational Priors ACM International on Conference on Information and Knowledge Management. ACM, 989--998. Google ScholarDigital Library
Peng Cui, Shifei Jin, Linyun Yu, Fei Wang, Wenwu Zhu, and Shiqiang Yang . 2013. Cascading outbreak prediction in networks: a data-driven approach Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 901--909. Google ScholarDigital Library
Peng Cui, Shaowei Liu, and Wenwu Zhu . 2018. General Knowledge Embedded Image Representation Learning. IEEE Transactions on Multimedia Vol. 20, 1 (2018), 198--207. Google ScholarDigital Library
Teresa del Solato and Benedict Du Boulay . 1995. Implementation of motivational tactics in tutoring systems. Journal of Interactive Learning Research Vol. 6, 4 (1995), 337. Google ScholarDigital Library
Alex Graves . 2013. Generating sequences with recurrent neural networks. arXiv preprint arXiv:1308.0850 (2013).Google Scholar
Hicham Hage and E Aimeru . 2006. ICE: A system for identification of conflicts in exams Computer Systems and Applications, 2006. IEEE International Conference on. IEEE, 980--987. Google ScholarDigital Library
Robert Hecht-Nielsen . 1989. Theory of the backpropagation neural network. In Neural Networks, 1989. IJCNN., International Joint Conference on. IEEE, 593--605.Google ScholarCross Ref
Sepp Hochreiter and Jürgen Schmidhuber . 1997. Long short-term memory. Neural computation Vol. 9, 8 (1997), 1735--1780. Google ScholarDigital Library
Andrew G Howard . 2013. Some improvements on deep convolutional neural network based image classification. arXiv preprint arXiv:1312.5402 (2013).Google Scholar
Zhenya Huang, Qi Liu, Enhong Chen, Hongke Zhao, Mingyong Gao, Si Wei, Yu Su, and Guoping Hu . 2017. Question Difficulty Prediction for READING Problems in Standard Tests. Thirty-First AAAI Conference on Artificial Intelligence. 1352--1359.Google Scholar
Andrej Karpathy, Armand Joulin, and Fei Fei F Li . 2014. Deep fragment embeddings for bidirectional image sentence mapping Advances in neural information processing systems. 1889--1897. Google ScholarDigital Library
Diederik Kingma and Jimmy Ba . 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).Google Scholar
Vlasta Kokol-Voljc . 2000. Exam Questions When Using CAS for School Mathematics Teaching. Algebra Vol. 7 (2000), 13.Google Scholar
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton . 2012. Imagenet classification with deep convolutional neural networks Advances in neural information processing systems. 1097--1105. Google ScholarDigital Library
Johan Lithner . 2004. Mathematical reasoning in calculus textbook exercises. The Journal of Mathematical Behavior Vol. 23, 4 (2004), 405--427.Google ScholarCross Ref
Qi Liu, Runze Wu, Enhong Chen, Guandong Xu, Yu Su, Zhigang Chen, and Guoping Hu . 2018. Fuzzy cognitive diagnosis for modelling examinee performance. ACM Transactions on Intelligent Systems and Technology (TIST) Vol. 9, 4 (2018), 48. Google ScholarDigital Library
Yuping Liu, Qi Liu, Runze Wu, Enhong Chen, Yu Su, Zhigang Chen, and Guoping Hu . 2016. Collaborative learning team formation: a cognitive modeling perspective International Conference on Database Systems for Advanced Applications. Springer, 383--400. Google ScholarDigital Library
Lin Ma, Zhengdong Lu, and Hang Li . 2016. Learning to Answer Questions from Image Using Convolutional Neural Network. Thirtieth AAAI Conference on Artificial Intelligence. 3567--3573. Google ScholarDigital Library
Iaroslav Melekhov, Juho Kannala, and Esa Rahtu . 2016. Siamese network features for image matching. In International Conference on Pattern Recognition. IEEE, 378--383.Google ScholarCross Ref
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean . 2013. Distributed representations of words and phrases and their compositionality Advances in neural information processing systems. 3111--3119. Google ScholarDigital Library
Youssef Mroueh, Etienne Marcheret, and Vaibhava Goel . 2015. Deep multimodal learning for audio-visual speech recognition IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 2130--2134.Google Scholar
Jonas Mueller and Aditya Thyagarajan . 2016. Siamese Recurrent Architectures for Learning Sentence Similarity. Thirtieth AAAI Conference on Artificial Intelligence. 2786--2792. Google ScholarDigital Library
Zachary A Pardos and Anant Dadu . 2017. Imputing KCs with representations of problem content and context Proceedings of the 25th Conference on User Modeling, Adaptation and Personalization. ACM, 148--155. Google ScholarDigital Library
Cesc C Park and Gunhee Kim . 2015. Expressing an image stream with a sequence of natural sentences Advances in neural information processing systems. 73--81. Google ScholarDigital Library
Razvan Pascanu, Tomas Mikolov, and Yoshua Bengio . 2013. On the difficulty of training recurrent neural networks International Conference on Machine Learning. 1310--1318. Google ScholarDigital Library
Jir'ı Rihák and Radek Pelánek . 2017. Measuring Similarity of Educational Items Using Data on Learners' Performance Proceedings of the 10th International Conference on Educational Data Mining. 16--23.Google Scholar
Shuo Shang, Ruogu Ding, Bo Yuan, Kexin Xie, Kai Zheng, and Panos Kalnis . 2012. User oriented trajectory search for trip recommendation Proceedings of the 15th International Conference on Extending Database Technology. ACM, 156--167. Google ScholarDigital Library
Shuo Shang, Jiajun Liu, Kun Zhao, Mingrui Yang, Kai Zheng, and Jirong Wen . 2015. Dimension reduction with meta object-groups for efficient image retrieval. Neurocomputing Vol. 169 (2015), 50--54.Google ScholarCross Ref
Mohammad E Shiri, A Esma A"ımeur, and Claude Frasson . 1998. Student modelling by case based Reasoning. In International Conference on Intelligent Tutoring Systems. Springer, 394--403. Google ScholarDigital Library
Nitish Srivastava, Geoffrey E Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov . 2014. Dropout: a simple way to prevent neural networks from overfitting. Journal of machine learning research Vol. 15, 1 (2014), 1929--1958. Google ScholarDigital Library
Armin Stahl . 2006. Combining Case-Based and Similarity-Based Product Recommendation European Conference on Advances in Case-Based Reasoning. 355--369. Google ScholarDigital Library
Yu Su, Qingwen Liu, Qi Liu, Zhenya Huang, Yu Yin, Enhong Chen, Chris Ding, Si Wei, and Guoping Hu . 2018. Exercise-Enhanced Sequential Modeling for Student Performance Prediction Thirty-Second AAAI Conference on Artificial Intelligence. 2435--2443.Google Scholar
Avgoustos Tsinakos and Ioannis Kazanidis . 2012. Identification of conflicting questions in the PARES system. The International Review of Research in Open and Distributed Learning Vol. 13, 3 (2012), 297--313.Google ScholarCross Ref
Adrienne E Williams, Nancy M Aguilar-Roca, Michelle Tsai, Matthew Wong, Marin Moravec Beaupré, and Diane K O'Dowd . 2011. Assessment of learning gains associated with independent exam analysis in introductory biology. CBE-Life Sciences Education Vol. 10, 4 (2011), 346--356.Google ScholarCross Ref
Le Wu, Yong Ge, Qi Liu, Enhong Chen, Richang Hong, Junping Du, and Meng Wang . 2017 a. Modeling the evolution of users' preferences and social links in social networking services. IEEE Transactions on Knowledge and Data Engineering Vol. 29, 6 (2017), 1240--1253. Google ScholarDigital Library
Le Wu, Yong Ge, Qi Liu, Enhong Chen, Bai Long, and Zhenya Huang . 2016. Modeling users' preferences and social links in Social Networking Services: a joint-evolving perspective. In Thirtieth AAAI Conference on Artificial Intelligence. 279--286. Google ScholarDigital Library
Runze Wu, Qi Liu, Yuping Liu, Enhong Chen, Yu Su, Zhigang Chen, and Guoping Hu . 2015. Cognitive Modelling for Predicting Examinee Performance. International Joint Conferences on Artificial Intelligence. 1017--1024. Google ScholarDigital Library
Runze Wu, Guandong Xu, Enhong Chen, Qi Liu, and Wan Ng . 2017 b. Knowledge or Gaming?: Cognitive Modelling Based on Multiple-Attempt Response Proceedings of the 26th International Conference on World Wide Web Companion. International World Wide Web Conferences Steering Committee, 321--329. Google ScholarDigital Library
Ran Xu, Caiming Xiong, Wei Chen, and Jason J Corso . 2015. Jointly Modeling Deep Video and Compositional Text to Bridge Vision and Language in a Unified Framework.. In Twenty-Ninth AAAI Conference on Artificial Intelligence. 2346--2352. Google ScholarDigital Library
Wenpeng Yin, Hinrich Schütze, Bing Xiang, and Bowen Zhou . 2016. ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs. Transactions of the Association for Computational Linguistics Vol. 4 (2016), 259--272.Google ScholarCross Ref
Jing Yu, Dongmei Li, Jiajia Hou, Ying Liu, and Zhaoying Yang . 2014. Similarity Measure of Test Questions Based on Ontology and VSM. Open Automation and Control Systems Journal Vol. 6 (2014), 262--267.Google ScholarCross Ref
Matthew D Zeiler, Dilip Krishnan, Graham W Taylor, and Rob Fergus . 2010. Deconvolutional networks. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. IEEE, 2528--2535.Google ScholarCross Ref
Jiani Zhang, Xingjian Shi, Irwin King, and Dit-Yan Yeung . 2017. Dynamic Key-Value Memory Networks for Knowledge Tracing Proceedings of the 26th International Conference on World Wide Web. 765--774. Google ScholarDigital Library
Hengshu Zhu, Hui Xiong, Yong Ge, and Enhong Chen . 2014. Mobile app recommendations with security and privacy awareness Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 951--960. Google ScholarDigital Library
Tianyu Zhu, Qi Liu, Zhenya Huang, Enhong Chen, Defu Lian, Yu Su, and Guoping Hu . 2018. MT-MCD: A Multi-task Cognitive Diagnosis Framework for Student Assessment International Conference on Database Systems for Advanced Applications. Springer, 318--335.Google Scholar

Index Terms

Finding Similar Exercises in Online Education Systems
1. Applied computing
  1. Education
    1. E-learning
2. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
      1. Neural networks

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
July 2018
2925 pages
ISBN:9781450355520
DOI:10.1145/3219819
General Chairs:
Yike Guo
Imperial College London
,
Faisal Farooq
IBM
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 19 July 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
heterogenous data
online education systems
similar exercises
Qualifiers
- research-article
Conference

Acceptance Rates
KDD '18 Paper Acceptance Rate107of983submissions,11%Overall Acceptance Rate1,133of8,635submissions,13%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 47
  Total Citations
  View Citations
- 1,360
  Total Downloads
- Downloads (Last 12 months)52
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Finding Similar Exercises in Online Education Systems

KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

ABSTRACT

Supplemental Material

References

Cited By

Index Terms

Recommendations

AutoCAD Exercises

CAD Exercises

150 CAD Exercises

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Finding Similar Exercises in Online Education Systems

KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

ABSTRACT

Supplemental Material

References

Cited By

Index Terms

Recommendations

AutoCAD Exercises

CAD Exercises

150 CAD Exercises

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media