ABSTRACT
In online education systems, finding similar exercises is a fundamental task of many applications, such as exercise retrieval and student modeling. Several approaches have been proposed for this task by simply using the specific textual content (e.g. the same knowledge concepts or the similar words) in exercises. However, the problem of how to systematically exploit the rich semantic information embedded in multiple heterogenous data (e.g. texts and images) to precisely retrieve similar exercises remains pretty much open. To this end, in this paper, we develop a novel Multimodal Attention-based Neural Network (MANN) framework for finding similar exercises in large-scale online education systems by learning a unified semantic representation from the heterogenous data. In MANN, given exercises with texts, images and knowledge concepts, we first apply a convolutional neural network to extract image representations and use an embedding layer for representing concepts. Then, we design an attention-based long short-term memory network to learn a unified semantic representation of each exercise in a multimodal way. Here, two attention strategies are proposed to capture the associations of texts and images, texts and knowledge concepts, respectively. Moreover, with a Similarity Attention, the similar parts in each exercise pair are also measured. Finally, we develop a pairwise training strategy for returning similar exercises. Extensive experimental results on real-world data clearly validate the effectiveness and the interpretation power of MANN.
Supplemental Material
- Mart'ın Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, et almbox. . 2016. Tensorflow: Large-scale machine learning on heterogeneous distributed systems. arXiv preprint arXiv:1603.04467 (2016).Google ScholarDigital Library
- Hicham HAGE Esma AIMEUR . 2005. Exam question recommender system. Artificial Intelligence in Education: Supporting Learning Through Intelligent and Socially Informed Technology Vol. 125 (2005), 249. Google ScholarDigital Library
- Yue Cao, Mingsheng Long, Jianmin Wang, Qiang Yang, and Philip S Yu . 2016. Deep visual-semantic hashing for cross-modal retrieval Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1445--1454. Google ScholarDigital Library
- Yuying Chen, Qi Liu, Zhenya Huang, Le Wu, Enhong Chen, Runze Wu, Yu Su, and Guoping Hu . 2017. Tracking Knowledge Proficiency of Students with Educational Priors ACM International on Conference on Information and Knowledge Management. ACM, 989--998. Google ScholarDigital Library
- Peng Cui, Shifei Jin, Linyun Yu, Fei Wang, Wenwu Zhu, and Shiqiang Yang . 2013. Cascading outbreak prediction in networks: a data-driven approach Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 901--909. Google ScholarDigital Library
- Peng Cui, Shaowei Liu, and Wenwu Zhu . 2018. General Knowledge Embedded Image Representation Learning. IEEE Transactions on Multimedia Vol. 20, 1 (2018), 198--207. Google ScholarDigital Library
- Teresa del Solato and Benedict Du Boulay . 1995. Implementation of motivational tactics in tutoring systems. Journal of Interactive Learning Research Vol. 6, 4 (1995), 337. Google ScholarDigital Library
- Alex Graves . 2013. Generating sequences with recurrent neural networks. arXiv preprint arXiv:1308.0850 (2013).Google Scholar
- Hicham Hage and E Aimeru . 2006. ICE: A system for identification of conflicts in exams Computer Systems and Applications, 2006. IEEE International Conference on. IEEE, 980--987. Google ScholarDigital Library
- Robert Hecht-Nielsen . 1989. Theory of the backpropagation neural network. In Neural Networks, 1989. IJCNN., International Joint Conference on. IEEE, 593--605.Google ScholarCross Ref
- Sepp Hochreiter and Jürgen Schmidhuber . 1997. Long short-term memory. Neural computation Vol. 9, 8 (1997), 1735--1780. Google ScholarDigital Library
- Andrew G Howard . 2013. Some improvements on deep convolutional neural network based image classification. arXiv preprint arXiv:1312.5402 (2013).Google Scholar
- Zhenya Huang, Qi Liu, Enhong Chen, Hongke Zhao, Mingyong Gao, Si Wei, Yu Su, and Guoping Hu . 2017. Question Difficulty Prediction for READING Problems in Standard Tests. Thirty-First AAAI Conference on Artificial Intelligence. 1352--1359.Google Scholar
- Andrej Karpathy, Armand Joulin, and Fei Fei F Li . 2014. Deep fragment embeddings for bidirectional image sentence mapping Advances in neural information processing systems. 1889--1897. Google ScholarDigital Library
- Diederik Kingma and Jimmy Ba . 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).Google Scholar
- Vlasta Kokol-Voljc . 2000. Exam Questions When Using CAS for School Mathematics Teaching. Algebra Vol. 7 (2000), 13.Google Scholar
- Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton . 2012. Imagenet classification with deep convolutional neural networks Advances in neural information processing systems. 1097--1105. Google ScholarDigital Library
- Johan Lithner . 2004. Mathematical reasoning in calculus textbook exercises. The Journal of Mathematical Behavior Vol. 23, 4 (2004), 405--427.Google ScholarCross Ref
- Qi Liu, Runze Wu, Enhong Chen, Guandong Xu, Yu Su, Zhigang Chen, and Guoping Hu . 2018. Fuzzy cognitive diagnosis for modelling examinee performance. ACM Transactions on Intelligent Systems and Technology (TIST) Vol. 9, 4 (2018), 48. Google ScholarDigital Library
- Yuping Liu, Qi Liu, Runze Wu, Enhong Chen, Yu Su, Zhigang Chen, and Guoping Hu . 2016. Collaborative learning team formation: a cognitive modeling perspective International Conference on Database Systems for Advanced Applications. Springer, 383--400. Google ScholarDigital Library
- Lin Ma, Zhengdong Lu, and Hang Li . 2016. Learning to Answer Questions from Image Using Convolutional Neural Network. Thirtieth AAAI Conference on Artificial Intelligence. 3567--3573. Google ScholarDigital Library
- Iaroslav Melekhov, Juho Kannala, and Esa Rahtu . 2016. Siamese network features for image matching. In International Conference on Pattern Recognition. IEEE, 378--383.Google ScholarCross Ref
- Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean . 2013. Distributed representations of words and phrases and their compositionality Advances in neural information processing systems. 3111--3119. Google ScholarDigital Library
- Youssef Mroueh, Etienne Marcheret, and Vaibhava Goel . 2015. Deep multimodal learning for audio-visual speech recognition IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 2130--2134.Google Scholar
- Jonas Mueller and Aditya Thyagarajan . 2016. Siamese Recurrent Architectures for Learning Sentence Similarity. Thirtieth AAAI Conference on Artificial Intelligence. 2786--2792. Google ScholarDigital Library
- Zachary A Pardos and Anant Dadu . 2017. Imputing KCs with representations of problem content and context Proceedings of the 25th Conference on User Modeling, Adaptation and Personalization. ACM, 148--155. Google ScholarDigital Library
- Cesc C Park and Gunhee Kim . 2015. Expressing an image stream with a sequence of natural sentences Advances in neural information processing systems. 73--81. Google ScholarDigital Library
- Razvan Pascanu, Tomas Mikolov, and Yoshua Bengio . 2013. On the difficulty of training recurrent neural networks International Conference on Machine Learning. 1310--1318. Google ScholarDigital Library
- Jir'ı Rihák and Radek Pelánek . 2017. Measuring Similarity of Educational Items Using Data on Learners' Performance Proceedings of the 10th International Conference on Educational Data Mining. 16--23.Google Scholar
- Shuo Shang, Ruogu Ding, Bo Yuan, Kexin Xie, Kai Zheng, and Panos Kalnis . 2012. User oriented trajectory search for trip recommendation Proceedings of the 15th International Conference on Extending Database Technology. ACM, 156--167. Google ScholarDigital Library
- Shuo Shang, Jiajun Liu, Kun Zhao, Mingrui Yang, Kai Zheng, and Jirong Wen . 2015. Dimension reduction with meta object-groups for efficient image retrieval. Neurocomputing Vol. 169 (2015), 50--54.Google ScholarCross Ref
- Mohammad E Shiri, A Esma A"ımeur, and Claude Frasson . 1998. Student modelling by case based Reasoning. In International Conference on Intelligent Tutoring Systems. Springer, 394--403. Google ScholarDigital Library
- Nitish Srivastava, Geoffrey E Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov . 2014. Dropout: a simple way to prevent neural networks from overfitting. Journal of machine learning research Vol. 15, 1 (2014), 1929--1958. Google ScholarDigital Library
- Armin Stahl . 2006. Combining Case-Based and Similarity-Based Product Recommendation European Conference on Advances in Case-Based Reasoning. 355--369. Google ScholarDigital Library
- Yu Su, Qingwen Liu, Qi Liu, Zhenya Huang, Yu Yin, Enhong Chen, Chris Ding, Si Wei, and Guoping Hu . 2018. Exercise-Enhanced Sequential Modeling for Student Performance Prediction Thirty-Second AAAI Conference on Artificial Intelligence. 2435--2443.Google Scholar
- Avgoustos Tsinakos and Ioannis Kazanidis . 2012. Identification of conflicting questions in the PARES system. The International Review of Research in Open and Distributed Learning Vol. 13, 3 (2012), 297--313.Google ScholarCross Ref
- Adrienne E Williams, Nancy M Aguilar-Roca, Michelle Tsai, Matthew Wong, Marin Moravec Beaupré, and Diane K O'Dowd . 2011. Assessment of learning gains associated with independent exam analysis in introductory biology. CBE-Life Sciences Education Vol. 10, 4 (2011), 346--356.Google ScholarCross Ref
- Le Wu, Yong Ge, Qi Liu, Enhong Chen, Richang Hong, Junping Du, and Meng Wang . 2017 a. Modeling the evolution of users' preferences and social links in social networking services. IEEE Transactions on Knowledge and Data Engineering Vol. 29, 6 (2017), 1240--1253. Google ScholarDigital Library
- Le Wu, Yong Ge, Qi Liu, Enhong Chen, Bai Long, and Zhenya Huang . 2016. Modeling users' preferences and social links in Social Networking Services: a joint-evolving perspective. In Thirtieth AAAI Conference on Artificial Intelligence. 279--286. Google ScholarDigital Library
- Runze Wu, Qi Liu, Yuping Liu, Enhong Chen, Yu Su, Zhigang Chen, and Guoping Hu . 2015. Cognitive Modelling for Predicting Examinee Performance. International Joint Conferences on Artificial Intelligence. 1017--1024. Google ScholarDigital Library
- Runze Wu, Guandong Xu, Enhong Chen, Qi Liu, and Wan Ng . 2017 b. Knowledge or Gaming?: Cognitive Modelling Based on Multiple-Attempt Response Proceedings of the 26th International Conference on World Wide Web Companion. International World Wide Web Conferences Steering Committee, 321--329. Google ScholarDigital Library
- Ran Xu, Caiming Xiong, Wei Chen, and Jason J Corso . 2015. Jointly Modeling Deep Video and Compositional Text to Bridge Vision and Language in a Unified Framework.. In Twenty-Ninth AAAI Conference on Artificial Intelligence. 2346--2352. Google ScholarDigital Library
- Wenpeng Yin, Hinrich Schütze, Bing Xiang, and Bowen Zhou . 2016. ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs. Transactions of the Association for Computational Linguistics Vol. 4 (2016), 259--272.Google ScholarCross Ref
- Jing Yu, Dongmei Li, Jiajia Hou, Ying Liu, and Zhaoying Yang . 2014. Similarity Measure of Test Questions Based on Ontology and VSM. Open Automation and Control Systems Journal Vol. 6 (2014), 262--267.Google ScholarCross Ref
- Matthew D Zeiler, Dilip Krishnan, Graham W Taylor, and Rob Fergus . 2010. Deconvolutional networks. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. IEEE, 2528--2535.Google ScholarCross Ref
- Jiani Zhang, Xingjian Shi, Irwin King, and Dit-Yan Yeung . 2017. Dynamic Key-Value Memory Networks for Knowledge Tracing Proceedings of the 26th International Conference on World Wide Web. 765--774. Google ScholarDigital Library
- Hengshu Zhu, Hui Xiong, Yong Ge, and Enhong Chen . 2014. Mobile app recommendations with security and privacy awareness Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 951--960. Google ScholarDigital Library
- Tianyu Zhu, Qi Liu, Zhenya Huang, Enhong Chen, Defu Lian, Yu Su, and Guoping Hu . 2018. MT-MCD: A Multi-task Cognitive Diagnosis Framework for Student Assessment International Conference on Database Systems for Advanced Applications. Springer, 318--335.Google Scholar
Index Terms
- Finding Similar Exercises in Online Education Systems
Comments