research-article

Personal Knowledge Base Construction from Text-based Lifelogs

Authors:
An-Zi Yen
National Taiwan University, Taipei, Taiwan ROC

Hen-Hsen Huang
National Chengchi University & MOST Joint Research Center for AI Technology and All Vista Healthcare, Taipei, Taiwan ROC

Hsin-Hsi Chen
National Taiwan University & MOST Joint Research Center for AI Technology and All Vista Healthcare, Taipei, Taiwan ROC

SIGIR'19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2019, Pages 185–194. https://doi.org/10.1145/3331184.3331209

Published: 18 July 2019

ABSTRACT

Previous work on lifelogging focuses on extracting life events from image, audio, and video data captured by wearable sensors. Rather than wearing an extra camera to record daily life, people are accustomed to logging their lives on social media platforms. In this paper, we aim to extract life events from textual data shared on Twitter and to construct personal knowledge bases of individuals. The issues to be tackled include the following: (1) not all text descriptions are related to life events, (2) life events in a text description can be expressed explicitly or implicitly, (3) the predicates in implicit events are often absent, and (4) the mapping from natural language predicates to knowledge base relations may be ambiguous. A joint learning approach is proposed to detect life events in tweets and to extract event components, including subjects, predicates, objects, and time expressions. Finally, the extracted information is transformed into knowledge base facts. The evaluation is performed on a collection of lifelogs from 18 Twitter users. Experimental results show that our proposed system is effective in life event extraction, and the constructed personal knowledge bases are expected to be useful for memory recall applications.
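The final step described in the abstract, turning extracted event components into knowledge base facts, can be illustrated with a minimal sketch. This is not the authors' implementation: the `LifeEvent` class, the `RELATION_MAP` lookup table, the relation names, and the `to_fact` function are all hypothetical and serve only to show the shape of the data. The paper itself uses a learned model for the predicate-to-relation mapping, which a fixed table cannot reproduce.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

@dataclass(frozen=True)
class LifeEvent:
    """Event components named in the abstract (field names are assumptions)."""
    subject: str
    predicate: Optional[str]   # None models an implicit event with no predicate
    obj: str
    time: Optional[str] = None

# Hypothetical lookup from surface predicates to knowledge base relations.
# Several predicates may share one relation, which is where ambiguity arises.
RELATION_MAP = {
    "ate": "eat",
    "had": "eat",
    "watched": "watch",
    "visited": "visit",
}

Fact = Tuple[str, str, str, Optional[str]]

def to_fact(event: LifeEvent, default_relation: Optional[str] = None) -> Optional[Fact]:
    """Map an extracted event to a (subject, relation, object, time) fact.

    Returns None when no relation can be determined.
    """
    if event.predicate is not None:
        relation = RELATION_MAP.get(event.predicate.lower(), default_relation)
    else:
        # Implicit event: the relation must come from elsewhere, here a default.
        relation = default_relation
    if relation is None:
        return None
    return (event.subject, relation, event.obj, event.time)

explicit_fact = to_fact(LifeEvent("I", "ate", "ramen", "2019-07-18"))
implicit_fact = to_fact(LifeEvent("I", None, "ramen"), default_relation="eat")
```

In this sketch an implicit event yields a fact only when a relation is supplied from outside, which mirrors issue (3) in the abstract: with the predicate absent, the relation has to be inferred rather than looked up.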

Index Terms

Personal Knowledge Base Construction from Text-based Lifelogs
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Information extraction
2. Information systems
  1. World Wide Web
    1. Web searching and information discovery
      1. Personalization

Published in
SIGIR'19: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2019
1512 pages
ISBN:9781450361729
DOI:10.1145/3331184
General Chairs:
Benjamin Piwowarski, CNRS - Sorbonne Universite, France
Max Chevalier, Universite de Toulouse, CNRS, France
Eric Gaussier, Universite Grenoble Alpes, CNRS, France
Program Chairs:
Yoelle Maarek, Amazon Research, Israel
Jian-Yun Nie, University of Montreal, Canada
Falk Scholer, RMIT University, Australia
Copyright © 2019 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
life event detection
lifelogging
personal knowledge base construction
social media
Qualifiers
- research-article
Acceptance Rates
SIGIR'19 Paper Acceptance Rate: 84 of 426 submissions, 20%
Overall Acceptance Rate: 792 of 3,983 submissions, 20%