research-article

SAAMEAT: Active Feature Transformation and Selection Methods for the Recognition of User Eating Conditions

Authors:
Fasih Haider

University of Edinburgh, Edinburgh, United Kingdom

University of Edinburgh, Edinburgh, United Kingdom
View Profile

,
Senja Pollak

University of Edinburgh, Edinburgh, United Kingdom

University of Edinburgh, Edinburgh, United Kingdom
View Profile

,
Eleni Zarogianni

University of Edinburgh, Edinburgh, United Kingdom

University of Edinburgh, Edinburgh, United Kingdom
View Profile

,
Saturnino Luz

University of Edinburgh, Edinburgh, United Kingdom

University of Edinburgh, Edinburgh, United Kingdom
View Profile

ICMI '18: Proceedings of the 20th ACM International Conference on Multimodal InteractionOctober 2018Pages 564–568https://doi.org/10.1145/3242969.3243685

Published:02 October 2018Publication History

ICMI '18: Proceedings of the 20th ACM International Conference on Multimodal Interaction

Pages 564–568

ABSTRACT

Automatic recognition of eating conditions of humans could be a useful technology in health monitoring. The audio-visual information can be used in automating this process, and feature engineering approaches can reduce the dimensionality of audio-visual information. The reduced dimensionality of data (particularly feature subset selection) can assist in designing a system for eating conditions recognition with lower power, cost, memory and computation resources than a system which is designed using full dimensions of data. This paper presents Active Feature Transformation (AFT) and Active Feature Selection (AFS) methods, and applies them to all three tasks of the ICMI 2018 EAT Challenge for recognition of user eating conditions using audio and visual features. The AFT method is used for the transformation of the Mel-frequency Cepstral Coefficient and ComParE features for the classification task, while the AFS method helps in selecting a feature subset. Transformation by Principal Component Analysis (PCA) is also used for comparison. We find feature subsets of audio features using the AFS method (422 for Food Type, 104 for Likability and 68 for Difficulty out of 988 features) which provide better results than the full feature set. Our results show that AFS outperforms PCA and AFT in terms of accuracy for the recognition of user eating conditions using audio features. The AFT of visual features (facial landmarks) provides less accurate results than the AFS and AFT sets of audio features. However, the weighted score fusion of all the feature set improves the results.

References

Nathalie T. Burkert, Johanna Muckenhuber, Franziska Großschädl, Éva Rásky, and Wolfgang Freidl. 2014. Nutrition and Health textendash The Association between Eating Behavior and Various Health Parameters: A Matched Sample Study. PLoS ONE Vol. 9, 2 (feb. 2014), e88278.Google ScholarCross Ref
Florian Eyben, Felix Weninger, Florian Groß, and Björn Schuller. 2013. Recent developments in opensmile, the munich open-source multimedia feature extractor. In Proceedings of the 21st ACM international conference on Multimedia. ACM, 835--838. Google ScholarDigital Library
Fasih Haider, Fahim Salim, Owen Conlan, and Saturnino Luz. 2017. An Active Feature Transformation Method For Attitude Recognition of Video Bloggers Proc. Interspeech 2017.Google Scholar
Simone Hantke, Maximilian Schmitt, Panagiotis Tzirakis, and Björn Schuller. 2018. EAT - The ICMI 2018 Eating Analysis and Tracking Challenge Proceedings of the 2018 ACM on International Conference on Multimodal Interaction. ACM. Google ScholarDigital Library
Simone Hantke, Felix Weninger, Richard Kurle, Fabien Ringeval, Anton Batliner, Amr El-Desoky Mousa, and Björn Schuller. 2016. I Hear You Eat and Speak: Automatic Recognition of Eating Condition and Food Type, Use-Cases, and Impact on ASR Performance. PLOS ONE Vol. 11, 5 (may. 2016), e0154486.Google ScholarCross Ref
Guang-Bin Huang, Hongming Zhou, Xiaojian Ding, and Rui Zhang. 2012. Extreme learning machine for regression and multiclass classification. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) Vol. 42, 2 (2012), 513--529. Google ScholarDigital Library
Guang-Bin Huang, Qin-Yu Zhu, and Chee-Kheong Siew. 2006. Extreme learning machine: theory and applications. Neurocomputing Vol. 70, 1--3 (2006), 489--501.Google ScholarCross Ref
Heysem Kaya, Alexey A. Karpov, and Albert Ali Salah. 2015. Fisher vectors with cascaded normalization for paralinguistic analysis Sixteenth Annual Conference of the International Speech Communication Association.Google Scholar
Teuvo Kohonen. 1998. The self-organizing map. Neurocomputing Vol. 21, 1-3 (1998), 1--6.Google ScholarCross Ref
Mengyi Liu, Ruiping Wang, Shaoxin Li, Shiguang Shan, Zhiwu Huang, and Xilin Chen. 2014. Combining multiple kernel methods on riemannian manifold for emotion recognition in the wild. In Proceedings of the 16th International Conference on Multimodal Interaction. ACM, 494--501. Google ScholarDigital Library
Sarunas Raudys and Robert P. W. Duin. 1998. Expected classification error of the Fisher linear classifier with pseudo-inverse covariance matrix. Pattern Recognition Letters Vol. 19, 5-6 (April. 1998), 385--392. Google ScholarDigital Library
Björn W. Schuller, Stefan Steidl, Anton Batliner, Simone Hantke, Florian Hönig, Juan Rafael Orozco-Arroyave, Elmar Nöth, Yue Zhang, and Felix Weninger. 2015. The INTERSPEECH 2015 computational paralinguistics challenge: nativeness, parkinson's & eating condition. In INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015. 478--482. http://www.isca-speech.org/archive/interspeech_2015/i15_0478.htmlGoogle Scholar
Herman Wold. 1985. Partial least squares. Encyclopedia of statistical sciences (1985).Google Scholar

Index Terms

SAAMEAT: Active Feature Transformation and Selection Methods for the Recognition of User Eating Conditions
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Speech recognition
  2. Machine learning
    1. Learning paradigms
      1. Supervised learning
    2. Machine learning algorithms
      1. Feature selection
2. Information systems
  1. Information systems applications
    1. Multimedia information systems
      1. Multimedia databases

Recommendations

Fuzzy rough dimensionality reduction: A feature set partition-based approach
Abstract
Dimensionality reduction is considered in many learning methods using discriminative features to obtain optimal performance. In general, feature extraction and feature selection are two independent methods that cherry-pick the informative ...
Highlights
- The ϑ-fuzzy similarity relation is proposed.
- The feature set is divided into the nonsignificant feature set, weak significant feature set, and significant feature set.
- A feature extraction method called FSLLE is proposed, and the ...
Read More
Survival analysis for high-dimensional, heterogeneous medical data

HighlightsWe propose random survival forests for feature extraction for survival analysis.We formulate two constraints on the neighborhood graph specific to survival analysis.We implement a comparative analysis of 16 feature extraction/selection ...
Read More
Features: the more the better
ISCGAV'08: Proceedings of the 8th conference on Signal processing, computational geometry and artificial vision

In pattern recognition problems, it is usually recommended to extract a low number of features in order to avoid the computational cost. However, using today's computer capabilities we are able to extract and process more features than before. In this ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ICMI '18: Proceedings of the 20th ACM International Conference on Multimodal Interaction
October 2018
687 pages
ISBN:9781450356923
DOI:10.1145/3242969
General Chairs:
Sidney K. D'Mello
University of Illinois, USA
,
Panayiotis (Panos) Georgiou
University of Southern California, USA
,
Stefan Scherer
University of Southern California, USA
,
Program Chairs:
Emily Mower Provost
University of Michigan, USA
,
Mohammad Soleymani
University of Southern California, USA
,
Marcelo Worsley
Northwestern University, USA
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 2 October 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
audio-visual processing
dimensionality reduction
eating condition
feature extraction
feature selection
feature transformation
Qualifiers
- research-article
Conference

Acceptance Rates
ICMI '18 Paper Acceptance Rate63of149submissions,42%Overall Acceptance Rate453of1,080submissions,42%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 4
  Total Citations
  View Citations
- 119
  Total Downloads
- Downloads (Last 12 months)2
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

SAAMEAT: Active Feature Transformation and Selection Methods for the Recognition of User Eating Conditions

ICMI '18: Proceedings of the 20th ACM International Conference on Multimodal Interaction

ABSTRACT

References

Cited By

Index Terms

Recommendations

Fuzzy rough dimensionality reduction: A feature set partition-based approach

Survival analysis for high-dimensional, heterogeneous medical data

Features: the more the better

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

SAAMEAT: Active Feature Transformation and Selection Methods for the Recognition of User Eating Conditions

ICMI '18: Proceedings of the 20th ACM International Conference on Multimodal Interaction

ABSTRACT

References

Cited By

Index Terms

Recommendations

Fuzzy rough dimensionality reduction: A feature set partition-based approach

Survival analysis for high-dimensional, heterogeneous medical data

Features: the more the better

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media