Text-Enriched Representations for News Image Classification

Authors:
Elias Moons

KU Leuven, Leuven, Belgium

KU Leuven, Leuven, Belgium
View Profile

,
Tinne Tuytelaars

KU Leuven, Leuven, Belgium

KU Leuven, Leuven, Belgium
View Profile

,
Marie-Francine Moens

KU Leuven, Leuven, Belgium

KU Leuven, Leuven, Belgium
View Profile

WWW '18: Companion Proceedings of the The Web Conference 2018April 2018Pages 99–100https://doi.org/10.1145/3184558.3186948

Published:23 April 2018Publication History

WWW '18: Companion Proceedings of the The Web Conference 2018

Pages 99–100

ABSTRACT

Images have a prominent role in the communication of news on the Web. We propose a novel method for image classification with subject categories when limited annotated images are available for training the classifier. A neural network based encoder learns image representations from paired news images and their texts. Once trained, this encoder transforms any image to a text-enriched representation of the image, which is then used as input for the classifier that categorizes an image according to its subject category. We have trained classifiers with different amounts of annotated images and found that the image classifier that uses the text-enriched image representations outperforms a baseline model that only uses image features especially in cases with limited training examples.

References

Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. ImageNet: A large-scale hierarchical image database CVPR 2009. IEEE, 248--255.Google Scholar
Persi Diaconis and Bradley Efron. 1985. Testing for independence in a two-way table: New interpretations of the chi-square statistic. The Annals of Statistics Vol. 13, 3, 845--874.Google ScholarCross Ref

Index Terms

Text-Enriched Representations for News Image Classification
1. Computer systems organization
  1. Architectures
    1. Other architectures
      1. Neural networks

Recommendations

Semi-supervised robust deep neural networks for multi-label image classification
Highlights
- Large-scale data includes many noisily labeled and unlabeled examples.
- With ...
Abstract
This paper introduces a robust method for semi-supervised training of deep neural networks for multi-label image classification. To this end, a ramp loss is utilized since it is more robust against noisy and incomplete image labels ...
Read More
Dual class representation learning for few-shot image classification
Abstract
Few-shot learning (FSL) models are trained on base classes that have many training examples and evaluated on novel classes that have very few training examples. Since these models cannot be properly fine-tuned on the novel classes ...
Highlights
- Proposes dual class representation learning (DCRL) for few-shot image classification.
Read More
Systematic Comparison of Incomplete-Supervision Approaches for Biomedical Image Classification
Artificial Neural Networks and Machine Learning – ICANN 2022
Abstract
Deep learning based classification of biomedical images requires expensive manual annotation by experts. Incomplete-supervision approaches including active learning, pre-training, and semi-supervised learning have thus been developed to increase ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WWW '18: Companion Proceedings of the The Web Conference 2018
April 2018
2023 pages
ISBN:9781450356404
General Chairs:
Pierre-Antoine Champin
Université Claude Bernard Lyon 1, France
,
Fabien Gandon
Inria, Université Côte d'Azur, CNRS, I3S, France
,
Lionel Médini
Université Claude Bernard Lyon 1, CNRS, LIRIS, France
,
Program Chairs:
Mounia Lalmas
Spotify, UK
,
Panagiotis G. Ipeirotis
New York University, USA
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
International World Wide Web Conferences Steering Committee
Republic and Canton of Geneva, Switzerland
Publication History
- Published: 23 April 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
deep learning
limited training data
news image classification
Qualifiers
- poster
Conference

Acceptance Rates
Overall Acceptance Rate1,899of8,196submissions,23%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 309
  Total Downloads
- Downloads (Last 12 months)33
- Downloads (Last 6 weeks)11
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Text-Enriched Representations for News Image Classification

WWW '18: Companion Proceedings of the The Web Conference 2018

ABSTRACT

References

Cited By

Index Terms

Recommendations

Semi-supervised robust deep neural networks for multi-label image classification

Dual class representation learning for few-shot image classification

Systematic Comparison of Incomplete-Supervision Approaches for Biomedical Image Classification