ABSTRACT
Images have a prominent role in the communication of news on the Web. We propose a novel method for image classification with subject categories when limited annotated images are available for training the classifier. A neural network based encoder learns image representations from paired news images and their texts. Once trained, this encoder transforms any image to a text-enriched representation of the image, which is then used as input for the classifier that categorizes an image according to its subject category. We have trained classifiers with different amounts of annotated images and found that the image classifier that uses the text-enriched image representations outperforms a baseline model that only uses image features especially in cases with limited training examples.
- Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. ImageNet: A large-scale hierarchical image database CVPR 2009. IEEE, 248--255.Google Scholar
- Persi Diaconis and Bradley Efron. 1985. Testing for independence in a two-way table: New interpretations of the chi-square statistic. The Annals of Statistics Vol. 13, 3, 845--874.Google ScholarCross Ref
Index Terms
- Text-Enriched Representations for News Image Classification
Recommendations
Semi-supervised robust deep neural networks for multi-label image classification
Highlights- Large-scale data includes many noisily labeled and unlabeled examples.
- With ...
AbstractThis paper introduces a robust method for semi-supervised training of deep neural networks for multi-label image classification. To this end, a ramp loss is utilized since it is more robust against noisy and incomplete image labels ...
Dual class representation learning for few-shot image classification
AbstractFew-shot learning (FSL) models are trained on base classes that have many training examples and evaluated on novel classes that have very few training examples. Since these models cannot be properly fine-tuned on the novel classes ...
Highlights- Proposes dual class representation learning (DCRL) for few-shot image classification.
Systematic Comparison of Incomplete-Supervision Approaches for Biomedical Image Classification
Artificial Neural Networks and Machine Learning – ICANN 2022AbstractDeep learning based classification of biomedical images requires expensive manual annotation by experts. Incomplete-supervision approaches including active learning, pre-training, and semi-supervised learning have thus been developed to increase ...
Comments