Multimodal Deep Learning for Online Meme Classification

Han, Stephanie; Leal-Arenas, Sebastian; Zdravevski, Eftim; Cavalcante, Charles C; Boukouvalas, Zois; Corizzo, Roberto

Multimodal Deep Learning for Online Meme Classification

Date Issued

2024-12-15

Author(s)

Han, Stephanie

Leal-Arenas, Sebastian

Cavalcante, Charles C

Boukouvalas, Zois

Corizzo, Roberto

Abstract

Memes possess a humorous intent, yet they can
also be used for malicious purposes. Analysing meme data has
the potential to enhance content monitoring, identify emerging
topics, and support content moderation in online platforms.
Memes also represent an interesting use case for multimodal
machine learning, as they combine text and image data. In this
study, we explored the linguistic characteristics and analysed
the convergent themes of five meme classes through common
word extraction. Moreover, we compared the effectiveness of
various machine learning models, i.e., unimodal (text or image)
and multimodal (early fusion, late fusion) in binary and multiclass meme classification tasks. Our results on a large meme
dataset showed that memes heavily adhered to current affairs,
demonstrated by the high frequency of topical words across meme
classes. Regarding model accuracy, early fusion achieved superior
accuracy over late fusion in meme classification. Binary models
outperformed multi-class classification methods. However, fusion
models did not consistently surpass the accuracy of independent
text or image-based models.

Subjects

Memes, multimodal lea...

File(s)

Name

Multimodal_Deep_Learning_of_Online_Memes__Copy_-8.pdf

Size

1.13 MB

Format

Adobe PDF

Checksum

(MD5):94380a129f724cc47efe7c1a83ce93b1