CrisisKAN: Knowledge-Infused and Explainable Multimodal Attention Network for Crisis Event Classification

Jan 1, 2024·

Shubham Gupta

Nandini Saini

Suman Kundu

Debasis Das

· 0 min read

Cite Code DOI

Abstract

Pervasive use of social media has become the emerging source for real-time information (like images, text, or both) to identify various events. Despite the rapid growth of image and text-based event classification, the state-of-the-art (SOTA) models find it challenging to bridge the semantic gap between features of image and text modalities due to inconsistent encoding. Also, the black-box nature of models fails to explain the model’s outcomes for building trust in high-stakes situations such as disasters, pandemic. Additionally, the word limit imposed on social media posts can potentially introduce bias towards specific events. To address these issues, we proposed CrisisKAN, a novel Knowledge-infused and Explainable Multimodal Attention Network that entails images and texts in conjunction with external knowledge from Wikipedia to classify crisis events. To enrich the context-specific understanding of textual information, we integrated Wikipedia knowledge using proposed wiki extraction algorithm. Along with this, a guided cross-attention module is implemented to fill the semantic gap in integrating visual and textual data. In order to ensure reliability, we employ a model-specific approach called Gradient-weighted Class Activation Mapping (Grad-CAM) that provides a robust explanation of the predictions of the proposed model. The comprehensive experiments conducted on the CrisisMMD dataset yield in-depth analysis across various crisis-specific tasks and settings. As a result, CrisisKAN outperforms existing SOTA methodologies and provides a novel view in the domain of explainable multimodal event classification. (Code repository: https://github.com/shubhamgpt007/CrisisKAN)

Type

Conference paper

Publication

European Conference on Information Retrieval (ECIR 2024)

Last updated on Jan 1, 2024

Authors

Suman Kundu

Assistant Professor

My research interests lies in the intersection of Graph Algorithms and AI including graph representation learning, social network analysis, network data science, streaming algorithms, information retrival, big data, and data visualization.

← CookingINWild: Unleashing the Challenges of Indian Cuisine Cooking Videos for Action Recognition Jan 1, 2024

Cyberbully: Aggressive Tweets, Bully and Bully Target Profiling from Multilingual Indian Tweets Jan 1, 2024 →