Just KIDDIN' : Knowledge Infusion and Distillation for Detection of INdecent memes

Overview

This project addresses the challenge of toxicity identification in online multimodal environments, where understanding contextual connections across modalities, such as text and visuals, is crucial. The proposed framework leverages Knowledge Distillation (KD) from Large Visual Language Models (LVLMs) and knowledge infusion from external sources to improve the detection of toxic content in hateful memes.

Model Architecture

1. Generation of LLaVA captions

LLaVA (Large Language and Vision Alignment) captions are generated from meme images and their corresponding OCR (Optical Character Recognition) captions. These captions help provide a semantic understanding of the meme’s context, which is critical for identifying toxic content.

Script: LLaVa Generation/main.py
Input: Meme image and its OCR caption
Output: LLaVA captions, which will be used in the following subgraph extraction steps.

2. Subgraph Extraction for KID-VLM

The subgraph extraction process is a key part of Knowledge Infusion. Nodes are extracted from the LLaVA-generated captions, expanded into knowledge graphs, and pruned to retain relevant nodes for model training.

2.1 Entity Extraction

To extract nodes (entities) from the LLaVA caption, we use keyword extraction techniques. This requires the following files

Script: Entity Extraction/grounding2.py
Input: Generated LLaVA caption
Required Files: Entity Extraction/concept.txt for ConceptNet Knowledge Graph and Entity Extraction/matcher_patterns.json for the matcher patterns
Output: Extracted Set of Nodes

2.2 Graph Expansion

Once the initial nodes are extracted, the graph is expanded by adding surrounding nodes from the ConceptNet Knowledge Graph. The expansion can include nodes at Hop 1 or Hop 2, depending on the requirement.

Script: RelKMG/parse_w_caption.py
Input: Previously Extracted Nodes
Output: Expanded Sub-graph

2.3 Relevancy Scoring and Pruning

After expanding the graph, use a relevancy scoring mechanism to prune the graph and retain only the most important nodes (top 750, 500, or 250 nodes) based on relevance to the meme context.

Script: RelKMG/RelScoreMiniLM.py
Input: Expanded Subgraph
Output: Pruned set of Nodes

2.4 Graph Contruction

Use the TranSE node embeddings to embed the nodes into a low-dimensional space, and construct the final pruned subgraph, which will be used in the training process.

Script: makeGraphs.py
Input: Pruned Nodes
Required Files: glove.transe.sgd.ent.npy to get the TranSE node embedding for the Knowledge Graph and Entity Extraction/concept.txt to get the Knowledge Graph
Output: Final graph to be used

3. KID-VLM

3.1 Hateful Memes Dataset

To train the model on the Hateful Memes Dataset, use the following script. This will train the KID-VLM model and log results on Weights and Biases (WandB). You will need to log in to WandB and set up a database for Optuna hyperparameter optimization.

Script: hateclipperModel_rgcn_caption_llava_distil.py

3.2 HarMeme Dataset

For the HarMeme Dataset, follow the subgraph extraction pipeline described above, and run the following scripts for each target variable (Intensity and Target)

Script: HarMeme/hateclipperModel_rgcn_lava_capt_distil_intensity.py for Intensity variable and HarMeme/hateclipperModel_rgcn_lava_capt_distil_target.py for Target variable

Please cite our work as


@article{garg2024just,
  title={Just KIDDIN: Knowledge Infusion and Distillation for Detection of INdecent Memes},
  author={Garg, Rahul and Padhi, Trilok and Jain, Hemang and Kursuncu, Ugur and Kumaraguru, Ponnurangam},
  journal={arXiv preprint arXiv:2411.12174},
  year={2024}
}

Reference Update (Feb 8, 2026)

We identified and corrected a small number of issues in the reference list (bibliographic and metadata inconsistencies). The paper’s content and conclusions remain unchanged.

Updated version:
https://github.com/SWAN-AI/Knowledge-Infused-Distilled-VLMs/blob/main/_ACL_2025__Just_KIDDIN__Knowledge_Infusion_and_Distillation_for_Detection_of_INdecent_Memes____Citations_Fixed%20(1).pdf

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Entity Extraction		Entity Extraction
HarMeme		HarMeme
LLaVa Generation		LLaVa Generation
MMBT Baselines		MMBT Baselines
RelKMG		RelKMG
Visuals.png		Visuals.png
_ACL_2025__Just_KIDDIN__Knowledge_Infusion_and_Distillation_for_Detection_of_INdecent_Memes____Citations_Fixed (1).pdf		_ACL_2025__Just_KIDDIN__Knowledge_Infusion_and_Distillation_for_Detection_of_INdecent_Memes____Citations_Fixed (1).pdf
hateclipperModel_rgcn_caption_llava_distil.py		hateclipperModel_rgcn_caption_llava_distil.py
makeGraphs.py		makeGraphs.py
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Just KIDDIN' : Knowledge Infusion and Distillation for Detection of INdecent memes

Overview

Model Architecture

1. Generation of LLaVA captions

2. Subgraph Extraction for KID-VLM

2.1 Entity Extraction

2.2 Graph Expansion

2.3 Relevancy Scoring and Pruning

2.4 Graph Contruction

3. KID-VLM

3.1 Hateful Memes Dataset

3.2 HarMeme Dataset

Reference Update (Feb 8, 2026)

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

SWAN-AI/Knowledge-Infused-Distilled-VLMs

Folders and files

Latest commit

History

Repository files navigation

Just KIDDIN' : Knowledge Infusion and Distillation for Detection of INdecent memes

Overview

Model Architecture

1. Generation of LLaVA captions

2. Subgraph Extraction for KID-VLM

2.1 Entity Extraction

2.2 Graph Expansion

2.3 Relevancy Scoring and Pruning

2.4 Graph Contruction

3. KID-VLM

3.1 Hateful Memes Dataset

3.2 HarMeme Dataset

Reference Update (Feb 8, 2026)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages