Detection of Fake News in Short Videos: A Multimodal Approach
No Thumbnail Available
Date
2025-06
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Indian Statistical Institute, Kolkata
Abstract
The rise of generative models and affordable video editing tools has fueled the spread
of fake and manipulated videos, undermining information reliabilityespecially on social
media. Traditional detection methods, focused on single modalities like visual artifacts
or text cues, often struggle with diverse, user-generated content.
This dissertation presents a unified framework for fake video detection that integrates
multimodal semantics, narrative structure, and propagation behavior. Visual, audio,
text, and OCR features are extracted using pretrained models (CLIP, Wav2Vec2), and
segment-level graphs are built to model narrative flow using Graph Attention Networks
(GATv2Conv). User engagement dynamics are modeled via a bidirectional LSTM.
A cross-modal consistency loss encourages semantic alignment across modalities, improving
representational coherence. The end-to-end model is evaluated on heterogeneous
datasets like FakeTT, demonstrating strong generalization and robustness.
Results show the proposed system outperforms existing baselines, especially in challenging
cases with asynchronous or fragmented content. By combining content, structure,
and behavioral cues, the framework enables more reliable and interpretable fake video
detection.
Description
Dissertation under the supervision of Dr. Ujjwal Bhattacharya
Keywords
Fake Video Detection, Multimodal Representation Learning, Graph Neural Networks, Temporal Modeling, Cross-Modal Consistency, Misinformation Detection
Citation
61p.
