A New Comic Image Segmentation and Adaptive Differential Evolution Algorithm with Different Times Characteristics

Sivanagireddy Kalli; Srilakshmi Aouthu; B. Narendra Kumar; Yerram Srinivas; S. Jagadeesh; Jatothu Brahmaiah Naik

doi:10.5750/ijme.v167iA2(S).1637

PDF (GBP 29.99)

Published: Aug 19, 2025

DOI: https://doi.org/10.5750/ijme.v167iA2(S).1637

Keywords:

Comic scene Segmentation Time-series analysis accident prevention Optimization frog leap Genetic algorithm

Dr. Sivanagireddy Kalli

Professor, Department of ECE, Sridevi Women’s Engineering College, Hyderabad, Telangana, India

Dr. Srilakshmi Aouthu

Associate Professor, Department of Electronics and Communication Engineering, Vasavi College of Engineering, Hyderabad, Telangana, India

Dr. B. Narendra Kumar

Professor, Department of CSE, Sridevi Women's Engineering College, Hyderabad, Telengana, India

Dr. Yerram Srinivas

Professor, Department of ECE, Vignana Bharathi Institute of Technology, Hyderabad, Telangana, India

Dr. S. Jagadeesh

Professor, ECE Department, Sridevi Women's Engineering College, Hyderabad, Telangana, India

Dr. Jatothu Brahmaiah Naik

Prof in ECE department, Narasaraopet Engineering College, Andhra Pradesh, India

Abstract

Comic scene segmentation is crucial in understanding and analyzing visual storytelling, as it involves identifying and separating distinct elements within a sequence of panels. This paper proposes a novel segmentation approach, Frog Leap Differential Time Series Segmentation (FLDTSS), tailored for analyzing comic images, which often contain complex visual storytelling elements such as expressive characters, dynamic speech bubbles, and background effects. By leveraging time-series features across sequential comic panels, FLDTSS integrates both spatial and temporal cues for more context-aware segmentation. The method was tested on a diverse set of cartoon panels and achieved a precision of 91.6%, recall of 88.3%, and an F1-score of 89.9%, outperforming traditional methods such as Otsu Thresholding (F1-score: 70.6%), Edge-based Canny (76.1%), K-means Clustering (77.8%), Watershed (80.6%), and even Genetic Algorithm-based segmentation (83.2%). The segmentation time for FLDTSS was 1.22 seconds, demonstrating computational efficiency compared to more intensive evolutionary methods. Simulation results showed the model's ability to extract meaningful narrative components such as characters, speech bubbles, emotional cues, and visual effects, with background occupying ~55% of the segmented area, character regions ~22%, and speech bubbles ~8%. This study confirms FLDTSS as a powerful and scalable technique for semantic segmentation and narrative interpretation in visual storytelling formats like comics.

Issue

Vol. 167 No. A2(S) (2025): Special Issue - New Technologies and their Effects on Real-Time Social Developments

Section

Articles

Article Sidebar

Main Article Content

Abstract

Article Details