BPG is committed to discovery and dissemination of knowledge
Observational Study
Copyright: ©Author(s) 2026.
World J Gastroenterol. Apr 21, 2026; 32(15): 116105
Published online Apr 21, 2026. doi: 10.3748/wjg.v32.i15.116105
Table 1 Details of the dataset used in the study
Kvasir labeled video
Category10
Bleeding images1312 frames
Else6081 frames
Total7393 frames
TrainStage: BleedingImages 1049Total = 5913
Stage: Un-bleedingImages 4864
ValStage: BleedingImages 263Total = 1480
Stage: Un-bleedingImages 1217
Table 2 Convolutional neural network layer configuration and training parameters
Layer type
Details
InputCapsule endoscopy frame, resized to 224 × 224 pixels, 3 channels (RGB)
Conv block 1Conv2D, 8 filters, 3 × 3 kernel, ReLU activation → MaxPooling 2 × 2
Conv block 2Conv2D, 16 filters, 3 × 3 kernel, ReLU activation → MaxPooling 2 × 2
Conv block 3Conv2D, 32 filters, 3 × 3 kernel, ReLU activation → MaxPooling 2 × 2
FlattenFlatten feature maps into 1D vector
Fully connected (FC1)4096 neurons, ReLU activation, dropout = 0.4
Fully connected (FC2)4096 neurons, ReLU activation
Fully connected (FC3)1024 neurons, ReLU activation
Fully connected (FC4)512 neurons, ReLU activation
Fully connected (FC5)256 neurons, ReLU activation
Output layer10 neurons, Softmax activation (10-class); 2 neurons, Softmax (2-class)
Training optimizerAdam, learning rate = 0.001, scheduler (reduce on plateau, factor 0.1)
Batch size32
EpochsUp to 180 (early stopping patience = 15)
Loss functionCategorical cross-entropy (10-class); binary cross-entropy (2-class)
Table 3 Summary of metric
Model
Precision
Recall
F1-score
Accuracy
10-class (collapsed)98.6%93.3%95.9%93.3%
2-class (direct)98.4%95.1%96.7%94.7%