Image detection method for multi-category lesions in wireless capsule endoscopy based on deep learning models

doi:10.3748/wjg.v30.i48.5111

Advanced Search

BPG is committed to discovery and dissemination of knowledge

Home / Archive / Volume 30, Issue 48

This Article

Academic Content and Language Evaluation of This Article

CrossCheck and Google Search of This Article

Academic Rules and Norms of This Article

Citation of this article

Corresponding Author of This Article

Research Domain of This Article

Article-Type of This Article

Open-Access Policy of This Article

Times Cited Counts in Google of This Article

Number of Hits and Downloads for This Article

Total Article Views (4595)

All Articles published online

The chart showing PDF series, HTML series, Figures (1-12) series, Tables (1-7) series.

Item

Count

PDF

143

HTML

1922

Figures (1-12)

337

Tables (1-7)

358

Sum=2760

Featured Article

The chart showing Browse series, Download series.

Item

Count

Browse

317

Download

742

Sum=1059

Publishing Process of This Article

Item

Count

Browse

Download

550

Sum=625

Dec 28, 2024 (publication date) through Oct 14, 2025

Times Cited of This Article

Times Cited (4)

Journal Information of This Article

Publication Name

World Journal of Gastroenterology

ISSN

1007-9327

Publisher of This Article

Baishideng Publishing Group Inc, 7041 Koll Center Parkway, Suite 160, Pleasanton, CA 94566, USA

Retrospective Study

World J Gastroenterol. Dec 28, 2024; 30(48): 5111-5129
Published online Dec 28, 2024. doi: 10.3748/wjg.v30.i48.5111

Open in New Tab Full Size Figure Download Figure

Figure 1 Dataset example.

Open in New Tab Full Size Figure Download Figure

Figure 2 Wireless capsule endoscopy_Detection model structure. Conv: Convolution; SPPF: Spatial pyramid pooling fast; SwinT: Swin transformer; SiLU: Sigmoid linear unit.

Open in New Tab Full Size Figure Download Figure

Figure 3 Wireless capsule endoscopy_Detection network structure diagram. C1, C2, C3, C4, C5: Layers 1, 2, 3, 4, and 5 of the backbone network; F2, F3, F4, F5: Layers 2, 3, 4, and 5 of the neck network; P2, P3, P4, P5: 2^nd, 3^rd, 4^th, 5^th detection head.

Open in New Tab Full Size Figure Download Figure

Figure 4 Context information of the reflux esophagitis lesion. A: Captured at 00:00:18, representing the earliest result; B: Captured at 00:00:19, representing an earlier frame at this time point; C: Captured at 00:00:19, representing a continued frame subsequent frame, showing further details of the lesion; D: Captured at 00:00:19, representing a later frame at this time point, where the lesion area might reveal new angles or further details due to the capsule’s movement.

Open in New Tab Full Size Figure Download Figure

Figure 5 Vision transformer. MLP: Multilayer perceptron; L: Layer.

Open in New Tab Full Size Figure Download Figure

Figure 6 Swin transformer model structure. H: Height; W: Width; C: Channels.

Open in New Tab Full Size Figure Download Figure

Figure 7 Two consecutive Swin transformer blocks. LN: Layer normalization; W-MSA: Window multihead self-attention; MLP: Multilayer perceptron; SW-MSA: Shifted window multihead self-attention.

Open in New Tab Full Size Figure Download Figure

Figure 8 Different feature fusion structures. A: Feature pyramid network (FPN); B: Path aggregation network; C: Bidirectional FPN. P2: Represents layer 2 feature maps; P3: Represents layer 3 feature maps; P4: Represents layer 4 feature maps; P5: Represents layer 5 feature maps; P6: Represents layer 6 feature maps. FPN: Feature pyramid network; PANet: Path aggregation network; BiFPN: Bidirectional feature pyramid network.

Open in New Tab Full Size Figure Download Figure

Figure 9 Precision-recall curves of the wireless capsule endoscopy_detection model for the dataset.

Open in New Tab Full Size Figure Download Figure

Figure 10 Confusion matrix of the wireless capsule endoscopy_detection model.

Open in New Tab Full Size Figure Download Figure

Figure 11 Wireless capsule endoscopy_detection model detection visualization results. Different letters in the image represent different types of lesions. A: Duodenal bulbar ulcer; B: Ulcerative trauma; C: Luminal stenosis; D: Gastritis; E: Small intestinal nodule; F: Stomach; G: Submucosal mass of the small intestine; H: Colonic mucosal melanosis; I: Esophageal protruding lesion tumor mass; J: Large ulcer; K: Angular notch; L: Duodenal papilla; M: Duodenal bulb; N: Reflux esophagitis; O: Large intestine; P: Bile; Q: Colocystosis hemorrhoids; R: Esophagus; S: Gastric antrum; T: Duodenal bulbar erosion.

Open in New Tab Full Size Figure Download Figure

Figure 12 Example of a wrongly detected lesion. A and B: Examples of misdetected lesions; C and D: Lesions with low detection accuracy.

Citation: Xiao ZG, Chen XQ, Zhang D, Li XY, Dai WX, Liang WH. Image detection method for multi-category lesions in wireless capsule endoscopy based on deep learning models. World J Gastroenterol 2024; 30(48): 5111-5129
URL: https://www.wjgnet.com/1007-9327/full/v30/i48/5111.htm
DOI: https://dx.doi.org/10.3748/wjg.v30.i48.5111