1
|
Boers TGW, Fockens KN, van der Putten JA, Jaspers TJM, Kusters CHJ, Jukema JB, Jong MR, Struyvenberg MR, de Groof J, Bergman JJ, de With PHN, van der Sommen F. Foundation models in gastrointestinal endoscopic AI: Impact of architecture, pre-training approach and data efficiency. Med Image Anal 2024; 98:103298. [PMID: 39173410 DOI: 10.1016/j.media.2024.103298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Revised: 07/18/2024] [Accepted: 08/06/2024] [Indexed: 08/24/2024]
Abstract
Pre-training deep learning models with large data sets of natural images, such as ImageNet, has become the standard for endoscopic image analysis. This approach is generally superior to training from scratch, due to the scarcity of high-quality medical imagery and labels. However, it is still unknown whether the learned features on natural imagery provide an optimal starting point for the downstream medical endoscopic imaging tasks. Intuitively, pre-training with imagery closer to the target domain could lead to better-suited feature representations. This study evaluates whether leveraging in-domain pre-training in gastrointestinal endoscopic image analysis has potential benefits compared to pre-training on natural images. To this end, we present a dataset comprising of 5,014,174 gastrointestinal endoscopic images from eight different medical centers (GastroNet-5M), and exploit self-supervised learning with SimCLRv2, MoCov2 and DINO to learn relevant features for in-domain downstream tasks. The learned features are compared to features learned on natural images derived with multiple methods, and variable amounts of data and/or labels (e.g. Billion-scale semi-weakly supervised learning and supervised learning on ImageNet-21k). The effects of the evaluation is performed on five downstream data sets, particularly designed for a variety of gastrointestinal tasks, for example, GIANA for angiodyplsia detection and Kvasir-SEG for polyp segmentation. The findings indicate that self-supervised domain-specific pre-training, specifically using the DINO framework, results into better performing models compared to any supervised pre-training on natural images. On the ResNet50 and Vision-Transformer-small architectures, utilizing self-supervised in-domain pre-training with DINO leads to an average performance boost of 1.63% and 4.62%, respectively, on the downstream datasets. This improvement is measured against the best performance achieved through pre-training on natural images within any of the evaluated frameworks. Moreover, the in-domain pre-trained models also exhibit increased robustness against distortion perturbations (noise, contrast, blur, etc.), where the in-domain pre-trained ResNet50 and Vision-Transformer-small with DINO achieved on average 1.28% and 3.55% higher on the performance metrics, compared to the best performance found for pre-trained models on natural images. Overall, this study highlights the importance of in-domain pre-training for improving the generic nature, scalability and performance of deep learning for medical image analysis. The GastroNet-5M pre-trained weights are made publicly available in our repository: huggingface.co/tgwboers/GastroNet-5M_Pretrained_Weights.
Collapse
Affiliation(s)
- Tim G W Boers
- Eindhoven University of Technology, Groene Loper 3, 5612 AE Eindhoven, The Netherlands.
| | - Kiki N Fockens
- Amsterdam UMC, Location VUmc, De Boelelaan 1117, 1081 HV Amsterdam, The Netherlands
| | | | - Tim J M Jaspers
- Eindhoven University of Technology, Groene Loper 3, 5612 AE Eindhoven, The Netherlands
| | - Carolus H J Kusters
- Eindhoven University of Technology, Groene Loper 3, 5612 AE Eindhoven, The Netherlands
| | - Jelmer B Jukema
- Amsterdam UMC, Location VUmc, De Boelelaan 1117, 1081 HV Amsterdam, The Netherlands
| | - Martijn R Jong
- Amsterdam UMC, Location VUmc, De Boelelaan 1117, 1081 HV Amsterdam, The Netherlands
| | | | - Jeroen de Groof
- Amsterdam UMC, Location VUmc, De Boelelaan 1117, 1081 HV Amsterdam, The Netherlands
| | - Jacques J Bergman
- Amsterdam UMC, Location VUmc, De Boelelaan 1117, 1081 HV Amsterdam, The Netherlands
| | - Peter H N de With
- Eindhoven University of Technology, Groene Loper 3, 5612 AE Eindhoven, The Netherlands
| | - Fons van der Sommen
- Eindhoven University of Technology, Groene Loper 3, 5612 AE Eindhoven, The Netherlands
| |
Collapse
|
2
|
Campion JR, O'Connor DB, Lahiff C. Human-artificial intelligence interaction in gastrointestinal endoscopy. World J Gastrointest Endosc 2024; 16:126-135. [PMID: 38577646 PMCID: PMC10989254 DOI: 10.4253/wjge.v16.i3.126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/31/2023] [Revised: 01/18/2024] [Accepted: 02/23/2024] [Indexed: 03/14/2024] Open
Abstract
The number and variety of applications of artificial intelligence (AI) in gastrointestinal (GI) endoscopy is growing rapidly. New technologies based on machine learning (ML) and convolutional neural networks (CNNs) are at various stages of development and deployment to assist patients and endoscopists in preparing for endoscopic procedures, in detection, diagnosis and classification of pathology during endoscopy and in confirmation of key performance indicators. Platforms based on ML and CNNs require regulatory approval as medical devices. Interactions between humans and the technologies we use are complex and are influenced by design, behavioural and psychological elements. Due to the substantial differences between AI and prior technologies, important differences may be expected in how we interact with advice from AI technologies. Human–AI interaction (HAII) may be optimised by developing AI algorithms to minimise false positives and designing platform interfaces to maximise usability. Human factors influencing HAII may include automation bias, alarm fatigue, algorithm aversion, learning effect and deskilling. Each of these areas merits further study in the specific setting of AI applications in GI endoscopy and professional societies should engage to ensure that sufficient emphasis is placed on human-centred design in development of new AI technologies.
Collapse
Affiliation(s)
- John R Campion
- Department of Gastroenterology, Mater Misericordiae University Hospital, Dublin D07 AX57, Ireland
- School of Medicine, University College Dublin, Dublin D04 C7X2, Ireland
| | - Donal B O'Connor
- Department of Surgery, Trinity College Dublin, Dublin D02 R590, Ireland
| | - Conor Lahiff
- Department of Gastroenterology, Mater Misericordiae University Hospital, Dublin D07 AX57, Ireland
- School of Medicine, University College Dublin, Dublin D04 C7X2, Ireland
| |
Collapse
|
3
|
Schmitz R, Kather JN. Artificial intelligence in Barrett's oesophagus and the need for shared and combined data. United European Gastroenterol J 2022; 10:525-527. [PMID: 35704382 PMCID: PMC9278571 DOI: 10.1002/ueg2.12260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Affiliation(s)
- Rüdiger Schmitz
- Department of Internal Medicine I and Institute of Computational Neuroscience and Department of Interdisciplinary EndoscopyUniversity Medical Center Hamburg‐EppendorfHamburgGermany
| | - Jakob Nikolas Kather
- Else Kroener Fresenius Center for Digital HealthMedical Faculty Carl Gustav CarusTechnical University DresdenDresdenGermany
| |
Collapse
|
4
|
Schmitz R, Werner R, Repici A, Bisschops R, Meining A, Zornow M, Messmann H, Hassan C, Sharma P, Rösch T. Artificial intelligence in GI endoscopy: stumbling blocks, gold standards and the role of endoscopy societies. Gut 2022; 71:451-454. [PMID: 33479051 DOI: 10.1136/gutjnl-2020-323115] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 01/04/2021] [Accepted: 01/05/2021] [Indexed: 02/06/2023]
Affiliation(s)
- Rüdiger Schmitz
- Interdisciplinary Endoscopy, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.,Institute of Computational Neuroscience, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.,Center for Biomedical Artificial Intelligence (bAIome), University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Rene Werner
- Institute of Computational Neuroscience, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.,Center for Biomedical Artificial Intelligence (bAIome), University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Alessandro Repici
- Humanitas Clinical and Research Center - IRCCS, Rozzano, Italy.,Humanitas University, Department of Biomedical Sciences, Milan, Italy
| | - Raf Bisschops
- Gastroenterology, University Hospital Gasthuisberg, Leuven, Belgium
| | - Alexander Meining
- Department of Gastroenterology, University of Würzburg, Würzburg, Germany
| | - Michael Zornow
- Chair for Public and European Law, University of Göttingen, Göttingen, Germany
| | - Helmut Messmann
- Department of Gastroenterology, Universitätsklinikum Augsburg, Augsburg, Germany
| | - Cesare Hassan
- Gastroenterology Unit, Nuovo Regina Margherita Hospital, Rome, Italy
| | - Prateek Sharma
- Division of Gastroenterology and Hepatology, Veterans Affairs Medical Center and University of Kansas, Lawrence, Kansas, USA
| | - Thomas Rösch
- Interdisciplinary Endoscopy, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| |
Collapse
|
5
|
Ciuti G, Skonieczna-Żydecka K, Marlicz W, Iacovacci V, Liu H, Stoyanov D, Arezzo A, Chiurazzi M, Toth E, Thorlacius H, Dario P, Koulaouzidis A. Frontiers of Robotic Colonoscopy: A Comprehensive Review of Robotic Colonoscopes and Technologies. J Clin Med 2020; 9:1648. [PMID: 32486374 PMCID: PMC7356873 DOI: 10.3390/jcm9061648] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Revised: 05/16/2020] [Accepted: 05/19/2020] [Indexed: 12/15/2022] Open
Abstract
Flexible colonoscopy remains the prime mean of screening for colorectal cancer (CRC) and the gold standard of all population-based screening pathways around the world. Almost 60% of CRC deaths could be prevented with screening. However, colonoscopy attendance rates are affected by discomfort, fear of pain and embarrassment or loss of control during the procedure. Moreover, the emergence and global thread of new communicable diseases might seriously affect the functioning of contemporary centres performing gastrointestinal endoscopy. Innovative solutions are needed: artificial intelligence (AI) and physical robotics will drastically contribute for the future of the healthcare services. The translation of robotic technologies from traditional surgery to minimally invasive endoscopic interventions is an emerging field, mainly challenged by the tough requirements for miniaturization. Pioneering approaches for robotic colonoscopy have been reported in the nineties, with the appearance of inchworm-like devices. Since then, robotic colonoscopes with assistive functionalities have become commercially available. Research prototypes promise enhanced accessibility and flexibility for future therapeutic interventions, even via autonomous or robotic-assisted agents, such as robotic capsules. Furthermore, the pairing of such endoscopic systems with AI-enabled image analysis and recognition methods promises enhanced diagnostic yield. By assembling a multidisciplinary team of engineers and endoscopists, the paper aims to provide a contemporary and highly-pictorial critical review for robotic colonoscopes, hence providing clinicians and researchers with a glimpse of the major changes and challenges that lie ahead.
Collapse
Affiliation(s)
- Gastone Ciuti
- The BioRobotics Institute, Scuola Superiore Sant’Anna, 56025 Pisa, Italy; (V.I.); (M.C.); (P.D.)
- Department of Excellence in Robotics & AI, Scuola Superiore Sant’Anna, 56127 Pisa, Italy
| | - Karolina Skonieczna-Żydecka
- Department of Human Nutrition and Metabolomics, Pomeranian Medical University in Szczecin, 71-460 Szczecin, Poland;
| | - Wojciech Marlicz
- Department of Gastroenterology, Pomeranian Medical University in Szczecin, 71-252 Szczecin, Poland;
- Endoklinika sp. z o.o., 70-535 Szczecin, Poland
| | - Veronica Iacovacci
- The BioRobotics Institute, Scuola Superiore Sant’Anna, 56025 Pisa, Italy; (V.I.); (M.C.); (P.D.)
- Department of Excellence in Robotics & AI, Scuola Superiore Sant’Anna, 56127 Pisa, Italy
| | - Hongbin Liu
- School of Biomedical Engineering & Imaging Sciences, Faculty of Life Sciences and Medicine, King’s College London, London SE1 7EH, UK;
| | - Danail Stoyanov
- Wellcome/EPSRC Centre for Interventional and Surgical Sciences (WEISS), University College London, London W1W 7TY, UK;
| | - Alberto Arezzo
- Department of Surgical Sciences, University of Torino, 10126 Torino, Italy;
| | - Marcello Chiurazzi
- The BioRobotics Institute, Scuola Superiore Sant’Anna, 56025 Pisa, Italy; (V.I.); (M.C.); (P.D.)
- Department of Excellence in Robotics & AI, Scuola Superiore Sant’Anna, 56127 Pisa, Italy
| | - Ervin Toth
- Department of Gastroenterology, Skåne University Hospital, Lund University, 20502 Malmö, Sweden;
| | - Henrik Thorlacius
- Department of Clinical Sciences, Section of Surgery, Lund University, 20502 Malmö, Sweden;
| | - Paolo Dario
- The BioRobotics Institute, Scuola Superiore Sant’Anna, 56025 Pisa, Italy; (V.I.); (M.C.); (P.D.)
- Department of Excellence in Robotics & AI, Scuola Superiore Sant’Anna, 56127 Pisa, Italy
| | | |
Collapse
|