Artificial intelligence assisted ultrasound report generation

doi:10.35711/aimi.v6.i1.107069

Journal Information

Publication Name: Artificial Intelligence in Medical Imaging
ISSN: 2644-3260
Publisher: Baishideng Publishing Group Inc, 7041 Koll Center Parkway, Suite 160, Pleasanton, CA 94566, USA

Minireviews

Artif Intell Med Imaging. Jun 8, 2025; 6(1): 107069
Published online Jun 8, 2025. doi: 10.35711/aimi.v6.i1.107069

Table 1 Comparison of artificial intelligence models for ultrasound report generation

Method	Architectural features	Clinical relevance
CNN-LSTM	Combines CNN and LSTM, suitable for processing sequential data	Performs well in handling image and sequence information, applicable for ultrasound image analysis
Transformer-based models	Based on self-attention mechanisms, capable of capturing long-range dependencies, suitable for parallel processing	Excels in generating natural language reports, suitable for complex ultrasound report generation
VLMs	Integrates visual and linguistic information, capable of understanding image content and generating related text	Outstanding performance in multimodal learning, enhances the accuracy and clinical relevance of ultrasound reports

CNN: Convolutional neural networks; LSTM: Long short-term memory networks; VLMs: Visual language models.

Table 2 Key concepts in ultrasound report generation

Concept	Description	Significance
AI-assisted ultrasound report generation	Technology using AI to convert ultrasound imaging into structured diagnostic reports	Enhances efficiency, accuracy, and consistency of diagnosis
VLMs	AI models that integrate visual (images) and linguistic (text) information	Enable understanding of image content and generation of descriptive text
Image encoder	A component of VLMs that encodes image information	Transforms images into a format that the model can process
Text encoder	A component of VLMs that encodes text information	Transforms text into a format that the model can process
Attention mechanism	A technique that allows the model to focus on specific parts of the input (image or text)	Improves the model's ability to focus on important image regions and text
LLMs	Transformer-based models pre-trained on large text corpora	Enhance the quality and fluency of generated text

AI: Artificial intelligence; LLMs: Large language models; VLMs: Visual language models.
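
The attention mechanism row in Table 2 can be illustrated with a small sketch. The table does not specify which form of attention is meant, so the code below assumes scaled dot-product attention, one common formulation. The names `attend`, `region_keys`, `region_values`, and `text_query` are hypothetical, and the vectors are toy values rather than real image or text embeddings.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, keys, values):
    """Scaled dot-product attention for a single query vector.

    Returns the attention weights (one per key) and the weighted
    average of the value vectors.
    """
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    dim = len(values[0])
    context = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]
    return weights, context


# Toy data (assumption): three image regions, each with a 2-d key and value.
region_keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
region_values = [[10.0, 0.0], [0.0, 10.0], [5.0, 5.0]]
# Toy text-side query that aligns with the first key dimension.
text_query = [2.0, 0.0]

weights, context = attend(text_query, region_keys, region_values)
print(weights)
print(context)
```

In this toy setup the first and third regions receive equal weight and the second receives less, because only the first and third keys overlap with the query. That is the sense in which attention lets a model focus on specific parts of its input.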

Table 3 Challenges and proposed solutions in visual language model-based ultrasound report generation

Challenge	Proposed solution
Poor accuracy in text generation related to measurement results	Extract numerical values from ultrasound images using tools like TrOCR[15] and insert them into the report
Suboptimal handling of correspondence between text and images	Annotate the correspondence between text and images and design mechanisms to learn these relationships
Ineffective utilization of report templates	Use report templates as input, treat template prediction as an intermediate task, or have the model learn to modify templates
Issues with training data volume	Split existing reports into text-image pairs and reassemble them to create pseudo-cases for training
Ineffective utilization of historical reports	Use historical reports along with current ultrasound images as input
Neglect of image selection task	Explicitly model the image selection process to choose representative images for the report
Lack of utilization of ultrasound-related expertise	Fine-tune LLMs to learn this prior knowledge
Lack of exploration of predictive tasks	Conduct in-depth research on ultrasound examination scenarios to define effective predictive tasks

LLM: Large language model.
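
The training data volume row in Table 3 proposes splitting existing reports into text-image pairs and reassembling them into pseudo-cases. A minimal sketch of that idea follows. The table gives no implementation details, so grouping pairs by organ and sampling one pair per organ is an assumption made here for illustration. The `Pair` type, the function names, the image IDs, and the sentences are all hypothetical.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Pair:
    """One report sentence together with the image it describes."""
    organ: str      # assumed grouping key, not specified in the source
    image_id: str   # hypothetical identifier
    sentence: str


def split_reports(reports):
    """Flatten reports into a pool of text-image pairs, grouped by organ."""
    pool = {}
    for report in reports:
        for pair in report:
            pool.setdefault(pair.organ, []).append(pair)
    return pool


def make_pseudo_case(pool, organs, rng):
    """Reassemble a pseudo-case by sampling one pair per requested organ."""
    return [rng.choice(pool[organ]) for organ in organs]


# Made-up example reports, each a list of text-image pairs.
reports = [
    [Pair("liver", "img_001", "Liver size is normal."),
     Pair("kidney", "img_002", "No hydronephrosis is seen.")],
    [Pair("liver", "img_101", "Liver echotexture is coarse."),
     Pair("kidney", "img_102", "A simple cyst is noted in the right kidney.")],
]

pool = split_reports(reports)
rng = random.Random(0)  # seeded so the sampling is repeatable
pseudo_case = make_pseudo_case(pool, ["liver", "kidney"], rng)
for pair in pseudo_case:
    print(pair.image_id, pair.sentence)
```

A pseudo-case built this way may combine a liver finding from one patient with a kidney finding from another. Whether such combinations are clinically plausible would need to be checked before using them for training.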

Citation: Zeng JH, Zhao KK, Zhao NB. Artificial intelligence assisted ultrasound report generation. Artif Intell Med Imaging 2025; 6(1): 107069
URL: https://www.wjgnet.com/2644-3260/full/v6/i1/107069.htm
DOI: https://dx.doi.org/10.35711/aimi.v6.i1.107069