Go to:
Logótipo
Você está em: Start > Publications > View > Optimizing Medical Image Captioning with Conditional Prompt Encoding
Publication

Optimizing Medical Image Captioning with Conditional Prompt Encoding

Title
Optimizing Medical Image Captioning with Conditional Prompt Encoding
Type
Article in International Conference Proceedings Book
Year
2026
Authors
Fernandes, RF
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Oliveira, HS
(Author)
Other
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page Without ORCID
Pedro Ribeiro
(Author)
FCUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page Without ORCID
Conference proceedings International
Pages: 196-207
12th Iberian Conference on Pattern Recognition and Image Analysis-IbPRIA
Coimbra, PORTUGAL, JUN 30-JUL 03, 2025
Indexing
Publicação em ISI Web of Knowledge ISI Web of Knowledge - 0 Citations
Publicação em Scopus Scopus - 0 Citations
Other information
Authenticus ID: P-019-TXA
Abstract (EN): Medical image captioning is an essential tool to produce descriptive text reports of medical images. One of the central problems of medical image captioning is their poor domain description generation because large pre-trained language models are primarily trained in non-medical text domains with different semantics of medical text. To overcome this limitation, we explore improvements in contrastive learning for X-ray images complemented with soft prompt engineering for medical image captioning and conditional text decoding for caption generation. The main objective is to develop a softprompt model to improve the accuracy and clinical relevance of the automatically generated captions while guaranteeing their complete linguistic accuracy without corrupting the models' performance. Experiments on the MIMIC-CXR and ROCO datasets showed that the inclusion of tailored soft-prompts improved accuracy and efficiency, while ensuring a more cohesive medical context for captions, aiding medical diagnosis and encouraging more accurate reporting.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 12
Documents
We could not find any documents associated to the publication.
Recommend this page Top
Copyright 1996-2026 © Faculdade de Farmácia da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2026-03-03 at 01:48:17 | Privacy Policy | Personal Data Protection Policy | Whistleblowing | Electronic Yellow Book