Abstract: With the growing number of images generated daily in radiological practice and the digitization of historical studies, we face large databases whose metadata can be incomplete or incorrect.
Image captioning, situated at the intersection of computer vision and natural language processing, seeks to generate captions that are linguistically fluent, accurate, and semantically rich.