Published in Stories by Research Graph on Medium
Author: Vaibhav Khobragade
Refining AI Vision: How Retrieval-Augmented Generation Transforms Image Captioning in Large Language Models

Leveraging External Knowledge to Enhance the Descriptive Capabilities of AI Systems

Author: Vaibhav Khobragade (ORCID: 0009-0009-8807-5982)

Introduction

Large Language Models (LLMs) are artificial intelligence models trained on massive amounts of text data to generate human-like language and produce coherent