# Image Captioning

> Source: https://sukruyusufkaya.com/en/glossary/image-captioning
> Updated: 2026-05-13T20:00:57.077Z
> Type: glossary
> Category: bilgisayarli-goru
**TLDR:** The task of expressing the content of an image in fluent and meaningful natural language.

<p>Image captioning is one of the classic multimodal tasks that combines visual understanding with natural language generation. The system must not only recognize objects but also describe relations among them and the broader scene context. It has important applications in accessibility technologies, media indexing, and robotics. Strong captioning systems build a semantic bridge between vision and language.</p>