Technical GlossaryComputer Vision
Referring Expression Comprehension
A task that matches a natural language description to the correct region in an image.
Referring expression comprehension identifies which region in an image is described by phrases such as "the small cup next to the blue box." It is one of the more precise and interactive forms of multimodal grounding. It has strong application value for robotics, visual interfaces, and instruction-driven systems. The task requires jointly resolving object, relation, and location cues from language.
You Might Also Like
Explore these concepts to continue your artificial intelligence journey.
