Enhancing Visual Word Sense Disambiguation through Prompt-Based and Cross-Modal Retrieval
Introduction In the ever-evolving landscape of natural language processing and computer vision, the fusion of various modalities has given rise to innovative approaches to tackle complex tasks. Visual Word Sense Disambiguation (VWSD), often abbreviated as VWSD, is one such task where the goal is to determine the correct sense of a word in a given […]









