Description: The homepage of Hao Li.
jekyll (1577) portfolio-website (1329) academic-website (1046) jekyll-theme (957)
Hao Li [email protected] Let the research be the goal, not the tool.
Cross-modal Retrieval methods build similarity relations between vision and language modalities by jointly learning a common representation space. However, the predictions are often unreliable due to the Aleatoric uncertainty, which is induced by low-quality data, e.g., corrupt images, fast-paced videos, and non-detailed texts. In this paper, we propose a novel Prototype-based Aleatoric Uncertainty Quantification (PAU) framework to provide trustworthy predictions by quantifying the uncertainty arisen from t