Abstract: Videos contain multimodal content, and exploring multi-branch cross-modal interactions with natural language queries can be of benefit to the text-video retrieval task (TVR). However, recent ...
Abstract: The rapid development of mobile internet has turned multimodal sentiment analysis (MSA) into a prominent research focus. Despite the progress achieved by existing models, the heterogeneity ...