You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello. We'd like to introduce our paper "Query-Dependent Video Representation for Moment Retrieval and Highlight Detection (CVPR 2023 Paper)" regarding cross-modal moment retrieval.
Would you mind adding 2 papers about video-text retrieval.
Paper 1: Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning
Accepted at ECCV 2024.
It leverages LLaVA to increase the scale of training data to video-text retrieval. The approach is to forward the concatenated frames of a video to LLaVA to generate the caption for the video.
Hello. We'd like to introduce our paper "Query-Dependent Video Representation for Moment Retrieval and Highlight Detection (CVPR 2023 Paper)" regarding cross-modal moment retrieval.
Code : https://github.com/wjun0830/QD-DETR
Arxiv : https://arxiv.org/abs/2303.13874
The text was updated successfully, but these errors were encountered: