- Article
Adapting CLIP for Action Recognition via Dual Semantic Supervision and Temporal Prompt Reparameterization
- Lujuan Deng,
- Jieqing Tan and
- Fangmei Liu
The contrastive vision–language pre-trained model CLIP, driven by large-scale open-vocabulary image–text pairs, has recently demonstrated remarkable zero-shot generalization capabilities in diverse downstream image tasks, which has made n...

