Prefer a chat interface with context about you and your work?
Expanding Language-Image Pretrained Models for General Video Recognition