TikTokActions: A TikTok-Derived Video Dataset for Human Action Recognition

Type: Preprint

Publication Date: 2024-02-13

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2402.08875

Abstract

The increasing variety and quantity of tagged multimedia content on platforms such as TikTok provide an opportunity to advance computer vision modeling. We have curated a distinctive dataset of 283,582 unique video clips categorized under 386 hashtags relating to modern human actions. We release this dataset as a resource for building domain-specific foundation models for human movement modeling tasks such as action recognition. To validate this dataset, which we name TikTokActions, we perform two sets of experiments. First, we pre-train the state-of-the-art VideoMAEv2 with a ViT-base backbone on a subset of TikTokActions, and then fine-tune and evaluate it on popular datasets such as UCF101 and HMDB51. We find that the performance of the model pre-trained on our TikTok dataset is comparable to that of models trained on larger action recognition datasets (95.3% on UCF101 and 53.24% on HMDB51). Second, our investigation into the relationship between pre-training dataset size and fine-tuning performance reveals that, beyond a certain threshold, the incremental benefit of larger training sets diminishes. This work introduces a TikTok video dataset that is available for public use and provides insights into the marginal benefit of increasing pre-training dataset sizes for video-based foundation models.

Locations

  • arXiv (Cornell University)

Similar Works

Action Title Year Authors
+ VideoLightFormer: Lightweight Action Recognition using Transformers 2021 Raivo E. Koot
Haiping Lu
+ Large-scale weakly-supervised pre-training for video action recognition 2019 Deepti Ghadiyaram
Matt Feiszli
Du Tran
Xueting Yan
Heng Wang
Dhruv Mahajan
+ Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset 2017 João Carreira
Andrew Zisserman
+ A Comprehensive Study of Deep Video Action Recognition 2020 Yi Zhu
Xinyu Li
Chunhui Liu
Mohammadreza Zolfaghari
Yuanjun Xiong
Chongruo Wu
Zhi Zhang
Joseph Tighe
R. Manmatha
Mu Li
+ Adapting Short-Term Transformers for Action Detection in Untrimmed Videos 2023 Min Yang
Huan Gao
Ping Guo
Limin Wang
+ The THUMOS challenge on action recognition for videos “in the wild” 2016 Haroon Idrees
Amir Zamir
Yu‐Gang Jiang
Alex Gorban
Ivan Laptev
Rahul Sukthankar
Mubarak Shah
+ AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions 2017 Chunhui Gu
Chen Sun
David A. Ross
Carl Vondrick
Caroline Pantofaru
Yeqing Li
Sudheendra Vijayanarasimhan
George Toderici
Susanna Ricco
Rahul Sukthankar
+ VideoBadminton: A Video Dataset for Badminton Action Recognition 2024 Qi Li
Tzu-Chen Chiu
Hsiang-Wei Huang
Min-Te Sun
Wei-Shinn Ku
+ Enhancing Video Transformers for Action Understanding with VLM-aided Training 2024 Hui Jing Lu
Jian Hu
Ronald Poppe
Albert Ali Salah
+ EXMOVES: Classifier-based Features for Scalable Action Recognition 2013 Du Tran
Lorenzo Torresani
+ Temporal Segment Networks for Action Recognition in Videos 2018 Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
+ Boundary-sensitive Pre-training for Temporal Localization in Videos 2021 Mengmeng Xu
Juan-Manuel Pérez-Rúa
Víctor Escorcia
Brais Martínez
Xiatian Zhu
Li Zhang
Bernard Ghanem
Tao Xiang
+ Action Recognition from Single Timestamp Supervision in Untrimmed Videos 2019 Davide Moltisanti
Sanja Fidler
Dima Damen
