Human Activity/Action Databases
1. RPI-ISL Activity Datasets
a. Parking Lot Dataset:
The dataset consists of 108 sequences for 7 actions captured from a parking lot. The actions includes walking, running, leaving car, entering car, bending down, throwing and looking around. These action examples are performed by two people with scale variation, view change and shadow interference.
| Action | # Examples |
| Walking | 20 |
| Running | 16 |
| Leaving Car | 8 |
| Entering Car | 7 |
| Bending Down | 22 |
| Throwing | 21 |
| Looking Around | 14 |
b. Complex Activity Dataset
The complex activity dataset consists of 15 video sequences for 5 complex activities in daily life: Shaking hands, Talking, Chasing, Boxing and Wrestling. These activities are conducted by two interactive subjects who perform 5 basic individual actions: Standing, Running, Making a fist, Clinching, and Reaching out. The following table describes the basic actions performed by the two subjects for each complex activity. In each sequence, the 5 complex activities are sequentially performed, so there are 15 examples for each complex activity.
| Activity | Action of subject 1 | Action of Subject 2 |
| Shaking hands | Reaching out | Reaching out |
| Talking | Standing | Standing |
| Chasing | Running | Running |
| Boxing | Making a fist | Making a fist |
| Wrestling | Clinching | Clinching |
*** The datasets are available upon request.
90 low-resolution(180x144) video clips with 9 different subjects, each of which performs 10 basic actions.
3. KTH Dataset
6 human actions performed several times by 25 subjects in four different scenarios. It totally contains 2391 sequences.
4. UCF Datasets
Six basic scenarios acted out by the CAVIAR team members: Walking, Browsing, Resting, Leaving bags behind, People meeting/walking together/splitting up and Two people fitting. There are about 3 – 6 clips for each scenario.
a. Hollywood Human Actions dataset: contains 8 classes of human actions from 32 Hollywood movies: AnswerPhone, GetOutCar, HandShake, HugPerson, Kiss, SitDown, SitUp, StandUp.
b. Hollywood-2 Human Actions and Scenes dataset: extended from last dataset. It contains 12 classes of human actions and 10 classes of scenes. There are over 3669 video sequences and approximately 20.1 hours of video in total.
7. LSCOM Event/Activity Dataset
Events/activities labeled with 24 LSCOM concepts from TRECVID 2005 benchmark. Each event has more than 60 samples.
8. UMN Dataset
An unusual crowd activity dataset consisting of 11 different scenarios of escape events in 3 different scenes.
9. UIUC Pair-activity Dataset
This dataset consists of five classes of pair-activities: chasing, following, together, meeting, and independent. There are 131 to 203 video clips in each class, and 867 video clips in total.