Robert Schulz committed on
Commit
b94e3b3
·
1 Parent(s): f3870ae

updated README

Files changed (3)
  1. README.md +83 -1
  2. image/model_architecture.png +0 -0
  3. image/tuc.png +0 -0
README.md CHANGED
@@ -7,4 +7,86 @@ metrics:
  pipeline_tag: video-classification
  tags:
  - robotics
- ---
+ ---
+
+ <img src="image/tuc.png" alt="drawing" width="200"/>
+
+ Chemnitz University of Technology, Germany<br>
+ Department of Robotics and Human-Machine Interaction<br>
+ Author: Robert Schulz
+
+ <h1>Action Recognition</h1>
+
+ <h2>Table of Contents</h2>
+
+ - [1 Overview](#1-overview)
+ - [2 Pretrained Models](#2-pretrained-models)
+   - [2.1 TUC-AR](#21-tuc-ar)
+   - [2.2 UCF101](#22-ucf101)
+
+
+ ## 1 Overview
+
+ This repository provides a PyTorch action recognition model with pretrained weights for two datasets (see [2 Pretrained Models](#2-pretrained-models)). The model consists of a multi-stage 3D CNN feature extraction module followed by a classification head. It achieves state-of-the-art results on the UCF101 dataset.
+
+ ![Model architecture](image/model_architecture.png)
+ _**Figure 1** Model architecture_
+
+ ## 2 Pretrained Models
+ ### 2.1 TUC-AR
+
+ **Input**
+
+ | Dimension | Fixed | Value | Parameter | Description |
+ |-----------|---------|-------|-----------------|-------------------------------------------|
+ | 0 | no | ? | Batch Size | Number of sequences processed in one forward pass |
+ | 1 | yes | 30 | Sequence Length | Number of frames in one sequence |
+ | 2 | yes | 4 | Input Channels | Number of channels of one frame (RGB+D=4) |
+ | 3 | yes | 400 | Width | Width of one frame |
+ | 4 | yes | 400 | Height | Height of one frame |
+
+
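The input layout in the table above can be checked before a forward pass. The following is a minimal sketch in plain Python; the names `TUC_AR_INPUT` and `check_input_shape` are hypothetical and not part of the repository, and the sketch assumes the dimension order listed in the table (batch, sequence length, channels, width, height).

```python
# Expected sizes of the fixed input dimensions for the TUC-AR model,
# taken from the table above. Dimension 0 (batch size) is not fixed.
# TUC_AR_INPUT and check_input_shape are hypothetical names.
TUC_AR_INPUT = {1: 30, 2: 4, 3: 400, 4: 400}


def check_input_shape(shape, fixed=TUC_AR_INPUT):
    """Raise ValueError if `shape` does not match the documented layout."""
    shape = tuple(shape)
    if len(shape) != 5:
        raise ValueError(f"expected 5 dimensions, got {len(shape)}")
    if shape[0] < 1:
        raise ValueError("batch size must be at least 1")
    for dim, expected in fixed.items():
        if shape[dim] != expected:
            raise ValueError(
                f"dimension {dim} must be {expected}, got {shape[dim]}"
            )
    return shape
```

With PyTorch, `check_input_shape(x.shape)` could be called on a tensor `x` before it is passed to the model. Passing `fixed={1: 60, 2: 3, 3: 400, 4: 400}` would cover the UCF101 variant described in section 2.2.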
+ **Output**
+
+ | Dimension | Fixed | Value | Parameter | Description |
+ |-----------|---------|-------|-----------------|-------------------------------------------|
+ | 0 | no | ? | Batch Size | Number of sequences processed in one forward pass |
+ | 1 | yes | 10 | Number of action classes | Number of action classes<br>0 - None<br>1 - Waving<br>2 - Pointing<br>3 - Clapping<br>4 - Follow<br>5 - Walking<br>6 - Stop |
+
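One output row can be mapped back to a class name by taking the index of its largest value. The sketch below is an illustration in plain Python; `TUC_AR_LABELS` and `decode_prediction` are hypothetical names. The table documents 10 output values but names only indices 0 to 6, so the sketch falls back to a placeholder name for the remaining indices, which is an assumption made here.

```python
# Labels listed in the output table above. Indices 7 to 9 are not named
# there, so they receive a placeholder name (assumption of this sketch).
TUC_AR_LABELS = {
    0: "None",
    1: "Waving",
    2: "Pointing",
    3: "Clapping",
    4: "Follow",
    5: "Walking",
    6: "Stop",
}


def decode_prediction(scores, labels=TUC_AR_LABELS):
    """Return (index, label) of the highest value in one output row."""
    scores = list(scores)
    if not scores:
        raise ValueError("scores must not be empty")
    # Index of the maximum; the first index wins on ties.
    index = max(range(len(scores)), key=scores.__getitem__)
    return index, labels.get(index, f"class_{index}")
```

For a batched PyTorch output, each row could be converted with `row.tolist()` before it is passed to `decode_prediction`.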
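+ **Usage**
+
+ ```python
+ from huggingface_hub import HfApi
+
+ # Download the TUC-AR checkpoint from the Hugging Face Hub.
+ api = HfApi()
+ # hf_hub_download returns the local path of the downloaded file.
+ model_path = api.hf_hub_download('SchulzR97/TUC-AR-C3D', filename='tuc-ar.pth')
+ ```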
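The snippet above only downloads the file. What `tuc-ar.pth` contains (a plain `state_dict`, a wrapper dictionary, or a pickled model object) is not stated here, so the following sketch only inspects whatever `torch.load` returns. `summarize_checkpoint` is a hypothetical helper, not part of the repository.

```python
def summarize_checkpoint(checkpoint, max_keys=5):
    """Describe an object loaded from a .pth file without assuming its layout."""
    if isinstance(checkpoint, dict):
        # Covers state dicts (OrderedDict) and wrapper dictionaries alike.
        keys = [str(k) for k in checkpoint.keys()]
        shown = ", ".join(keys[:max_keys])
        more = "" if len(keys) <= max_keys else ", ..."
        return f"dict with {len(keys)} keys: {shown}{more}"
    # Anything else, for example a pickled model object.
    return f"object of type {type(checkpoint).__name__}"
```

With PyTorch installed, one would call `summarize_checkpoint(torch.load(model_path, map_location='cpu'))`. Recent PyTorch releases default to `weights_only=True` in `torch.load`, which rejects pickled model objects; if the file turns out to hold a full model, it would have to be loaded with `weights_only=False`, and only when the source is trusted.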
+
+ ### 2.2 UCF101
+
+ **Input**
+
+ | Dimension | Fixed | Value | Parameter | Description |
+ |-----------|---------|-------|-----------------|-------------------------------------------|
+ | 0 | no | ? | Batch Size | Number of sequences processed in one forward pass |
+ | 1 | yes | 60 | Sequence Length | Number of frames in one sequence |
+ | 2 | yes | 3 | Input Channels | Number of channels of one frame (RGB=3) |
+ | 3 | yes | 400 | Width | Width of one frame |
+ | 4 | yes | 400 | Height | Height of one frame |
+
+
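The UCF101 model expects exactly 60 frames per sequence, while video clips vary in length. How frames were selected for training is not described here; the sketch below shows one common option, uniform temporal sampling, using a hypothetical helper `sample_frame_indices`. Clips shorter than the sequence length repeat frames.

```python
def sample_frame_indices(num_frames, seq_len=60):
    """Pick `seq_len` frame indices spread evenly over a clip of `num_frames`."""
    if num_frames < 1 or seq_len < 1:
        raise ValueError("num_frames and seq_len must be positive")
    # Integer arithmetic maps position i in the sequence to a source frame.
    # Indices repeat when the clip has fewer frames than seq_len.
    return [(i * num_frames) // seq_len for i in range(seq_len)]
```

The returned indices could then be used to select frames from a decoded clip before resizing them to 400 x 400 and stacking them into the input layout from the table.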
78
+ **Output**
79
+
80
+ | Dimension | Fixed | Value | Parameter | Description |
81
+ |-----------|---------|-------|-----------------|-------------------------------------------|
82
+ | 0 | no | ? | Batch Size | Number of samples that will be propagated through the network (number of sequences) |
83
+ | 1 | yes | 101 | Number of action classes | Number of action classes |
84
+
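+ **Usage**
+
+ ```python
+ from huggingface_hub import HfApi
+
+ # Download the UCF101 checkpoint from the Hugging Face Hub.
+ api = HfApi()
+ # hf_hub_download returns the local path of the downloaded file.
+ model_path = api.hf_hub_download('SchulzR97/TUC-AR-C3D', filename='ucf101.pth')
+ ```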
image/model_architecture.png ADDED
image/tuc.png ADDED