Spaces: BAAI/EmbodiedVerse

HelloGitHub committed on Sep 2

Commit e854023 (1 parent: 11923fe)

update about table

Files changed (1)

src/about.py +86 -17


@@ -177,23 +177,92 @@ Planning
 We have categorized the data of the above 10 datasets by capability dimensions, and summarized four major capability dimensions required for embodied intelligence scenarios: spatial reasoning, perception, prediction, and planning. According to the capability dimensions, a high-quality subset with 2,042 samples was sampled. The definitions of the capability dimensions and the data volume of each dimension are as follows:
-|   |   |   |   |
-|---|---|---|---|
-|Capability Dimension（能力维度）|Sub-capability Dimension（子能力维度 ）|Data Volume（数据量）|Percentage（百分比）|
-|Spatial Reasoning|Dynamic|200|18.43%|
-|Relative direction|200|18.43%|
-|Multi-view matching|200|18.43%|
-|Relative distance|200|18.43%|
-|Depth estimation|107|9.86%|
-|Relative shape|82|7.56%|
-|Size estimation|96|8.85%|
-|Perception|Visual Grounding|200|44.64%|
-|Counting|200|44.64%|
-|State & Activity Understanding|48|10.71%|
-|Prediction|Trajectory|188|76.73%|
-|Future prediction|57|23.27%|
-|Planning|Goal Decomposition|200|75.76%|
-|Navigation|64|24.24%|
 ## EmbodiedVerse Tool - FlagEvalMM

 We have categorized the data of the above 10 datasets by capability dimensions, and summarized four major capability dimensions required for embodied intelligence scenarios: spatial reasoning, perception, prediction, and planning. According to the capability dimensions, a high-quality subset with 2,042 samples was sampled. The definitions of the capability dimensions and the data volume of each dimension are as follows:
+<table>
+    <thead>
+        <tr>
+            <th>Capability Dimension (能力维度)</th>
+            <th>Sub-capability Dimension (子能力维度)</th>
+            <th>Data Volume (数据量)</th>
+            <th>Percentage (百分比)</th>
+        </tr>
+    </thead>
+    <tbody>
+        <tr>
+            <td rowspan="7">Spatial Reasoning</td>
+            <td>Dynamic</td>
+            <td>200</td>
+            <td>18.43%</td>
+        </tr>
+        <tr>
+            <td>Relative direction</td>
+            <td>200</td>
+            <td>18.43%</td>
+        </tr>
+        <tr>
+            <td>Multi-view matching</td>
+            <td>200</td>
+            <td>18.43%</td>
+        </tr>
+        <tr>
+            <td>Relative distance</td>
+            <td>200</td>
+            <td>18.43%</td>
+        </tr>
+        <tr>
+            <td>Depth estimation</td>
+            <td>107</td>
+            <td>9.86%</td>
+        </tr>
+        <tr>
+            <td>Relative shape</td>
+            <td>82</td>
+            <td>7.56%</td>
+        </tr>
+        <tr>
+            <td>Size estimation</td>
+            <td>96</td>
+            <td>8.85%</td>
+        </tr>
+        <tr>
+            <td rowspan="3">Perception</td>
+            <td>Visual Grounding</td>
+            <td>200</td>
+            <td>44.64%</td>
+        </tr>
+        <tr>
+            <td>Counting</td>
+            <td>200</td>
+            <td>44.64%</td>
+        </tr>
+        <tr>
+            <td>State & Activity Understanding</td>
+            <td>48</td>
+            <td>10.71%</td>
+        </tr>
+        <tr>
+            <td rowspan="2">Prediction</td>
+            <td>Trajectory</td>
+            <td>188</td>
+            <td>76.73%</td>
+        </tr>
+        <tr>
+            <td>Future prediction</td>
+            <td>57</td>
+            <td>23.27%</td>
+        </tr>
+        <tr>
+            <td rowspan="2">Planning</td>
+            <td>Goal Decomposition</td>
+            <td>200</td>
+            <td>75.76%</td>
+        </tr>
+        <tr>
+            <td>Navigation</td>
+            <td>64</td>
+            <td>24.24%</td>
+        </tr>
+    </tbody>
+</table>
 ## EmbodiedVerse Tool - FlagEvalMM
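
Review note: the percentages in the table appear to be computed within each capability dimension rather than against the full 2,042-sample subset. The following is a minimal sketch for checking that reading. The counts are copied from the table in this diff; the names `COUNTS` and `percentages` are illustrative and are not part of `src/about.py`.

```python
# Sample counts per sub-capability, copied from the table in this diff.
COUNTS = {
    "Spatial Reasoning": {
        "Dynamic": 200,
        "Relative direction": 200,
        "Multi-view matching": 200,
        "Relative distance": 200,
        "Depth estimation": 107,
        "Relative shape": 82,
        "Size estimation": 96,
    },
    "Perception": {
        "Visual Grounding": 200,
        "Counting": 200,
        "State & Activity Understanding": 48,
    },
    "Prediction": {
        "Trajectory": 188,
        "Future prediction": 57,
    },
    "Planning": {
        "Goal Decomposition": 200,
        "Navigation": 64,
    },
}


def percentages(counts):
    """Return each sub-capability's share of its parent dimension, in percent."""
    result = {}
    for dimension, subs in counts.items():
        dimension_total = sum(subs.values())
        result[dimension] = {
            name: round(100 * n / dimension_total, 2) for name, n in subs.items()
        }
    return result


# Total number of samples across all four dimensions.
grand_total = sum(sum(subs.values()) for subs in COUNTS.values())
shares = percentages(COUNTS)

if __name__ == "__main__":
    print(f"Total samples: {grand_total}")
    for dimension, subs in shares.items():
        for name, pct in subs.items():
            print(f"{dimension:18s} {name:32s} {pct:6.2f}%")
```

Under this assumption, the per-dimension totals add up to the stated subset size, and each computed share rounds to the value shown in the corresponding table row.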