Spaces:
Running
on
Zero
Running
on
Zero
Commit
Β·
0008a58
1
Parent(s):
908a5b7
update head
Browse files
app.py
CHANGED
@@ -379,41 +379,44 @@ with gr.Blocks(
|
|
379 |
is_example = gr.Textbox(label="is_example", visible=False, value="None")
|
380 |
num_images = gr.Textbox(label="num_images", visible=False, value="None")
|
381 |
|
382 |
-
gr.
|
383 |
-
|
384 |
-
|
385 |
-
|
386 |
-
|
|
|
|
|
387 |
|
388 |
<div style="font-size: 16px; line-height: 1.5;">
|
389 |
-
<p>Upload a video or a set of images to create a 3D reconstruction of a scene or object.
|
390 |
|
391 |
<h3>Getting Started:</h3>
|
392 |
<ol>
|
393 |
-
<li><strong>Upload Your Data:</strong> Use the
|
394 |
-
<li><strong>Preview:</strong>
|
395 |
-
<li><strong>Reconstruct:</strong> Click the
|
396 |
-
<li><strong>Visualize:</strong>
|
397 |
-
|
398 |
<strong>Adjust Visualization (Optional):</strong>
|
399 |
-
After reconstruction, you can fine-tune the visualization using the options below
|
400 |
<details style="display:inline;">
|
401 |
-
|
402 |
-
|
403 |
<li><em>Confidence Threshold:</em> Adjust the filtering of points based on confidence.</li>
|
404 |
<li><em>Show Points from Frame:</em> Select specific frames to display in the point cloud.</li>
|
405 |
<li><em>Show Camera:</em> Toggle the display of estimated camera positions.</li>
|
406 |
<li><em>Filter Sky / Filter Black Background:</em> Remove sky or black-background points.</li>
|
407 |
-
<li><em>Select a Prediction Mode:</em> Choose between
|
408 |
-
|
409 |
</details>
|
410 |
-
|
411 |
</ol>
|
412 |
<p><strong>Please note:</strong> Our method usually only needs less than 1 second to reconstruct a scene, but the visualization of 3D points may take tens of seconds, especially when the number of images is large. Please be patient or, for faster visualization, use a local machine to run our demo from our <a href="https://github.com/facebookresearch/vggt">GitHub repository</a>.</p>
|
413 |
</div>
|
414 |
"""
|
415 |
)
|
416 |
|
|
|
417 |
target_dir_output = gr.Textbox(label="Target Dir", visible=False, value="None")
|
418 |
|
419 |
with gr.Row():
|
|
|
379 |
is_example = gr.Textbox(label="is_example", visible=False, value="None")
|
380 |
num_images = gr.Textbox(label="num_images", visible=False, value="None")
|
381 |
|
382 |
+
gr.HTML(
|
383 |
+
"""
|
384 |
+
<h1>ποΈ VGGT: Visual Geometry Grounded Transformer</h1>
|
385 |
+
<p>
|
386 |
+
<a href="https://github.com/facebookresearch/vggt">π GitHub Repository</a> |
|
387 |
+
<a href="#">Project Page</a>
|
388 |
+
</p>
|
389 |
|
390 |
<div style="font-size: 16px; line-height: 1.5;">
|
391 |
+
<p>Upload a video or a set of images to create a 3D reconstruction of a scene or object. VGGT takes these images and generates a 3D point cloud, along with estimated camera poses.</p>
|
392 |
|
393 |
<h3>Getting Started:</h3>
|
394 |
<ol>
|
395 |
+
<li><strong>Upload Your Data:</strong> Use the βUpload Videoβ or βUpload Imagesβ buttons on the left to provide your input. Videos will be automatically split into individual frames (one frame per second).</li>
|
396 |
+
<li><strong>Preview:</strong> Your uploaded images will appear in the gallery on the left.</li>
|
397 |
+
<li><strong>Reconstruct:</strong> Click the βReconstructβ button to start the 3D reconstruction process.</li>
|
398 |
+
<li><strong>Visualize:</strong> The 3D reconstruction will appear in the viewer on the right. You can rotate, pan, and zoom to explore the model, and download the GLB file. Note the visualization of 3D points may be slow for a large number of input images.</li>
|
399 |
+
<li>
|
400 |
<strong>Adjust Visualization (Optional):</strong>
|
401 |
+
After reconstruction, you can fine-tune the visualization using the options below
|
402 |
<details style="display:inline;">
|
403 |
+
<summary style="display:inline;">(<strong>click to expand</strong>):</summary>
|
404 |
+
<ul>
|
405 |
<li><em>Confidence Threshold:</em> Adjust the filtering of points based on confidence.</li>
|
406 |
<li><em>Show Points from Frame:</em> Select specific frames to display in the point cloud.</li>
|
407 |
<li><em>Show Camera:</em> Toggle the display of estimated camera positions.</li>
|
408 |
<li><em>Filter Sky / Filter Black Background:</em> Remove sky or black-background points.</li>
|
409 |
+
<li><em>Select a Prediction Mode:</em> Choose between βDepthmap and Camera Branchβ or βPointmap Branch.β</li>
|
410 |
+
</ul>
|
411 |
</details>
|
412 |
+
</li>
|
413 |
</ol>
|
414 |
<p><strong>Please note:</strong> Our method usually only needs less than 1 second to reconstruct a scene, but the visualization of 3D points may take tens of seconds, especially when the number of images is large. Please be patient or, for faster visualization, use a local machine to run our demo from our <a href="https://github.com/facebookresearch/vggt">GitHub repository</a>.</p>
|
415 |
</div>
|
416 |
"""
|
417 |
)
|
418 |
|
419 |
+
|
420 |
target_dir_output = gr.Textbox(label="Target Dir", visible=False, value="None")
|
421 |
|
422 |
with gr.Row():
|