Commit 0bfa519
1 Parent(s): 12a6528
Update app.py
app.py CHANGED
@@ -25,13 +25,10 @@ def predict(image, labels):
 description = """
 <div class="container" style="display:flex;">
 <div class="image">
-<img src="https://
+<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/7/73/Rafale_-_RIAT_2012_%287852045796%29.jpg/1200px-Rafale_-_RIAT_2012_%287852045796%29.jpg" alt="rafale" />
 </div>
 <div class="text">
-<p>
-as introduced in <a href="https://arxiv.org/abs/2102.05918"></a><i>"Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision"</i>. ALIGN features a dual-encoder architecture with EfficientNet and BERT as its text and vision encoders, and learns to align visual and text representations with contrastive learning.
-Unlike previous work, ALIGN leverages a massive noisy dataset and shows that the scale of the corpus can be used to achieve SOTA representations with a simple recipe.
-\n\nALIGN is not open-sourced and the `kakaobrain/align-base` model used for this demo is based on the Kakao Brain implementation that follows the original paper. The model is trained on the open source [COYO](https://github.com/kakaobrain/coyo-dataset) dataset by the Kakao Brain team. To perform zero-shot image classification with ALIGN, upload an image and enter your candidate labels as free-form text separated by a comma followed by a space.</p>
+<p>Compare anything</p>
 </div>
 </div>
 """
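The description removed in this commit told users to enter candidate labels as free-form text separated by a comma followed by a space. A minimal sketch of how such input might be handled follows. The helpers `parse_labels` and `scores_to_probs` are hypothetical and do not appear in app.py as shown; the actual body of `predict` is not visible in this diff, and the scores used here are made-up placeholders rather than real model output.

```python
import math


def parse_labels(raw: str) -> list[str]:
    """Split free-form input on ', ' (a comma followed by a space)."""
    return [label.strip() for label in raw.split(", ") if label.strip()]


def scores_to_probs(labels: list[str], logits: list[float]) -> dict[str, float]:
    """Turn one image's per-label similarity scores into probabilities (softmax)."""
    if len(labels) != len(logits):
        raise ValueError("expected exactly one score per label")
    if not labels:
        return {}
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return {label: e / total for label, e in zip(labels, exps)}


labels = parse_labels("a fighter jet, a bird, a kite")
# Placeholder scores standing in for image-text similarity logits (assumption).
probs = scores_to_probs(labels, [2.0, 0.5, 0.1])
print(probs)
```

In a real app the placeholder scores would come from the model's image-text similarity output for the uploaded image; the resulting label-to-probability mapping is the kind of value a classification demo typically displays.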