Update README.md
Browse files
README.md
CHANGED
@@ -13,4 +13,20 @@ The dataset was curated by re-rendering and cleaning up the Objaverse dataset.
|
|
13 |
Example image from the dataset:
|
14 |

|
15 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
16 |
|
|
|
13 |
Example image from the dataset:
|
14 |

|
15 |
|
16 |
+
I tried multiple training runs with different hyperparameters, and It seems that the model just learns the output structure at a very high level, but doesn't learn the details.
|
17 |
+
Here are some example outputs when asked to generate multiple views of a bird
|
18 |
+

|
19 |
+

|
20 |
+
|
21 |
+
|
22 |
+
Loss graphs for two best training runs
|
23 |
+

|
24 |
+
|
25 |
+
## Misc Details
|
26 |
+
|
27 |
+
The training method here is similar to FLUX Depth Control, and FLUX Canny Control.
|
28 |
+
The conditioning image is added as extra channels in the input, and the model is asked to denoise the noisy channels.
|
29 |
+
|
30 |
+
Thanks to [Modal](https://modal.com) for sponsoring the compute for this.
|
31 |
+
|
32 |
|