tencent
/

HunyuanVideo-I2V

Model card Files Files and versions

noaltian commited on Mar 6

Commit

c2b97c9

·

verified ·

1 Parent(s): 2a8a54e

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -63,7 +63,7 @@ Since we utilizes a pre-trained Multimodal Large Language Model (MLLM) with a De
 The overall architecture of our system is designed to maximize the synergy between image and text modalities, ensuring a robust and coherent generation of video content from static images. This integration not only improves the fidelity of the generated videos but also enhances the model's ability to interpret and utilize complex multimodal inputs. The overall architecture is as follows.
 <p align="center">
-  <img src="https://raw.githubusercontent.com/Tencent/HunyuanVideo-I2V/refs/heads/main/assets/backbone.png"  height=50>
 </p>

 The overall architecture of our system is designed to maximize the synergy between image and text modalities, ensuring a robust and coherent generation of video content from static images. This integration not only improves the fidelity of the generated videos but also enhances the model's ability to interpret and utilize complex multimodal inputs. The overall architecture is as follows.
 <p align="center">
+  <img src="https://raw.githubusercontent.com/Tencent/HunyuanVideo-I2V/refs/heads/main/assets/backbone.png"  style="max-width: 60%; height: auto;">
 </p>