tencent/HunyuanVideo-I2V

noaltian committed on Mar 6

Commit a29a2e6 · verified · 1 Parent(s): c2b97c9

Update README.md

Files changed (1)

README.md +2 -2

@@ -63,7 +63,7 @@ Since we utilizes a pre-trained Multimodal Large Language Model (MLLM) with a De
 The overall architecture of our system is designed to maximize the synergy between image and text modalities, ensuring a robust and coherent generation of video content from static images. This integration not only improves the fidelity of the generated videos but also enhances the model's ability to interpret and utilize complex multimodal inputs. The overall architecture is as follows.
 <p align="center">
-  <img src="https://raw.githubusercontent.com/Tencent/HunyuanVideo-I2V/refs/heads/main/assets/backbone.png"  style="max-width: 60%; height: auto;">
+  <img src="https://raw.githubusercontent.com/Tencent/HunyuanVideo-I2V/refs/heads/main/assets/backbone.png"  style="max-width: 45%; height: auto;">
 </p>
@@ -216,7 +216,7 @@ Prompt description: The trigger word is written directly in the video caption. I
 For example, AI hair growth effect (trigger): rapid_hair_growth, The hair of the characters in the video is growing rapidly. + original prompt
-After having the training video and prompt pair, refer to [here] (hyvideo/hyvae_extract/README.md) for training data construction.
+After having the training video and prompt pair, refer to [here](https://github.com/Tencent/HunyuanVideo-I2V/blob/main/hyvideo/hyvae_extract/README.md) for training data construction.
 ### Training