Sony
/

Text-to-Motion
kengouchida commited on
Commit
6540f60
·
verified ·
1 Parent(s): 5e7e861

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +20 -0
README.md ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ This repository is for the checkpoint of ["MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training"](https://kengouchida-sony.github.io/MoLA-demo/)
3
+
4
+ Abstract: In text-to-motion generation, controllability as well as generation quality and speed has become increasingly critical. The controllability challenges include generating a motion of a length that matches the given textual description and editing the generated motions according to control signals, such as the start-end positions and the pelvis trajectory. In this paper, we propose MoLA, which provides fast, high-quality, variable-length motion generation and can also deal with multiple editing tasks in a single framework. Our approach revisits the motion representation used as inputs and outputs in the model, incorporating an activation variable to enable variable-length motion generation. Additionally, we integrate a variational autoencoder and a latent diffusion model, further enhanced through adversarial training, to achieve high-quality and fast generation. Moreover, we apply a training-free guided generation framework to achieve various editing tasks with motion control inputs. We quantitatively show the effectiveness of adversarial learning in text-to-motion generation, and demonstrate the applicability of our editing framework to multiple editing tasks in the motion domain.
5
+
6
+ PDF: [arXiv](https://arxiv.org/abs/2406.01867)
7
+
8
+ Codebase: Training and inference codes are available at the [GitHub](https://github.com/sony/MoLA)
9
+
10
+ Citation:
11
+ ```bibtex
12
+ @article{uchida2024mola,
13
+ title={MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training},
14
+ author={Uchida, Kengo and Shibuya, Takashi and Takida, Yuhta and Murata, Naoki and Tanke, Julian and Takahashi, Shusuke and Mitsufuji, Yuki},
15
+ journal={arXiv preprint arXiv:2406.01867},
16
+ year={2024}
17
+ }
18
+ ```
19
+
20
+ License: Code is released under MIT, this model weights are released under CC-BY 4.0.