lpw committed on
Commit ee96a04 · 1 Parent(s): 9bd2304

Update README.md

Files changed (1)
  1. README.md +6 -8
README.md CHANGED
@@ -12,16 +12,14 @@ datasets:
 - covost2
 - europarl_st
 - voxpopuli
-widget:
-- example_title: Common Voice sample 1
-  src: https://huggingface.co/facebook/xm_transformer_600m-es_en-multi_domain/resolve/main/common_voice_es_19966634.flac
+
 ---
-## xm_transformer_s2ut_800m-es-en-st-asr-bt_h1_2022
+## xm_transformer_s2ut_en-hk
 
 Speech-to-speech translation model from fairseq S2UT ([paper](https://arxiv.org/abs/2204.02967)/[code](https://github.com/facebookresearch/fairseq/blob/main/examples/speech_to_speech/docs/enhanced_direct_s2st_discrete_units.md)):
-- Spanish-English
+- English-Hokkien
 - Trained on mTEDx, CoVoST 2, Europarl-ST and VoxPopuli
-- Speech synthesis with [facebook/unit_hifigan_mhubert_vp_en_es_fr_it3_400k_layer11_km1000_lj_dur](https://huggingface.co/facebook/unit_hifigan_mhubert_vp_en_es_fr_it3_400k_layer11_km1000_lj_dur)
+- Speech synthesis with [facebook/unit_hifigan_HK_layer12.km2500_frame_TAT-TTS](https://huggingface.co/facebook/unit_hifigan_HK_layer12.km2500_frame_TAT-TTS)
 
 ## Usage
 ```python
@@ -42,7 +40,7 @@ import torchaudio
 cache_dir = os.getenv("HUGGINGFACE_HUB_CACHE")
 
 models, cfg, task = load_model_ensemble_and_task_from_hf_hub(
-    "facebook/xm_transformer_s2ut_800m-es-en-st-asr-bt_h1_2022",
+    "facebook/xm_transformer_s2ut_en-hk",
     arg_overrides={"config_yaml": "config.yaml", "task": "speech_to_text"},
     cache_dir=cache_dir,
 )
@@ -63,7 +61,7 @@ cache_dir = (
     cache_dir or (Path.home() / ".cache" / library_name).as_posix()
 )
 cache_dir = snapshot_download(
-    f"facebook/unit_hifigan_mhubert_vp_en_es_fr_it3_400k_layer11_km1000_lj_dur", cache_dir=cache_dir, library_name=library_name
+    f"facebook/unit_hifigan_HK_layer12.km2500_frame_TAT-TTS", cache_dir=cache_dir, library_name=library_name
 )
 
 x = hub_utils.from_pretrained(
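
The context lines of the last two hunks read `HUGGINGFACE_HUB_CACHE` from the environment and fall back to a per-library folder under the home directory. The standalone sketch below isolates that fallback, assuming the `cache_dir` tested in the last hunk is the value read by `os.getenv` earlier in the README. The helper name `resolve_cache_dir` is hypothetical, and `"fairseq"` is only an assumed value for `library_name`, which the diff does not show.

```python
import os
from pathlib import Path
from typing import Optional


def resolve_cache_dir(library_name: str, env_var: str = "HUGGINGFACE_HUB_CACHE") -> str:
    """Return the cache directory, preferring the environment variable.

    Hypothetical helper: it mirrors the `cache_dir or ...` expression from
    the README context lines and is not part of fairseq or huggingface_hub.
    """
    cache_dir: Optional[str] = os.getenv(env_var)
    # An unset (None) or empty value is falsy, so `or` falls through
    # to the default location under the user's home directory.
    return cache_dir or (Path.home() / ".cache" / library_name).as_posix()


if __name__ == "__main__":
    # "fairseq" is an assumed library name, used here for illustration only.
    print(resolve_cache_dir("fairseq"))
```

Because the fallback relies on truthiness rather than an explicit `is None` check, an empty `HUGGINGFACE_HUB_CACHE` value is treated the same as an unset one.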