Update README.md

README.md
@@ -62,6 +62,23 @@ Also, currently whisper.cpp and faster-whisper support the [sequential long-form
and only the Hugging Face pipeline supports [chunked long-form decoding](https://huggingface.co/distil-whisper/distil-large-v3#chunked-long-form), which we empirically
found better than sequential long-form decoding.

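For reference, chunked long-form decoding is enabled in the Hugging Face pipeline through the `chunk_length_s` argument. A minimal sketch, assuming `transformers` and `torch` are installed; `audio.wav` is a placeholder path and the 15-second chunk length is an illustrative value:

```
# run chunked long-form decoding with the Hugging Face pipeline
# (audio.wav is a placeholder for any audio file)
python3 - <<'EOF'
from transformers import pipeline

pipe = pipeline(
    "automatic-speech-recognition",
    model="kotoba-tech/kotoba-whisper-v1.0",
    chunk_length_s=15,  # enables chunked long-form decoding
)
print(pipe("audio.wav")["text"])
EOF
```
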
+### Conversion details
+The original model was converted with the following commands:
+
+```
+# clone OpenAI whisper and whisper.cpp
+git clone https://github.com/openai/whisper
+git clone https://github.com/ggerganov/whisper.cpp
+
+# get the models
+cd whisper.cpp/models
+git clone https://huggingface.co/kotoba-tech/kotoba-whisper-v1.0
+
+# convert to ggml
+python3 ./convert-h5-to-ggml.py ./kotoba-whisper-v1.0/ ../../whisper .
+mv ggml-model.bin ggml-kotoba-whisper-v1.0
+```
+
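
Once converted, the model can be sanity-checked with the whisper.cpp example binary. A sketch, assuming a whisper.cpp checkout from around the time of this conversion (where the example binary is named `main`); `sample_ja.wav` is a placeholder for a 16 kHz mono WAV file:

```
# from the whisper.cpp root: build, then transcribe a sample with the converted model
make
./main -m models/ggml-kotoba-whisper-v1.0 -f sample_ja.wav -l ja
```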

### Quantized Model
To use the quantized model, download the quantized GGML weights:
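
As an alternative to downloading prebuilt quantized weights, whisper.cpp also ships a `quantize` tool that can quantize the ggml file produced above locally. A sketch; the output filename and the `q5_0` scheme are illustrative choices, not necessarily those used for the published quantized weights:

```
# from the whisper.cpp root: build the quantize tool and quantize the converted model
make quantize
./quantize models/ggml-kotoba-whisper-v1.0 models/ggml-kotoba-whisper-v1.0-q5_0.bin q5_0
```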