Spaces:

LAP-DEV
/

Demo

Running

App Files Files Community

LAP-DEV commited on Feb 12

Commit

119a0fb

verified ·

1 Parent(s): fc2ac0e

Upload README.md

Browse files

Files changed (1) hide show

README.md +18 -19

README.md CHANGED Viewed

@@ -26,8 +26,7 @@ A Gradio-based browser interface for [Whisper](https://github.com/openai/whisper
 - ## Run Locally
     ### Prerequisite
-    To run this WebUI, you need to have `git`, `python` version 3.8 ~ 3.10, `FFmpeg`.<BR>
-    If you're not using an Nvida GPU, or using a different `CUDA` version than 12.4,  edit the `file requirements.txt` to match your environment.
     Please follow the links below to install the necessary software:
     - git : [https://git-scm.com/downloads](https://git-scm.com/downloads)
@@ -46,25 +45,25 @@ A Gradio-based browser interface for [Whisper](https://github.com/openai/whisper
 - ## Running with Docker
     1. Install and launch [Docker-Desktop](https://www.docker.com/products/docker-desktop/)
     2. Get the repository
-    3. Build the image ( Image is about ~7GB)
-    ```sh
-    docker compose build
-    ```
-    4. Run the container
-    ```sh
-    docker compose up
-    ```
     5. Connect to the WebUI with your browser at `http://localhost:7860`
-    Note: If needed, update the `docker-compose.yaml` to match your environment
 # VRAM Usages
 - This project is integrated with [faster-whisper](https://github.com/guillaumekln/faster-whisper) by default for better VRAM usage and transcription speed.<BR>According to faster-whisper, the efficiency of the optimized whisper model is as follows:
     | Implementation    | Precision | Beam size | Time  | Max. GPU memory | Max. CPU memory |
@@ -81,4 +80,4 @@ A Gradio-based browser interface for [Whisper](https://github.com/openai/whisper
     | medium |   769 M    |    `medium.en`     |      `medium`      |     ~5 GB     |      ~2x       |
     | large  |   1550 M   |        N/A         |      `large`       |    ~10 GB     |       1x       |
-Note: `.en` models are for English only, and you can use the `Translate to English` option from the other models

 - ## Run Locally
     ### Prerequisite
+    To run this WebUI, you need to have `git`, `python` version 3.8 ~ 3.10, `FFmpeg`.<BR>If you're not using an Nvida GPU, or using a different `CUDA` version than 12.4,  edit the file `requirements.txt` to match your environment.
     Please follow the links below to install the necessary software:
     - git : [https://git-scm.com/downloads](https://git-scm.com/downloads)
 - ## Running with Docker
     1. Install and launch [Docker-Desktop](https://www.docker.com/products/docker-desktop/)
     2. Get the repository
+    3. If needed, update the `docker-compose.yaml` to match your environment
+    4. Docker commands:
+        Build the image ( Image is about ~7GB)
+        ```sh
+        docker compose build
+        ```
+        Run the container
+        ```sh
+        docker compose up
+        ```
     5. Connect to the WebUI with your browser at `http://localhost:7860`
 # VRAM Usages
 - This project is integrated with [faster-whisper](https://github.com/guillaumekln/faster-whisper) by default for better VRAM usage and transcription speed.<BR>According to faster-whisper, the efficiency of the optimized whisper model is as follows:
     | Implementation    | Precision | Beam size | Time  | Max. GPU memory | Max. CPU memory |
     | medium |   769 M    |    `medium.en`     |      `medium`      |     ~5 GB     |      ~2x       |
     | large  |   1550 M   |        N/A         |      `large`       |    ~10 GB     |       1x       |
+    Note: `.en` models are for English only, and you can use the `Translate to English` option from the other models