Spaces:

barunsaha
/

slide-deck-ai

Running

App Files Files Community

barunsaha commited on Dec 1, 2024

Commit

f054614

unverified ·

2 Parent(s): 24afa64 34cb50e

Merge pull request #59 from barun-saha/byok

Browse files

Files changed (2) hide show

README.md +20 -13
global_config.py +10 -11

README.md CHANGED Viewed

@@ -16,24 +16,17 @@ We spend a lot of time on creating the slides and organizing our thoughts for an
 With SlideDeck AI, co-create slide decks on any topic with Generative Artificial Intelligence.
 Describe your topic and let SlideDeck AI generate a PowerPoint slide deck for you—it's as simple as that!
-SlideDeck AI is powered by [Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407).
-Originally, it was built using the Llama 2 API provided by Clarifai.
-*Update (v4.0)*: Legacy SlideDeck AI allowed one-shot generation of a slide deck based on the inputs.
-In contrast, SlideDeck AI *Reloaded* enables an iterative workflow with a conversational interface,
-where you can create and improve the presentation.
 # Process
 SlideDeck AI works in the following way:
-1. Given a topic description, it uses Mistral Nemo Instruct to generate the *initial* content of the slides.
 The output is generated as structured JSON data based on a pre-defined schema.
 2. Next, it uses the keywords from the JSON output to search and download a few images with a certain probability.
 3. Subsequently, it uses the `python-pptx` library to generate the slides,
 based on the JSON data from the previous step.
-A user can choose from a set of three pre-defined presentation templates.
 4. At this stage onward, a user can provide additional instructions to *refine* the content.
 For example, one can ask to add another slide or modify an existing slide.
 A history of instructions is maintained.
@@ -41,6 +34,20 @@ A history of instructions is maintained.
 Clicking on the button will download the file.
 # Icons
 SlideDeck AI uses a subset of icons from [bootstrap-icons-1.11.3](https://github.com/twbs/icons)
@@ -50,6 +57,7 @@ SlideDeck AI uses a subset of icons from [bootstrap-icons-1.11.3](https://github
 # Known Issues
 - **Connection timeout**: Requests sent to the Hugging Face Inference endpoint might time out. If it still does not work, wait for a while and try again.
 The following is not an issue but might appear as a strange behavior:
@@ -59,11 +67,10 @@ number of allowed characters in the textbox, pasting would not work.
 # Local Development
-SlideDeck AI uses [Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407)
-via the Hugging Face Inference API.
 To run this project by yourself, you need to provide the `HUGGINGFACEHUB_API_TOKEN` API key,
-for example, in a `.env` file. For image search, the `PEXEL_API_KEY` should be added.
-Visit the respective websites to obtain the keys.
 # Live Demo

 With SlideDeck AI, co-create slide decks on any topic with Generative Artificial Intelligence.
 Describe your topic and let SlideDeck AI generate a PowerPoint slide deck for you—it's as simple as that!
 # Process
 SlideDeck AI works in the following way:
+1. Given a topic description, it uses a Large Language Model (LLM) to generate the *initial* content of the slides.
 The output is generated as structured JSON data based on a pre-defined schema.
 2. Next, it uses the keywords from the JSON output to search and download a few images with a certain probability.
 3. Subsequently, it uses the `python-pptx` library to generate the slides,
 based on the JSON data from the previous step.
+A user can choose from a set of pre-defined presentation templates.
 4. At this stage onward, a user can provide additional instructions to *refine* the content.
 For example, one can ask to add another slide or modify an existing slide.
 A history of instructions is maintained.
 Clicking on the button will download the file.
+# Summary of the LLMs
+Different LLMs offer different styles of content generation. Use one of the following LLMs along with relevant API keys/access tokens, as appropriate, to create the content of the slide deck:
+| LLM | Provider (code) | Requires API key                                                            | Characteristics |
+| :-------- | :------- |:----------------------------------------------------------------------------| :------- |
+| Mistral 7B Instruct v0.2 | Hugging Face (`hf`) | Optional but encouraged; [get here](https://huggingface.co/settings/tokens) | Faster, shorter content |
+| Mistral Nemo Instruct 2407 | Hugging Face (`hf`) | Optional but encouraged; [get here](https://huggingface.co/settings/tokens) | Slower, longer content |
+| Gemini 1.5 Flash | Google Gemini API (`gg`) | Mandatory; [get here](https://aistudio.google.com/apikey)                   | Faster, longer content |
+| Command R+ | Cohere (`co`) | Mandatory; [get here](https://dashboard.cohere.com/api-keys)                | Shorter, simpler content |
+The Mistral models do not mandatorily require an access token. However, you are encouraged to get and use your own Hugging Face access token.
 # Icons
 SlideDeck AI uses a subset of icons from [bootstrap-icons-1.11.3](https://github.com/twbs/icons)
 # Known Issues
+- **Model unavailable**: Mistral Nemo currently appears to be unavailable. See this [issue](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407/discussions/83).
 - **Connection timeout**: Requests sent to the Hugging Face Inference endpoint might time out. If it still does not work, wait for a while and try again.
 The following is not an issue but might appear as a strange behavior:
 # Local Development
+SlideDeck AI uses LLMs via different providers, such as Hugging Face, Google, and Gemini.
 To run this project by yourself, you need to provide the `HUGGINGFACEHUB_API_TOKEN` API key,
+for example, in a `.env` file. Alternatively, you can provide the access token in the app's user interface itself (UI). For other LLM providers, the API key can only be specified in the UI.  For image search, the `PEXEL_API_KEY` should be made available as an environment variable.
+Visit the respective websites to obtain the API keys.
 # Live Demo

global_config.py CHANGED Viewed

@@ -91,8 +91,7 @@ class GlobalConfig:
     # This is a long text, so not incorporated as a string in `strings.json`
     CHAT_USAGE_INSTRUCTIONS = (
-        'Briefly describe your topic of presentation in the textbox provided below.'
-        ' For example:\n'
         '- Make a slide deck on AI.'
         '\n\n'
         'Subsequently, you can add follow-up instructions, e.g.:\n'
@@ -101,22 +100,22 @@ class GlobalConfig:
         ' You can also ask it to refine any particular slide, e.g.:\n'
         '- Make the slide with title \'Examples of AI\' a bit more descriptive.'
         '\n\n'
-        'Finally, click on the download button to download the slide deck.'
         ' See this [demo video](https://youtu.be/QvAKzNKtk9k) for a brief walkthrough.\n\n'
-        'Currently, two LLMs are supported. **Mistral 7B Instruct v0.2** is fast and generates'
-        ' shorter outputs. On the other hand, **Mistral Nemo Instruct 2407** usually generates'
-        ' longer outputs but can also be slower. If one is not available, choose the other from'
-        ' the dropdown list.\n\n'
         ' SlideDeck AI does not have access to the Web, apart for searching for images relevant'
         ' to the slides. Photos are added probabilistically; transparency needs to be changed'
         ' manually, if required.\n\n'
         '[SlideDeck AI](https://github.com/barun-saha/slide-deck-ai) is an Open-Source project,'
         ' released under the'
         ' [MIT license](https://github.com/barun-saha/slide-deck-ai?tab=MIT-1-ov-file#readme).'
-        ' It is is powered by'
-        ' [Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407)'
-        ' and [Mistral 7B v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2).\n\n'
-        '---\n\n'
         '© Copyright 2023-2024 Barun Saha.\n\n'
     )

     # This is a long text, so not incorporated as a string in `strings.json`
     CHAT_USAGE_INSTRUCTIONS = (
+        'Briefly describe your topic of presentation in the textbox provided below. For example:\n'
         '- Make a slide deck on AI.'
         '\n\n'
         'Subsequently, you can add follow-up instructions, e.g.:\n'
         ' You can also ask it to refine any particular slide, e.g.:\n'
         '- Make the slide with title \'Examples of AI\' a bit more descriptive.'
         '\n\n'
+        'Finally, click on the download button at the bottom to download the slide deck.'
         ' See this [demo video](https://youtu.be/QvAKzNKtk9k) for a brief walkthrough.\n\n'
+        'Currently, three LLMs providers and four LLMs are supported:'
+        ' **Mistral 7B Instruct v0.2** and **Mistral Nemo Instruct 2407** via Hugging Face'
+        ' Inference Endpoint; **Gemini 1.5 Flash** via Gemini API; and **Command R+** via Cohere'
+        ' API. If one is not available, choose the other from the dropdown list. A [summary of'
+        ' the supported LLMs]('
+        'https://github.com/barun-saha/slide-deck-ai/blob/main/README.md#summary-of-the-llms)'
+        ' is available for reference.\n\n'
         ' SlideDeck AI does not have access to the Web, apart for searching for images relevant'
         ' to the slides. Photos are added probabilistically; transparency needs to be changed'
         ' manually, if required.\n\n'
         '[SlideDeck AI](https://github.com/barun-saha/slide-deck-ai) is an Open-Source project,'
         ' released under the'
         ' [MIT license](https://github.com/barun-saha/slide-deck-ai?tab=MIT-1-ov-file#readme).'
+        '\n\n---\n\n'
         '© Copyright 2023-2024 Barun Saha.\n\n'
     )