
PIPELINE:

  1. Pick a language
  2. Input an image, text, or audio
  3. Give the input to a chatbot, which explains the words and structures
  4. Parse the words and create a flashcard for each one
  5. Show the flashcards and offer to add them to Anki
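
A minimal sketch of steps 4 and 5 above, assuming the input has already been converted to text. Every name here (`Flashcard`, `extract_words`, `make_flashcards`, `to_anki_tsv`) is hypothetical, and `define` stands in for whatever the chatbot returns for a word. The export targets Anki's tab-separated text import; this is one possible route to "add to Anki", not necessarily the one the project will use.

```python
import re
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Flashcard:
    front: str  # the word as it appears in the input
    back: str   # explanation or translation for that word


def extract_words(text: str) -> List[str]:
    """Return the distinct words of `text`, lowercased, in first-seen order.

    Matching runs of word characters is an assumption that only suits
    space-delimited languages; Japanese or Chinese would need a tokenizer.
    """
    seen = set()
    words = []
    for word in re.findall(r"\w+", text.lower()):
        if word not in seen:
            seen.add(word)
            words.append(word)
    return words


def make_flashcards(text: str, define: Callable[[str], str]) -> List[Flashcard]:
    """Step 4: one card per distinct word. `define` is a stand-in for the chatbot."""
    return [Flashcard(front=w, back=define(w)) for w in extract_words(text)]


def to_anki_tsv(cards: List[Flashcard]) -> str:
    """Step 5: one card per line, front and back separated by a tab."""
    def clean(field: str) -> str:
        # Tabs and newlines inside a field would break the column layout.
        return field.replace("\t", " ").replace("\n", " ")

    return "\n".join(f"{clean(c.front)}\t{clean(c.back)}" for c in cards)


if __name__ == "__main__":
    # Hypothetical glossary in place of a real chatbot call.
    glossary = {"el": "the", "gato": "cat", "come": "eats"}
    cards = make_flashcards("El gato come", lambda w: glossary.get(w, "?"))
    print(to_anki_tsv(cards))
```

Passing `define` in as a callable keeps the card-building logic testable without a network call; the real chatbot can be swapped in later.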

TASKS NEEDED:

  1. image to text (OCR)
  • GOT-OCR (716M parameters)
  2. text to text (chatbot)
  • ChatGPT (GPT-4o)
  3. audio to text
  • Whisper
  4. chatbot to explain
  • ChatGPT (GPT-4o)
  5. text to speech
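
The image and audio tasks above both end in text, so one way to wire them into the pipeline is a small dispatcher that normalizes any input to a string before the chatbot sees it. This is a sketch only: `to_text`, `CONVERTERS`, and the two stub functions are hypothetical, and the stubs return fixed strings where real code would call the OCR and speech-to-text models.

```python
from typing import Callable, Dict

Converter = Callable[[bytes], str]


def to_text(kind: str, payload: bytes, converters: Dict[str, Converter]) -> str:
    """Normalize image, audio, or text input to a plain string."""
    if kind == "text":
        # Assumes text input arrives UTF-8 encoded.
        return payload.decode("utf-8")
    try:
        convert = converters[kind]
    except KeyError:
        raise ValueError(f"unsupported input kind: {kind!r}") from None
    return convert(payload)


# Hypothetical stubs; real versions would run the OCR and speech models.
def fake_ocr(image_bytes: bytes) -> str:
    return "texto de la imagen"


def fake_transcribe(audio_bytes: bytes) -> str:
    return "texto del audio"


CONVERTERS: Dict[str, Converter] = {"image": fake_ocr, "audio": fake_transcribe}
```

Keeping the converters in a dictionary means a model can be replaced by changing one entry, without touching the rest of the pipeline.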