gradio
torch
transformers
faiss-cpu
numpy
pillow
tqdm
kagglehub
SpeechRecognition==3.8.1
gtts