Spaces:

varl42
/

audio_abstract42

Sleeping

audio_abstract42 / README.md

Update README.md

f44133f almost 2 years ago

1.4 kB

A newer version of the Gradio SDK is available: 5.49.1

Upgrade

metadata

title: Audio Abstract42
emoji: 😻
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.7.1
app_file: app.py
pinned: false

PDF Audio Summarizer

This application summarizes PDF documents and converts the summary to audio.

The core logic is in the audio_pdf function. It:

Extracts raw text from the uploaded PDF using PyPDF2
Summarizes the text using LED-Based Summarization Model from HuggingFace Transformers. This uses AutoTokenizer and AutoModelForSeq2SeqLM to load the model and generate a summary
Converts the text summary to an audio file using gTTS (Google Text-to-Speech)

The summary and audio file are returned and displayed in the Gradio web interface.

The interface is created using Gradio. The key components are:

The interface is launched via iface.launch()

Additional dependencies: