e-johnstonn / GPT-Doc-Summarizer

Streamlit webapp for token-efficient summarization of long files using GPT 3.5/4
MIT License
141 stars 68 forks source link

GPT 3.5/4/4o-mini Powered Document Summarizer

This is a tool that takes a text document (PDF or TXT) or YouTube transcript and generates a concise summary using GPT-4O-Mini, GPT-4 or GPT-3.5-turbo. It can accurately summarize hundreds of pages of text. It's built with Python and Streamlit and leverages the langchain library for text processing. While the final output is generated with the latest GPT family model from OpenAI, GPT-4O-Mini (one of the LLMs that powers ChatGPT), only a small portion of the overall document is used in the prompts. Before any call is made to either LLM, the document is separated into small sections that contain the majority of the meaning of the document.

Summarize your documents here (no API key required): https://gpt-document-summarizer.streamlit.app/

Features

Usage

  1. Launch the Streamlit app by running streamlit run main.py
  2. Upload a document (TXT or PDF) to summarize.
  3. Enter your OpenAI API key if the free usage cap has been hit.
  4. Choose whether to use GPT-4 for the summarization (recommended, requires GPT-4 API access).
  5. Click the "Summarize" button and wait for the result.

Modules

Main Functions

Utility Functions