Bug report · Feature request · Read Docs
This document describes the playscholar
automated process to summarize a web page URL with ChatGPT.
The audience for this document includes:
User who will send a command with a web page URL via a Telegram Bot, and receive a text with summarization of its content.
DevSecOps Engineer who will design system workflows, configure and manage any SaaS or selfhosted services, and plan for disaster recovery.
playscholar
?This summarization-as-a-service leverages on and applies a similar design from the playscribe
automation to its workflow.
This services uses resources in both a free tier and a pay-per-usage tier by cloud SaaS providers:
Free | Pay-per-usage |
---|---|
Pipedream | ChatGPT |
GitHub | jina-ai/reader |
Telegram |
This service is inexpensive, easy to implement, and can be duplicated for other services.
This service does not require implementing a custom frontend as the User will interact with a Telegram Bot.
This service does not require a backend server as the processing will be performed using ephemeral runners.
This service uses a distroless image that drastically reduces the image size, resulting in a faster execution time for its emphemeral runners.
Scraping can be complex and often blocked, and raw HTML is cluttered with extraneuous elements like markups and scripts. The Reader API addresses these issues by extracting the core content from a URL and converting it into clean, LLM-friendly text, ensuring high-quality input for your agent and RAG systems.
Reader API is multi-modal which means that images on the web page are automatically captioned using a vision language model in the reader and formatted as image alt tags in the output. It natively supports PDF reading and is compatible with most PDFs.
Reader API is available for free and offers flexible rate limit and pricing.
This project has several limitations:
40
lines and will include a link to the file examples/result.txt
. However, the content of this file may be overridden by another pipeline.We have a thorough guide on how to set up and get started with playscholar
in our documentation.
If you encounter any problems feel free to open an issue. If you feel the project is missing a feature, please raise a ticket on GitHub and I'll look into it. Pull requests are also welcome.