yimingliu / atomizer

Atom feed server for transforming HTML web pages to Atom feeds
3 stars 0 forks source link

Atomizer

A prototype Python Atom feed server for transforming arbitrary web pages into Atom Feeds

Get started

Major prerequisites

Example JSON spec file

{
  "uri": "https://the-target-website.example.com",
  "entries": "xpath to determine the list of things to extract as posts",
  "title": "....",
  "link": "...",
  "author": "...",
  "author_uri": "...",
  "date": "...",
  "summary": "...",
  "image": "...",
  "USER_AGENT": "Override the default user agent if needed to avoid anti-scraping defenses"
}

Installation

To run locally:

pip install -r requirements.txt

python app.py

TODO