Rogdham / bigxml

Parse big xml files and streams with ease
https://bigxml.rogdham.net/
MIT License
27 stars 3 forks source link
big library python xml xml-parser xml-parsing
# BigXML Parse big xml files and streams with ease [![GitHub build status](https://img.shields.io/github/actions/workflow/status/rogdham/bigxml/build.yml?branch=master)](https://github.com/rogdham/bigxml/actions?query=branch:master) [![Release on PyPI](https://img.shields.io/pypi/v/bigxml)](https://pypi.org/project/bigxml/) [![Code coverage](https://img.shields.io/badge/coverage-100%25-brightgreen)](https://github.com/rogdham/bigxml/search?q=fail+under&type=Code) [![Mypy type checker](https://img.shields.io/badge/type_checker-mypy-informational)](https://mypy.readthedocs.io/) [![MIT License](https://img.shields.io/pypi/l/bigxml)](https://github.com/Rogdham/bigxml/blob/master/LICENSE.txt) --- [📖 Documentation](https://bigxml.rogdham.net/)   |   [📃 Changelog](./CHANGELOG.md)

Parsing big XML files in Python is hard. On one hand, regular XML libraries load the whole file into memory, which will crash the process if the file is too big. Other solutions such as iterparse do read the file as they parse it, but they are complex to use if you don't want to run out of memory.

This is where the BigXML library shines: