:exclamation::exclamation: P2PaLA is deprecated :exclamation::exclamation:
Page to PAGE Layout Analysis (P2PaLA) is a toolkit for Document Layout Analysis based on Neural Networks.
:boom: Try our new DEMO for online baseline detection. :exclamation::exclamation:
If you find this toolkit useful in your research, please cite:
@misc{p2pala2017,
author = {Lorenzo Quirós},
title = {P2PaLA: Page to PAGE Layout Analysis tookit},
year = {2017},
publisher = {GitHub},
note = {GitHub repository},
howpublished = {\url{https://github.com/lquirosd/P2PaLA}},
}
Check this paper for more details Arxiv.
pip install tensorboardX
> A diferent conda env is recomended to keep tensorflow separated from PyTorchpython setup.py install
To install python dependencies alone, use requirements file
conda env create --file conda_requirements.yml
data_tag/page
, where images must be into the data_tag
folder and xml files into page
. For example:
mkdir -p data/{train,val,test,prod}/page;
tree data;
data
├── prod
│ ├── page
│ │ ├── prod_0.xml
│ │ └── prod_1.xml
│ ├── prod_0.jpg
│ └── prod_1.jpg
├── test
│ ├── page
│ │ ├── test_0.xml
│ │ └── test_1.xml
│ ├── test_0.jpg
│ └── test_1.jpg
├── train
│ ├── page
│ │ ├── train_0.xml
│ │ └── train_1.xml
│ ├── train_0.jpg
│ └── train_1.jpg
└── val
├── page
│ ├── val_0.xml
│ └── val_1.xml
├── val_0.jpg
└── val_1.jpg
python P2PaLA.py --config config.txt --tr_data ./data/train --te_data ./data/test --log_comment "_foo"
:exclamation: Pre-trained models available here
tensorboard --logdir ./work/runs
We recommend Transkribus or nw-page-editor to visualize and edit PAGE-xml files.
python P2PaLA.py -h
. GNU General Public License v3.0 See LICENSE to see the full text.
Code is inspired by pix2pix and pytorch-CycleGAN-and-pix2pix