mittagessen / conformer_ocr

text recognizer with a conformer
Apache License 2.0

ImportError: cannot import name 'XMLPage' from 'kraken.lib.xml' #2

Open johnlockejrr opened 2 hours ago

johnlockejrr commented 2 hours ago
(env-py3.10) incognito@DESKTOP-H1BS9PO:~/conformer_ocr$ cocr -d cuda train -f binary --workers 8 dataset.arrow
Usage: cocr train [OPTIONS] [GROUND_TRUTH]...
Try 'cocr train --help' for help.

Error: augmentation needs the `albumentations` package installed.
(env-py3.10) incognito@DESKTOP-H1BS9PO:~/conformer_ocr$ pip install `albumentations`
albumentations: command not found
ERROR: You must give at least one requirement to install (see "pip help install")
(env-py3.10) incognito@DESKTOP-H1BS9PO:~/conformer_ocr$ pip install albumentations
Collecting albumentations
  Downloading albumentations-1.4.15-py3-none-any.whl (200 kB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 200.3/200.3 KB 3.1 MB/s eta 0:00:00
Collecting pydantic>=2.7.0
  Downloading pydantic-2.9.2-py3-none-any.whl (434 kB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 434.9/434.9 KB 14.8 MB/s eta 0:00:00
Collecting eval-type-backport
  Using cached eval_type_backport-0.2.0-py3-none-any.whl (5.9 kB)
Collecting opencv-python-headless>=4.9.0.80
  Using cached opencv_python_headless-4.10.0.84-cp37-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (49.9 MB)
Collecting scikit-image>=0.21.0
  Using cached scikit_image-0.24.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (14.9 MB)
Requirement already satisfied: PyYAML in ./env-py3.10/lib/python3.10/site-packages (from albumentations) (6.0.2)
Collecting numpy>=1.24.4
  Using cached numpy-2.1.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (16.3 MB)
Collecting albucore>=0.0.15
  Downloading albucore-0.0.16-py3-none-any.whl (9.5 kB)
Requirement already satisfied: scipy>=1.10.0 in ./env-py3.10/lib/python3.10/site-packages (from albumentations) (1.10.1)
Requirement already satisfied: typing-extensions>=4.6.1 in ./env-py3.10/lib/python3.10/site-packages (from pydantic>=2.7.0->albumentations) (4.12.2)
Collecting pydantic-core==2.23.4
  Downloading pydantic_core-2.23.4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (2.1 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.1/2.1 MB 30.3 MB/s eta 0:00:00
Collecting annotated-types>=0.6.0
  Using cached annotated_types-0.7.0-py3-none-any.whl (13 kB)
Collecting lazy-loader>=0.4
  Using cached lazy_loader-0.4-py3-none-any.whl (12 kB)
Requirement already satisfied: networkx>=2.8 in ./env-py3.10/lib/python3.10/site-packages (from scikit-image>=0.21.0->albumentations) (3.3)
Requirement already satisfied: pillow>=9.1 in ./env-py3.10/lib/python3.10/site-packages (from scikit-image>=0.21.0->albumentations) (10.4.0)
Requirement already satisfied: tifffile>=2022.8.12 in ./env-py3.10/lib/python3.10/site-packages (from scikit-image>=0.21.0->albumentations) (2024.8.30)
Requirement already satisfied: packaging>=21 in ./env-py3.10/lib/python3.10/site-packages (from scikit-image>=0.21.0->albumentations) (24.1)
Requirement already satisfied: imageio>=2.33 in ./env-py3.10/lib/python3.10/site-packages (from scikit-image>=0.21.0->albumentations) (2.35.1)
Collecting numpy>=1.24.4
  Using cached numpy-1.26.4-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (18.2 MB)
Installing collected packages: pydantic-core, numpy, lazy-loader, eval-type-backport, annotated-types, pydantic, opencv-python-headless, scikit-image, albucore, albumentations
  Attempting uninstall: numpy
    Found existing installation: numpy 1.23.5
    Uninstalling numpy-1.23.5:
      Successfully uninstalled numpy-1.23.5
  Attempting uninstall: scikit-image
    Found existing installation: scikit-image 0.19.3
    Uninstalling scikit-image-0.19.3:
      Successfully uninstalled scikit-image-0.19.3
ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
kraken 4.3.13 requires numpy~=1.23.0, but you have numpy 1.26.4 which is incompatible.
kraken 4.3.13 requires scikit-image<=0.19.3,>=0.17.0, but you have scikit-image 0.24.0 which is incompatible.
Successfully installed albucore-0.0.16 albumentations-1.4.15 annotated-types-0.7.0 eval-type-backport-0.2.0 lazy-loader-0.4 numpy-1.26.4 opencv-python-headless-4.10.0.84 pydantic-2.9.2 pydantic-core-2.23.4 scikit-image-0.24.0
(env-py3.10) incognito@DESKTOP-H1BS9PO:~/conformer_ocr$ cocr -d cuda train -f binary --workers 8 dataset.arrow
╭─────────────────────────────── Traceback (most recent call last) ────────────────────────────────╮
│ /home/incognito/conformer_ocr/env-py3.10/bin/cocr:8 in <module>                                  │
│                                                                                                  │
│   5 from conformer_ocr.cli import cli                                                            │
│   6 if __name__ == '__main__':                                                                   │
│   7 │   sys.argv[0] = re.sub(r'(-script\.pyw|\.exe)?$', '', sys.argv[0])                         │
│ ❱ 8 │   sys.exit(cli())                                                                          │
│   9                                                                                              │
│                                                                                                  │
│ /home/incognito/conformer_ocr/env-py3.10/lib/python3.10/site-packages/click/core.py:1157 in      │
│ __call__                                                                                         │
│                                                                                                  │
│ /home/incognito/conformer_ocr/env-py3.10/lib/python3.10/site-packages/click/core.py:1078 in main │
│                                                                                                  │
│ /home/incognito/conformer_ocr/env-py3.10/lib/python3.10/site-packages/click/core.py:1688 in      │
│ invoke                                                                                           │
│                                                                                                  │
│ /home/incognito/conformer_ocr/env-py3.10/lib/python3.10/site-packages/click/core.py:1434 in      │
│ invoke                                                                                           │
│                                                                                                  │
│ /home/incognito/conformer_ocr/env-py3.10/lib/python3.10/site-packages/click/core.py:783 in       │
│ invoke                                                                                           │
│                                                                                                  │
│ /home/incognito/conformer_ocr/env-py3.10/lib/python3.10/site-packages/click/decorators.py:33 in  │
│ new_func                                                                                         │
│                                                                                                  │
│ /home/incognito/conformer_ocr/env-py3.10/lib/python3.10/site-packages/conformer_ocr/cli/train.py │
│ :246 in train                                                                                    │
│                                                                                                  │
│   243 │   import json                                                                            │
│   244 │   import torch                                                                           │
│   245 │                                                                                          │
│ ❱ 246 │   from conformer_ocr.dataset import TextLineDataModule                                   │
│   247 │   from conformer_ocr.model import RecognitionModel                                       │
│   248 │                                                                                          │
│   249 │   from lightning.pytorch import Trainer                                                  │
│                                                                                                  │
│ /home/incognito/conformer_ocr/env-py3.10/lib/python3.10/site-packages/conformer_ocr/dataset.py:2 │
│ 4 in <module>                                                                                    │
│                                                                                                  │
│    21                                                                                            │
│    22 from torch.utils.data import DataLoader, Subset, random_split                              │
│    23                                                                                            │
│ ❱  24 from kraken.lib.xml import XMLPage                                                         │
│    25 from kraken.lib.codec import PytorchCodec                                                  │
│    26 from kraken.lib.dataset import (ArrowIPCRecognitionDataset,                                │
│    27 │   │   │   │   │   │   │   │   ImageInputTransforms,                                      │
╰──────────────────────────────────────────────────────────────────────────────────────────────────╯
ImportError: cannot import name 'XMLPage' from 'kraken.lib.xml' (/home/incognito/conformer_ocr/env-py3.10/lib/python3.10/site-packages/kraken/lib/xml.py)
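
The pip output above reports kraken 4.3.13, and the traceback shows that this version's `kraken/lib/xml.py` does not provide `XMLPage`. A minimal sketch for checking an environment before running `cocr`, using only the standard library; the helper names `installed_version` and `exports_name` are hypothetical and not part of kraken or conformer_ocr:

```python
import importlib
from importlib import metadata


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def exports_name(module_name, attr):
    """Return True if the module imports cleanly and defines `attr`."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return False
    return hasattr(module, attr)


if __name__ == "__main__":
    # The three packages named in the pip conflict messages.
    for dist in ("kraken", "numpy", "scikit-image"):
        print(dist, installed_version(dist))
    # The import that fails in conformer_ocr/dataset.py.
    print("XMLPage available:", exports_name("kraken.lib.xml", "XMLPage"))
```

If the last line prints `False`, the installed kraken predates (or otherwise lacks) the `XMLPage` class that `conformer_ocr.dataset` imports.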

johnlockejrr commented 2 hours ago

I installed the latest kraken, which resolves the import problem, but training now fails with a different error:

(env-py3.10) incognito@DESKTOP-H1BS9PO:~/conformer_ocr$ cocr -d cuda train -f binary --workers 8 dataset.arrow
Usage: cocr train [OPTIONS] [GROUND_TRUTH]...
Try 'cocr train --help' for help.

Error: not enough values to unpack (expected 2, got 1)
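
The wording of this message matches Python's standard `ValueError` for sequence unpacking: some code expected two items and received one. Where this happens inside `cocr` is not visible in the output above, so the snippet below is only a generic illustration of the error, not a reproduction of the conformer_ocr code path; `unpack_pair` is a hypothetical helper:

```python
def unpack_pair(values):
    """Unpack exactly two items; raises ValueError for any other length."""
    first, second = values
    return first, second


try:
    # A one-element sequence cannot fill two targets.
    unpack_pair(["only-one"])
except ValueError as exc:
    message = str(exc)
    print(message)
```

A full traceback from the failing run would be needed to tell which value was expected to be a pair.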