PaddlePaddle / PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
https://paddlepaddle.github.io/PaddleOCR/
Apache License 2.0
44.18k stars 7.82k forks source link

Error in training Slanet for table recognition #11002

Open vani-mcm opened 1 year ago

vani-mcm commented 1 year ago

I am new to paddlepaddle. Pls help me to solve this error. System environment : ubuntu 20.04 python version : 3.8.18 Installed libraries using requirements.txt file in paddle repo The error message:

  1. ValueError: could not broadcast input array from shape (8,) into shape (4,)

  2. return self._fmt.find(self.asctime_search) >= 0 RecursionError: maximum recursion depth exceeded while calling a Python object

  3. SystemError: (Fatal) Blocking queue is killed because the data reader raises an exception. [Hint: Expected killed != true, but received killed:1 == true:1.] (at /paddle/paddle/fluid/operators/reader/blocking_queue.h:175) If any information is needed, I will provide the necessary information.

sandy-git-only commented 1 year ago

I got the same problem: ValueError: could not broadcast input array from shape (8,) into shape (4,)

I have changed the values below: ver : release/2.6 I used the pretrained model : ch_ppstructure_mobile_v2.0_SLANet

weiwuxian1998 commented 4 months ago

i find the original box labeling format is [][][][], and the first and the third box is the top left and bottom right cords, so you can set box[0::2] in label_ops.py line 731, and can solve it