by256 / imagedataextractor

ImageDataExtractor 2.0 - a Python library for electron microscopy image quantification.
https://imagedataextractor.org
MIT License

Finetuning IDE scalebar detection #12

Status: Open. Opened by dyhl0 2 years ago

dyhl0 commented 2 years ago

Hi OP, I've managed to run imagedataextractor successfully and would like to use the contents of the fine-tune file in the scalebar section to improve the scalebar detection rate. However, I am having trouble running the 'make training' line. I am running IDE on Windows 10, and the solutions I have found so far are to install the 'make' command with Chocolatey or to use a Linux subsystem, neither of which I have any experience with. Is there a guide on how to do this? Thank you, and I appreciate your help with my previous question about IDE.
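
For anyone hitting the same 'make' problem on Windows, a quick way to see what is already available is to check the PATH from Python. This is only a minimal sketch: the candidate names ('make', 'mingw32-make', and the 'wsl' launcher) are assumptions about what a Windows setup might have installed, not something the IDE documentation specifies, and the function name is hypothetical.

```python
import shutil


def find_build_tools(candidates=("make", "mingw32-make", "wsl")):
    """Map each candidate tool name to its resolved path, or None if it is not on PATH.

    The default candidate names are assumptions for illustration only.
    """
    return {name: shutil.which(name) for name in candidates}


if __name__ == "__main__":
    for name, path in find_build_tools().items():
        print(f"{name}: {path or 'not found on PATH'}")
```

If none of the tools are found, 'make training' cannot run from that shell until 'make' is installed by one of the routes mentioned above.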

by256 commented 2 years ago

Hi,

Sorry, but I don't think I can be of much help. I don't have much experience with Windows and don't even have a Windows computer, so my ability to assist is limited.

Unless you are adding more of your own training data, there is no need to finetune the scalebar detection model again with the existing data in the finetune directory, as this has already been done for the Tesseract model used by IDE. I agree that more finetuning is necessary though, as IDE's scalebar detection fails quite often.

dyhl0 commented 2 years ago

Hi OP,

Understood, and thank you very much. I have two questions:

  1. Per the instructions in the fine-tune section, the documents in scalebar-truth should be unzipped to tessdata/data. Did you mean tesstrain in the fine-tune folder, or does this refer to the one in tesseract?
  2. Is ocrd-testset required to improve the scalebar detection?

Thank you
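
Whichever directory question 1 turns out to refer to, the unzip step itself can be done without extra tools on Windows. The sketch below uses Python's standard zipfile module; the function name and both paths are placeholders (the thread does not settle which data directory is correct), so substitute the real archive and destination.

```python
import sys
import zipfile
from pathlib import Path


def extract_ground_truth(archive, dest):
    """Extract every file in `archive` into `dest` and return the extracted file names.

    `dest` is created if it does not exist. Both arguments are placeholders:
    the correct destination directory is not confirmed in this thread.
    """
    dest = Path(dest)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(dest)
        # Skip directory entries so only files are reported.
        return [name for name in zf.namelist() if not name.endswith("/")]


if __name__ == "__main__" and len(sys.argv) == 3:
    # Usage: python extract_truth.py <archive.zip> <destination-directory>
    files = extract_ground_truth(sys.argv[1], sys.argv[2])
    print(f"Extracted {len(files)} files to {sys.argv[2]}")
```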