akosbalasko / yarle

Yarle - The ultimate converter of Evernote notes to Markdown
https://github.com/akosbalasko/yarle
MIT License

Frozen while converting Enex file #48

Closed NathanSkene closed 1 year ago

NathanSkene commented 4 years ago

Hi,

I'm trying to move from Evernote to Obsidian. I came across this in the Obsidian forum. I exported all my notes from Evernote to an Enex file. Then I set this running with:

npm run start -- --enexSource="/Users/natske/OneDrive - Imperial College London/Evernote Export/My Notes2.enex" --outputDir=./out --include-metadata --zettelkasten --outputFormat=ObsidianMD

It ran fine through the first 2675 notes then just stopped and didn't progress any further.

[image: screenshot of the stalled conversion]

So it's been looking like that for 30 mins. Is there a debug log I could look in?

Thanks for developing this!

akosbalasko commented 4 years ago

Hi @NathanSkene! Thanks for trying Yarle! It should not stop anywhere; I ran some heavy stress tests against it and they passed. Unfortunately, there are no debug logs. But since it converts notes one by one, the problematic one should be "Background genes". Could you please export and convert only that one, just to check whether the problem is related to the number of notes? And please check whether there is anything unusual about that note, such as an attached PDF. Thank you!
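
If the note cannot be exported on its own from Evernote, one alternative is to cut it out of the existing export. The following is only a rough sketch, not part of Yarle: `isolateNote` is a hypothetical helper, and it assumes that each note in the ENEX file sits in a `<note>...</note>` element with a `<title>` child, that the literal text `</note>` never occurs inside a note body, and that the whole file fits in memory as a string.

```typescript
// Hypothetical helper (not part of Yarle): returns a small ENEX document that
// contains only the note with the given title, or undefined if none matches.
// Assumption: notes are delimited by <note>...</note> and carry a <title> child.
function isolateNote(enexXml: string, title: string): string | undefined {
  const firstNote = enexXml.indexOf("<note>");
  if (firstNote === -1) {
    return undefined;
  }
  // Keep everything before the first note (XML declaration, DOCTYPE, root tag).
  const header = enexXml.slice(0, firstNote);
  const wanted = title.trim().toLowerCase();

  const notePattern = /<note>[\s\S]*?<\/note>/g;
  let match: RegExpExecArray | null;
  while ((match = notePattern.exec(enexXml)) !== null) {
    const note = match[0];
    const titleMatch = /<title>([\s\S]*?)<\/title>/.exec(note);
    const found = titleMatch ? (titleMatch[1] ?? "") : "";
    // Titles are compared case-insensitively; XML entities are not decoded.
    if (found.trim().toLowerCase() === wanted) {
      return `${header}${note}\n</en-export>\n`;
    }
  }
  return undefined;
}
```

The result could be written to a new `.enex` file and passed to `--enexSource` as in the command above. For very large exports, a streaming XML parser would be a safer choice than a regular expression.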

NathanSkene commented 4 years ago

I'd figured it was 'Umbrella'... hadn't read it properly.

Turns out 'background genes' doesn't even open in Evernote properly. It's an absolutely massive single-column table.

I have attached it to this email, but this might be too specific a bug to bother fixing!

akosbalasko commented 4 years ago

Could you please send it as an attachment to my personal address? It's akos0215@gmail.com. Unfortunately, GitHub does not handle attachments well. Thank you very much!

akosbalasko commented 4 years ago

Hi @NathanSkene! I was able to reproduce this issue, and it looks like the problem is in a third-party dependency. I will try to resolve it on my own and/or raise it on their issue tracker. As soon as we have a solution, I'll update you.
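
For anyone who wants to reproduce the slowdown without the original note, a synthetic input of the same shape may be enough. This is a minimal sketch under assumptions: `buildSingleColumnTable` and `measureScaling` are hypothetical names, the cell text is made up, and the `convert` argument stands for whatever HTML-to-Markdown function is under test.

```typescript
// Builds an HTML table with one column and `rows` rows. The cell contents are
// invented; only the shape (a very long single-column table) matters here.
function buildSingleColumnTable(rows: number): string {
  const cells: string[] = [];
  for (let i = 0; i < rows; i++) {
    cells.push(`<tr><td>gene-${i}</td></tr>`);
  }
  return `<table><tbody>${cells.join("")}</tbody></table>`;
}

// Runs the given converter on tables of increasing size and records how long
// each run takes in milliseconds.
function measureScaling(
  convert: (html: string) => string,
  sizes: number[]
): { rows: number; ms: number }[] {
  return sizes.map((rows) => {
    const html = buildSingleColumnTable(rows);
    const start = Date.now();
    convert(html);
    return { rows, ms: Date.now() - start };
  });
}
```

With turndown installed, the converter could be passed as `(html) => new TurndownService().turndown(html)`; how Yarle itself configures the converter is not shown in this thread. If the time grows much faster than the row count (for example, roughly four times longer for twice the rows), that points to non-linear behaviour in the conversion step.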

nikiluk commented 3 years ago

I can confirm that some notes take significantly longer to convert than others. For example, this note took almost 10 minutes to convert. (Screenshot: 2021-01-11 23 58 34 npm)

akosbalasko commented 3 years ago

@nikiluk, does this note have any exceptional properties? I mean, does it contain many attachments, large amounts of text, or anything like that?
And the conversion did finish, right?

nikiluk commented 3 years ago

Hi @akosbalasko, yes, it finished. I made a small addition to find the outliers in terms of conversion time. I can also commit my couple of changed lines, so that it is easier to identify issues.

Finished conversion: Wed Jan 13 2021 17:05:55 GMT+0200 (Eastern European Standard Time)
Note "coach.pocketconfidant.com | Website Review for coach.pocketconfidant.com | WooRank.com" converted successfully in 418.754 seconds.
Notes processed: 4
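
The actual change is not shown in this thread, but per-note timing of this kind could be sketched roughly as follows. All names here (`convertWithTiming`, `slowest`, `TimedResult`) are hypothetical, and `convertNote` stands for whatever function converts a single note. Logging the title before the conversion starts also shows which note is in progress if the run appears frozen.

```typescript
interface TimedResult {
  title: string;
  seconds: number;
}

// Converts notes one by one and records how long each conversion takes.
// `convertNote` is a placeholder for the real per-note conversion function.
function convertWithTiming<T extends { title: string }>(
  notes: T[],
  convertNote: (note: T) => void
): TimedResult[] {
  const results: TimedResult[] = [];
  for (const note of notes) {
    // Logged up front so a stalled run still reveals the note being processed.
    console.log(`Converting note "${note.title}"...`);
    const start = Date.now();
    convertNote(note);
    const seconds = (Date.now() - start) / 1000;
    console.log(`Note "${note.title}" converted in ${seconds} seconds.`);
    results.push({ title: note.title, seconds });
  }
  return results;
}

// Returns the `count` slowest conversions, slowest first, without mutating the input.
function slowest(results: TimedResult[], count: number): TimedResult[] {
  return results.slice().sort((a, b) => b.seconds - a.seconds).slice(0, count);
}
```

Calling `slowest(results, 5)` after a run would list the five notes worth inspecting first.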

akosbalasko commented 3 years ago

Hi @nikiluk !

Thank you for your tests; it takes more time on my machine :) As I see it, the problem lies in parsing the HTML and converting it to Markdown. I use turndown for this. I see a discussion about performance on their side, but I'm not sure they will fix it soon, so I'm thinking of replacing turndown with something similar but faster.