visualcurrent / Notion-2-Obsidan

Conversion routines to convert all Notion .md exports to full Obsidian compatibility

Bad work with Cyrillic symbols #7

Closed MrModest closed 3 years ago

MrModest commented 3 years ago

Thanks for this useful script!

However, it has trouble with Cyrillic characters. Could you fix this, please?

Given: a folder with a list of pages, some of which have Cyrillic characters in their names.

image

After running the script, I got this in the folder's page:

b'[[\xd0\x98\xd1\x81\xd1\x81\xd0\xbb\xd0\xb5\xd0\xb4\xd0\xbe\xd0\xb2\xd0\xb0\xd0\xbd\xd0\xb8\xd0\xb5 \xd0\xbd\xd0\xb0 \xd1\x82\xd0\xb5\xd0\xbc\xd1\x83 \xd0\xb2\xd0\xbe\xd0\xb7\xd0\xbc\xd0\xbe\xd0\xb6\xd0\xbd\xd1\x8b\xd1\x85 \xd0\xbf\xd1\x80\xd0\xbe\xd1\x84\xd0\xb5\xd1\x81\xd1\x81\xd0\xb8\xd0\xb9]]'
b'[[\xd0\x9e\xd0\xb1\xd0\xbc\xd0\xb0\xd0\xbd\xd0\xb8 \xd0\xb2\xd1\x81\xd0\xb5\xd1\x85 2 1]]'
b'[[\xd0\x98\xd0\xb4\xd0\xb5\xd1\x8f \xd0\xb4\xd0\xbb\xd1\x8f \xd0\xb7\xd0\xb8\xd0\xbc\xd0\xbd\xd0\xb5\xd0\xb9 \xd0\xb4\xd0\xb2\xd0\xb8\xd0\xb6\xd1\x83\xd1\x85\xd0\xb8]]'
b'[[\xd0\x98\xd0\xb4\xd0\xb5\xd1\x8f \xd0\xb4\xd0\xbb\xd1\x8f \xd0\xbe\xd1\x80\xd0\xb3\xd0\xb0\xd0\xbd\xd0\xb8\xd0\xb7\xd0\xb0\xd1\x86\xd0\xb8\xd0\xb8]]'
b'[[\xd0\x9f\xd0\xb5\xd1\x82\xd0\xb0\xd1\x80\xd0\xb4\xd1\x8b \xd0\xbd\xd0\xb0 \xd1\x81\xd0\xb0\xd0\xbc\xd0\xbe\xd0\xbb\xd1\x91\xd1\x82\xd0\xb8\xd0\xba\xd0\xb0\xd1\x85]]'
b'[[\xd0\x90\xd0\xbd\xd1\x82\xd0\xb8\xd0\xba\xd0\xb0\xd1\x84\xd0\xb5 FriendZone]]'
b'[[\xd0\xa1\xd0\xbb\xd0\xb5\xd1\x82\xd0\xb0\xd1\x82\xd1\x8c \xd0\xb2 \xd0\xaf\xd0\xbf\xd0\xbe\xd0\xbd\xd0\xb8\xd1\x8e \xd0\xb2 \xd0\xbf\xd0\xb5\xd1\x80\xd0\xb8\xd0\xbe\xd0\xb4 \xd1\x86\xd0\xb2\xd0\xb5\xd1\x82\xd0\xb5\xd0\xbd\xd0\xb8\xd1\x8f \xd1\x81\xd0\xb0\xd0\xba\xd1\x83\xd1\x80\xd1\x8b]]'

Here is what a decoder gave me (urlencoded -> UTF-8, using http://www.online-decoder.com/):

b'[[Исследование на тему возможных профессий]]'
b'[[Обмани всех 2 1]]'
b'[[Идея для зимней движухи]]'
b'[[Идея для организации]]'
b'[[Петарды на самолётиках]]'
b'[[Антикафе FriendZone]]'
b'[[Слетать в Японию в период цветения сакуры]]' 
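
For reference, the `b'...'` wrapper and the `\xd0\x98` escapes look like the text form of a Python bytes object rather than URL encoding. That would suggest the link text is encoded to UTF-8 and then turned into a string without being decoded. The script's source is not shown in this thread, so the following is only a minimal sketch of that assumption; `link` and `recover_text` are hypothetical names, not taken from the script.

```python
import ast

# Hypothetical stand-in for one wiki-link the script would write.
link = "[[Идея для организации]]"

encoded = link.encode("utf-8")   # bytes, e.g. b'[[\xd0\x98...'
broken = str(encoded)            # the text "b'[[\\xd0\\x98...]]'" seen above
fixed = encoded.decode("utf-8")  # decoding restores the readable link


def recover_text(broken_line: str) -> str:
    """Turn an already written "b'...'" line back into readable text.

    Assumes the line is the repr of UTF-8 encoded bytes.
    """
    raw = ast.literal_eval(broken_line.strip())
    return raw.decode("utf-8") if isinstance(raw, bytes) else str(raw)


print(broken)
print(fixed)
print(recover_text(broken))
```

If that assumption holds, decoding before writing should avoid the problem, and `recover_text` could repair pages that were already converted.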

But the files in the folder have normal names:

image

Also:

The image path in the page contains the URL-encoded name of the page, so the path is broken. If I delete that part and leave only the image name, the image displays correctly.

image

%D0%9C%D1%83%D0%B6%D1%81%D0%BA%D0%B0%D1%8F %D1%87%D0%B5%D1%80%D0%BD%D0%B0%D1%8F %D1%82%D0%BE%D0%BB%D1%81%D1%82%D0%BE%D0%B2%D0%BA%D0%B0 %D0%BD%D0%B0 %D0%BC%D0%BE%D0%BB%D0%BD%D0%B8%D0%B8

urlencoded -> UTF-8

Мужская черная толстовка на молнии
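
The same decoding can be done in Python with the standard library's `urllib.parse.unquote`. This is a minimal sketch only; `decode_link_target` is a hypothetical helper, and whether the script should rewrite image paths this way is an assumption.

```python
from urllib.parse import unquote


def decode_link_target(target: str) -> str:
    """Percent-decode a link target, interpreting the bytes as UTF-8."""
    return unquote(target, encoding="utf-8")


# Folder name exactly as it appears in the broken image path above.
encoded_name = (
    "%D0%9C%D1%83%D0%B6%D1%81%D0%BA%D0%B0%D1%8F "
    "%D1%87%D0%B5%D1%80%D0%BD%D0%B0%D1%8F "
    "%D1%82%D0%BE%D0%BB%D1%81%D1%82%D0%BE%D0%B2%D0%BA%D0%B0 "
    "%D0%BD%D0%B0 "
    "%D0%BC%D0%BE%D0%BB%D0%BD%D0%B8%D0%B8"
)

print(decode_link_target(encoded_name))
```
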

MrModest commented 3 years ago

I also looked at the Notion export files.

Ideas 994a2b4738c2434faf4756dbe12cffa8.csv:

Name,Created,IsNotActual,Tags,Updated
Исследование на тему возможных профессий,"Sep 11, 2020 10:17 AM",No,,"Sep 11, 2020 10:19 AM"
Обмани всех 2.1,"May 1, 2020 7:09 PM",No,game,"May 1, 2020 7:10 PM"
Идея для зимней движухи,"Apr 17, 2020 11:08 PM",No,,"Apr 17, 2020 11:08 PM"
Идея для организации,"Apr 17, 2020 11:06 PM",No,,"Apr 17, 2020 11:06 PM"
Петарды на самолётиках,"Oct 3, 2020 5:27 PM",No,,"Oct 3, 2020 5:27 PM"
"Антикафе ""FriendZone""","Nov 22, 2020 1:15 PM",No,,"Nov 22, 2020 1:17 PM"
Слетать в Японию в период цветения сакуры,"Dec 7, 2020 8:47 PM",No,,"Dec 7, 2020 8:49 PM"

It looks like it is in UTF-8: image

And the folder content: image
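
One way to confirm the encoding without a screenshot is to try decoding the raw bytes as UTF-8. A minimal sketch, where `is_utf8` is a hypothetical helper and the sample bytes stand in for the contents of an export file:

```python
def is_utf8(data: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


# Stand-in for bytes read from an export file with open(path, "rb").read().
sample = "Идея для организации".encode("utf-8")
print(is_utf8(sample))
```
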

sithamet commented 3 years ago

Joining the request to fix this. I'm having the same issue.

skorphil commented 3 years ago

Same here, still no fix