cjnaz / rclonesync-V2

A Bidirectional Cloud Sync Utility using rclone
MIT License

Exception in load_list loading due to non-UTF8 filenames #62

Open petoetje opened 4 years ago

petoetje commented 4 years ago

I get (replaced my real name with XXX):

2020-09-13 21:26:27,302: Exception in load_list loading /home/christo/.rclonesyncwd/LSL_google_christophe.XXX@gmail.com__Path2_NEW: <'utf-8' codec can't decode byte 0xe9 in position 5908: invalid continuation byte>
Line # 9922: 2689437 2020-09-12 23:43:27.922891265 photo/Goele/Goele_2015_06_28/Goele_2015_06_28 (97).JPG

2020-09-13 21:26:27,303: ERROR Failed loading current Path2 list file /home/christo/.rclonesyncwd/LSL_google_christophe.XXX@gmail.com__Path2_NEW -

This is with the version from trunk.

cjnaz commented 4 years ago

Please run rclonesync with --verbose and post the output. Are you trying to sync to your email address? Is that a configured remote in rclone? Does rclone lsl <that remote> show you your files?

<'utf-8' codec can't decode byte 0xe9 in position 5908: invalid continuation byte>
Line # 9922: 2689437 2020-09-12 23:43:27.922891265 photo/Goele/Goele_2015_06_28/Goele_2015_06_28 (97).JPG

This looks like a locale problem. rclonesync is hard-coded to UTF-8. What is the locale of your system?
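
For reference, the error text in the log is what Python reports when a byte string contains 0xe9 followed by a plain ASCII byte: 0xe9 is "é" in Latin-1, but in UTF-8 it opens a three-byte sequence. The following is a minimal sketch of that behaviour, not rclonesync code; the file name `Zoé_2015.JPG` is a made-up example, not the file from the log.

```python
import locale

# Hypothetical file name, encoded as Latin-1: "é" becomes the single byte 0xE9.
raw_name = "Zoé_2015.JPG".encode("latin-1")

try:
    raw_name.decode("utf-8")
    error_text = None
except UnicodeDecodeError as exc:
    # 0xE9 announces a multi-byte UTF-8 sequence, but "_" is not a continuation byte.
    error_text = str(exc)

print(error_text)

# The same bytes decode cleanly once the encoding they were written in is used.
print(raw_name.decode("latin-1"))

# Encoding that Python derives from the current locale settings.
print(locale.getpreferredencoding(False))
```

If the last line prints something other than UTF-8, or if the files were created on a system that used a Latin-1 style encoding, a mismatch like the one in the log is plausible.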

petoetje commented 4 years ago

2020-09-14 09:34:29,908: ERROR Failed loading current Path2 list file /home/christo/.rclonesyncwd/LSL_google_christophe.XXX@gmail.com__Path2_NEW -
2020-09-14 09:34:30,042: Lock file removed: /tmp/rclonesync_LOCK_google_christophe.XXX@gmail.com_
2020-09-14 09:34:30,042: Error Abort. Try running rclonesync again.

cjnaz commented 4 years ago

rclone is finding and replacing invalid UTF-8 characters. I assume that the resulting LSL file therefore contains modified file names that differ from the actual files on disk. However, when rclonesync tries to load the LSL file, it also fails on invalid UTF-8 characters, so maybe rclone isn't actually changing the characters and the resulting LSL file is not valid UTF-8.

Are there a lot of these errors, or just the two?

If you are willing, please upload the entire console output from an rclonesync google: christophe.XXX@gmail.com --verbose --verbose --rc-verbose --rc-verbose run (with --first-sync, I assume, plus whatever other switches you are using). Upload to https://drive.google.com/drive/folders/1FuHvtoezlesiK4btn0Jr8yhi4VQQ1xOr?usp=sharing. I will delete the files once received. Even better, create a directory with a few of the problem files and rclonesync just that folder (with the verbose switches), then upload that console log, the created LSL _Path1 and _Path2 files, and the directory itself.
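
To find candidate problem files for such a test directory, one option is to walk the local tree with byte paths and report names that do not decode as UTF-8. This is only a sketch under the assumption of a Linux-style filesystem that stores names as raw bytes; `is_valid_utf8` and `find_bad_names` are hypothetical helper names, not part of rclone or rclonesync.

```python
import os


def is_valid_utf8(name: bytes) -> bool:
    """Return True if the raw file name bytes decode as UTF-8."""
    try:
        name.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def find_bad_names(root: bytes) -> list:
    """Walk root (given as bytes) and collect paths whose names are not valid UTF-8."""
    bad = []
    # Passing bytes to os.walk makes it yield bytes, so names are never decoded for us.
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            if not is_valid_utf8(name):
                bad.append(os.path.join(dirpath, name))
    return bad
```

Calling `find_bad_names(os.fsencode("/path/to/photo"))` returns the offending paths as bytes, so printing them shows the invalid bytes as escapes such as `\xe9`.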

rclonesync cannot just ignore files that are problems without breaking the integrity of the sync. Your options include

petoetje commented 4 years ago

cjnaz commented 4 years ago

Thanks. In a future release I may be able to gracefully and safely ignore invalid filenames with just a warning message. Probably with a switch to enable this behavior.
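
As an illustration of what such a switch could look like on the reading side, here is a rough sketch that reads a list file in binary mode and decodes it line by line. It is not the rclonesync implementation: the function name `load_lines` and the `skip_invalid` parameter are assumptions made for this example.

```python
import logging


def load_lines(path, skip_invalid=False):
    """Read a list file, decoding each line as UTF-8.

    With skip_invalid=False the first undecodable line raises UnicodeDecodeError.
    With skip_invalid=True such lines are logged as warnings and left out.
    """
    lines = []
    with open(path, "rb") as f:
        for lineno, raw in enumerate(f, start=1):
            try:
                lines.append(raw.decode("utf-8").rstrip("\n"))
            except UnicodeDecodeError as exc:
                if not skip_invalid:
                    raise
                logging.warning("Skipping line %d of %s: %s", lineno, path, exc)
    return lines
```

Skipping a line only hides the file from the list. To keep the sync consistent, the same file would presumably also have to be excluded from the comparison on both paths, which this sketch does not attempt.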

I'll leave this issue open.