Open GoogleCodeExporter opened 8 years ago
For the sake of simplicity, run this foo.rb instead.
(Can't "edit post" here, :/)
Original comment by loliloli...@gmail.com
on 16 Oct 2010 at 2:25
Attachments:
I've attached a _temporary_ fix, in case anyone is need.
BUT, I highly doubt this force_encoding fix should have been employed at all...
Again, I'd like to ask: What's the problem with "# encoding: utf-8" exactly?
Original comment by loliloli...@gmail.com
on 16 Oct 2010 at 2:41
Attachments:
This has probably to do with the fact that the encoding in freedb is not UTF-8
compliant. While refactoring rubyripper, I discovered a problem where the
Freedb source is coded in ISO-8859-1. Rubyripper will falsely treat this as
UTF-8 and the result is the mess you're seeing.
The ISO-8859-1 case is already fixed, but perhaps there is another encoding
here. You can help by providing the discid for this disc. The result from
cd-discid or discid for this disc will do great ;) I can write a test to ensure
the encoding will be fine from now on.
Original comment by boukewou...@gmail.com
on 30 Nov 2010 at 8:17
[deleted comment]
Sorry for the delay.
The discid is A70C310C.
I see from the web search the encoding is the same garbage. :/
Original comment by loliloli...@gmail.com
on 19 Dec 2010 at 7:12
I saw the issue now. The freedb server is serving the file as
"charset=iso-8859-1" while it's actually GB18030. e.g.
$ curl http://www.freedb.org/freedb/data/a70c310c | iconv -f GB18030 -t UTF8 | less
Original comment by loliloli...@gmail.com
on 19 Dec 2010 at 7:16
Technical this is a freedb error, since only UTF-8 records are allowed in the
spec. Older files may still be encoded as ISO-8859-1. Anything else is a buggy
record. But I guess users just want to have a fix ;)
Original comment by boukewou...@gmail.com
on 19 Dec 2010 at 8:02
I had the same bug, but the fix utf-8-fix.diff correct the bug for French audio
CD.
Original comment by mayeu....@gmail.com
on 25 Dec 2010 at 11:42
I have the same problem with, rubyripper 0.6.2, flac disabled, a german cd with
umlauts (ü) and the freedb gateway from musicbrainz. First I got
Should all tracks be ripped ? (y/n) [y]
Tracks to rip are 1 2 3 4
/usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:1471:in `expand_path': incompatible
character encodings: ASCII-8BIT and UTF-8 (Encoding::CompatibilityError)
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:1471:in `giveDir'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:1447:in `block in setDirectory'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:1445:in `each'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:1445:in `setDirectory'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:1385:in `initialize'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:2528:in `new'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:2528:in `settingsOk'
from /usr/bin/rrip_cli:410:in `prepareRip'
from /usr/bin/rrip_cli:331:in `showFreedbOptions'
from /usr/bin/rrip_cli:305:in `showFreedb'
from /usr/bin/rrip_cli:265:in `handleFreedb'
from /usr/bin/rrip_cli:244:in `get_cd_info'
from /usr/bin/rrip_cli:47:in `initialize'
from /usr/bin/rrip_cli:486:in `new'
from /usr/bin/rrip_cli:486:in `<main>'
then I restarted rrip_cli and manually entered all titles with umlauts, which
resulted in
01 - Türlich, türlich (sicher, Dicker)
02 - Ich heb mich ab
03 - Nur der Zorn zählt
04 - Session mit Don Dougie (1998)
ADVANCED TOC ANALYSIS (with cdrdao)
...please be patient, this may take a while
/usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:1726:in `join': incompatible character
encodings: UTF-8 and ASCII-8BIT (Encoding::CompatibilityError)
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:1726:in `getFile'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:1016:in `writeFileLine'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:1005:in `block in createCuesheet'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:981:in `each'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:981:in `createCuesheet'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:958:in `block in allCodecs'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:954:in `each'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:954:in `allCodecs'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:950:in `initialize'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:511:in `new'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:511:in `updateSettings'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:2565:in `waitForToc'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:2537:in `startRip'
from /usr/bin/rrip_cli:412:in `prepareRip'
from /usr/bin/rrip_cli:331:in `showFreedbOptions'
from /usr/bin/rrip_cli:305:in `showFreedb'
from /usr/bin/rrip_cli:397:in `editTrackInfo'
from /usr/bin/rrip_cli:333:in `showFreedbOptions'
from /usr/bin/rrip_cli:305:in `showFreedb'
from /usr/bin/rrip_cli:363:in `editDiscInfo'
from /usr/bin/rrip_cli:332:in `showFreedbOptions'
from /usr/bin/rrip_cli:305:in `showFreedb'
from /usr/bin/rrip_cli:265:in `handleFreedb'
from /usr/bin/rrip_cli:244:in `get_cd_info'
from /usr/bin/rrip_cli:169:in `edit_settings'
from /usr/bin/rrip_cli:45:in `initialize'
from /usr/bin/rrip_cli:486:in `new'
from /usr/bin/rrip_cli:486:in `<main>'
After restarting rrip again it goes until after the TOC analysis
ADVANCED TOC ANALYSIS (with cdrdao)
...please be patient, this may take a while
/usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:1726:in `join': incompatible character
encodings: UTF-8 and ASCII-8BIT (Encoding::CompatibilityError)
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:1726:in `getFile'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:1016:in `writeFileLine'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:1005:in `block in createCuesheet'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:981:in `each'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:981:in `createCuesheet'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:958:in `block in allCodecs'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:954:in `each'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:954:in `allCodecs'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:950:in `initialize'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:511:in `new'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:511:in `updateSettings'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:2565:in `waitForToc'
from /usr/lib64/ruby/site_ruby/1.8/rr_lib.rb:2537:in `startRip'
from /usr/bin/rrip_cli:427:in `dir_exists'
from /usr/bin/rrip_cli:447:in `update'
from /usr/bin/rrip_cli:414:in `prepareRip'
from /usr/bin/rrip_cli:331:in `showFreedbOptions'
from /usr/bin/rrip_cli:305:in `showFreedb'
from /usr/bin/rrip_cli:265:in `handleFreedb'
from /usr/bin/rrip_cli:244:in `get_cd_info'
from /usr/bin/rrip_cli:47:in `initialize'
from /usr/bin/rrip_cli:486:in `new'
from /usr/bin/rrip_cli:486:in `<main>'
Original comment by fschm...@gmail.com
on 10 Sep 2012 at 10:29
I've verified that this issue still exists as of today (2014-02-17). I've
fixed the issue for all non-EOL rubies (1.9, 2.0 and 2.1 tested). The change
is pretty simple, just two lines. The fix can be seen on my clone. I've
ripped several CDs with diacritical marks (e.g.
http://www.freedb.org/freedb/rock/8f0e690d) with expected results.
Original comment by t...@wuest.me
on 17 Feb 2014 at 11:33
A similar crash occurs ripping a home-made disk for which NO FREEDB ENTRY
EXISTS. So, at least in my case, encoding of data from freedb may not have
anything to do with the problem.
I typed track names into the gui, using some non-ascii Unicode characters
entered with Linux compose-key sequences (e.g. ç ő á ó é).
(rrip_gui 0.6.2 - the latest in the GetDb repo for Ubuntu 15.04)
/usr/lib/ruby/1.8/rr_lib.rb:1659:in `gsub!': incompatible encoding regexp match
(UTF-8 regexp with ASCII-8BIT string) (Encoding::CompatibilityError)
from /usr/lib/ruby/1.8/rr_lib.rb:1659:in `allFilter'
from /usr/lib/ruby/1.8/rr_lib.rb:1636:in `tagFilter'
from /usr/lib/ruby/1.8/rr_lib.rb:1597:in `block in setMetadata'
from /usr/lib/ruby/1.8/rr_lib.rb:1596:in `times'
from /usr/lib/ruby/1.8/rr_lib.rb:1596:in `setMetadata'
from /usr/lib/ruby/1.8/rr_lib.rb:1479:in `attemptDirCreation'
from /usr/lib/ruby/1.8/rr_lib.rb:1685:in `overwriteDir'
from /usr/lib/ruby/1.8/rr_lib.rb:2661:in `overwriteDir'
from /usr/bin/rrip_cli:428:in `dir_exists'
from /usr/bin/rrip_cli:447:in `update'
from /usr/bin/rrip_cli:414:in `prepareRip'
from /usr/bin/rrip_cli:331:in `showFreedbOptions'
from /usr/bin/rrip_cli:305:in `showFreedb'
from /usr/bin/rrip_cli:265:in `handleFreedb'
from /usr/bin/rrip_cli:244:in `get_cd_info'
from /usr/bin/rrip_cli:47:in `initialize'
from /usr/bin/rrip_cli:486:in `new'
from /usr/bin/rrip_cli:486:in `<main>'
Original comment by jim.av...@gmail.com
on 14 Jul 2015 at 12:59
P.S. I think this is a regression. I've typed non-ASCII characters into
rrip_gui track names many times in the past. This is the first time using this
version, though.
Original comment by jim.av...@gmail.com
on 14 Jul 2015 at 1:01
Original issue reported on code.google.com by
loliloli...@gmail.com
on 16 Oct 2010 at 2:14