ddavisqa / google-refine

Automatically exported from code.google.com/p/google-refine

Importing attached file strips backslashes #121

GoogleCodeExporter closed this issue 8 years ago

GoogleCodeExporter commented 8 years ago
What steps will reproduce the problem?

1. Create a new project by importing the attached file (there's only one line 
in it).
2. Inspect the imported JSON columns.  One was escaped using the "html" 
encoder; the other is raw JSON.

What is the expected output? What do you see instead?

Embedded newline escapes (\n) and Unicode escapes (e.g. \u201c) in both 
columns are no longer intact; the backslashes are stripped.

If you use an encoding that also encodes backslashes, such as "url", it works 
fine.  (Something I wish I had thought of earlier...)
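
A minimal sketch of why the "url" workaround holds up, written in Python rather than against Refine itself. The strip_backslashes function is a hypothetical stand-in for whatever the importer was doing, and the sample JSON value is made up for illustration; neither comes from the attached file.

```python
import json
from urllib.parse import quote, unquote


def strip_backslashes(text):
    """Hypothetical stand-in for the faulty import step (not Refine's code)."""
    return text.replace("\\", "")


# json.dumps escapes non-ASCII characters and newlines by default, so the
# serialized text contains literal backslash sequences such as \u201c and \n.
raw_json = json.dumps({"quote": "\u201cHello\nWorld\u201d"})

# Importing the raw JSON loses the backslashes, so the escapes are corrupted.
damaged = strip_backslashes(raw_json)

# URL encoding rewrites every backslash as %5C, so there is nothing to strip.
encoded = quote(raw_json, safe="")
survived = unquote(strip_backslashes(encoded))

print(damaged)
print(encoded)
print(survived == raw_json)
```

The same reasoning applies to any encoding that rewrites the backslash itself; an encoder that leaves backslashes in place gives no such protection.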

This problem does not appear to occur with simpler files with embedded 
backslashes; I created a very simple file:

Foo,Bar
Hello\nWorld,3

and it worked fine.  
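
The check on the simple file can be sketched outside Refine with Python's csv module as a stand-in parser. This only illustrates how parser settings decide whether a backslash survives; the read_rows helper is hypothetical, and treating the backslash as an escape character is an assumed example of how such a symptom can arise, not the confirmed cause of this issue.

```python
import csv
import io

# The two-line file from the report; "\\n" is a literal backslash followed by n.
SIMPLE_FILE = "Foo,Bar\nHello\\nWorld,3\n"


def read_rows(text, **reader_options):
    """Parse CSV text into a list of rows (hypothetical helper)."""
    return list(csv.reader(io.StringIO(text), **reader_options))


# With default settings the backslash is ordinary data and is preserved.
default_rows = read_rows(SIMPLE_FILE)

# If the parser treats the backslash as an escape character, it is consumed
# and only the character after it is kept.
escaped_rows = read_rows(SIMPLE_FILE, escapechar="\\")

print(default_rows[1][0])
print(escaped_rows[1][0])
```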

What version of the product are you using? On what operating system?

Trunk, Windows 7.

Original issue reported on code.google.com by bil...@gmail.com on 2 Sep 2010 at 6:35

GoogleCodeExporter commented 8 years ago
Fixed by r1369.

Original comment by dfhu...@google.com on 28 Sep 2010 at 3:36

GoogleCodeExporter commented 8 years ago

Original comment by tfmorris on 18 Sep 2012 at 2:58