Closed arunbhat closed 5 months ago
Hey @arunbhat ah right yes we assume no special characters such as whitespace
Let me just quickly try your suggested change and see what happens with the circleCI checks... https://github.com/projectglow/glow/pull/475
Yes I expect other places in the codebase will need adjusting and unit tests will have to be written.
For now is it possible to use underscores instead of spaces for paths?
For now is it possible to use underscores instead of spaces for paths?
Thanks @williambrandler. Unfortunately the path is not always under our control (our customers use it). And yes for now we have suggested the workaround proposed by you
good news is the change you suggested doesn't break any of the tests!
However, yes it does look like there are a bunch of places in the codebase where we will need this fix.
I just wonder if implementing the fix may cause more harm than good. It will inevitably cause problems when using command line tools in linux on the same files.
Another option is to put a tip in the docs not to use special characters.
Resolved
This issue is similar to the issues reported in SPARK-21996 and SPARK-23148. Would this line need to be modified to
val hPath = new Path(new URI(path))
? (probably also other places too. see the changes for CSV and JSON datasources in this PRSimple code that fails is below. Note that reading a csv file from the same path works