Closed jinmu0410 closed 1 year ago
excel path like hdfs://lake-node1:8020/jinmu/test/test_simple.xlsx
What versions of Spark and Spline Agent were used?
@cerveada 1.0
spark 3.3.1
like file:///Users/jinmu/Downloads/test_simple.xlsx is ok! but hdfs://..... is error
That is what I thought, I will try to simulate the issue and fix this.
thanks
@jinmu0410 I was able to reproduce the issue. Unfortunately, the needed url is an arg of some lambda expression and I don't know how to extract it. I would need more time to do it, which I don't have now.
But, spark-excel also supports Sparks's data source V2 which should work out of the box. I added some test and even test it on hdfs, and it was working fine. So I recommend using DSV2 and that should fix the lineage issues as well.
see: https://github.com/crealytics/spark-excel#excel-api-based-on-datasourcev2
@jinmu0410 I was able to reproduce the issue. Unfortunately, the needed url is an arg of some lambda expression and I don't know how to extract it. I would need more time to do it, which I don't have now.
But, spark-excel also supports Sparks's data source V2 which should work out of the box. I added some test and even test it on hdfs, and it was working fine. So I recommend using DSV2 and that should fix the lineage issues as well.
see: https://github.com/crealytics/spark-excel#excel-api-based-on-datasourcev2
ok thank you i will try