Closed islamtg closed 3 years ago
After changing the version of the cluster to use spark 2.4 I see that I still get issue with trying to create a new manifest.cdm.json or reading an exisitng manifest.cdm.json I now get this issue -1 error code: null error message: InvalidAbfsRestOperationExceptionjava.net.UnknownHostException: https
Py4JJavaError Traceback (most recent call last)
Hi, you need to give Storage Blob Data Contributer access to the identity.
@srichetar The account already has storage blob data contributor access to the identity.
Please email asksparkcdm@microsoft.com if you are still facing the issue.
I faced this issue when I was using spark-cdm connector 0.19.1 with databricks runtime 8.x - they are incompatible with each other, I started using databricks 6.4 which fixed this issue.
databricks 6.4 which fixed this issue.
I am following this example here and getting the following error when I run this portion
Creates the CDM manifest and adds the entity to it with gzip'd parquet partitions
with both physical and logical entity definitions
(df.write.format("com.microsoft.cdm") .option("storage", StorageAccount) .option("manifestPath", "/powerbi/adlsgen2isleghaz/covid19datasetmlDataset/default.manifest.cdm.json") .option("entity", "TestEntity") .option("format", "parquet") .option("compression", "gzip") .save())
java.lang.NoClassDefFoundError: org/apache/spark/sql/sources/v2/ReadSupport
Py4JJavaError Traceback (most recent call last)