databricks / spark-xml

XML data source for Spark SQL and DataFrames
Apache License 2.0
499 stars 226 forks source link

Can't import XML file #647

Closed sanyam-dev closed 1 year ago

sanyam-dev commented 1 year ago

Exception in thread "main" java.lang.ClassNotFoundException: Failed to find data source: xml. Please find packages at http://spark.apache.org/third-party-projects.html

this.spark = SparkSession.builder().appName(this.appName).master(this.master).getOrCreate();

Dataset<Row> ds = spark.read().
                                option("rootTag", "persons").
                option("rowTag", "person").
                format("xml").
                load(testPathString);

in my pom.xml:

    <dependency>
        <groupId>com.databricks</groupId>
        <artifactId>spark-xml_2.12</artifactId>
        <version>0.14.0</version>
    </dependency>
srowen commented 1 year ago

It isn't part of your application. I bet you didn't make a JAR that bundles its dependencies.