issues
search
databricks
/
spark-xml
XML data source for Spark SQL and DataFrames
Apache License 2.0
500
stars
226
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
I create a spark datasource table, but failed
#690
MyqueWooMiddo
opened
3 weeks ago
0
Fix Scala ArrayBuffer cast to Spark ArrayData for Scala 2.13
#689
hipp0gryph
opened
1 month ago
3
Problem use Scala functions in python. Scala error on Spark 3.3.2. Scala Version 2.13.8.
#688
hipp0gryph
opened
1 month ago
1
<xs:choice maxOccurs="unbounded"> does not produce array type
#687
marpetr
opened
2 months ago
3
Error using from_xml with StructType for schema
#686
ianepreston
closed
3 months ago
2
The problem with the case of words for identical names
#685
hipp0gryph
opened
3 months ago
3
Found duplicates, but no duplicates into files
#684
hipp0gryph
closed
4 months ago
2
Empty line between tags when writing xml
#683
sarg90
closed
4 months ago
1
Getting Multi Generator issue while flattening the XML file
#682
a-sameer18
closed
4 months ago
1
Caused by: java.io.InvalidClassException: com.databricks.spark.xml.XmlOptions; local class incompatible
#681
gc-avanade
opened
5 months ago
2
Update for 0.18.0, move CICD configs to supported Spark versions
#680
srowen
closed
5 months ago
0
Fix for xml expression to not parse arbitrary strings
#679
xanderbailey
closed
5 months ago
6
Wrapping elements of nested array
#678
vitaliyb-adorama
closed
6 months ago
1
NumPartitions == num files. Can I choose partitions manually?
#677
hipp0gryph
closed
6 months ago
1
Remove New Line coming in between records during spark write dataframe to XML
#676
avinashpandu
closed
6 months ago
5
XSDToSchema fails on choice of sequence
#675
iWantToKeepAnon
closed
6 months ago
2
Add notes about file extensions and _corrupt_record to documentation
#674
dolfinus
closed
8 months ago
0
parse XML without the default AttributePrefix "_" in PySpark
#673
schneifejan
closed
9 months ago
2
Splitting XML into single-column rows
#672
rjrudin
closed
9 months ago
9
Incorrect inferring schema if ignoreNamespace is true and namespace = tag
#671
hipp0gryph
closed
6 months ago
4
ignoreSurroundingSpaces not working - Pyspark
#670
DeemoONeill
closed
9 months ago
8
Convert xml to dataframe based on pyspark - using rowValidationXSDPath
#669
yu-tracy
closed
10 months ago
4
Extract multiple tables from the same XML file
#668
vwiencek
closed
11 months ago
1
Reading file with ContentType application/octet-stream
#667
mahmoud-masmoudi-dev
closed
10 months ago
3
Azure Synapse Spark 3.3 Runtime : spark-xml fails on writing xml
#666
thinh-ngu
closed
11 months ago
1
Use defined timezone on write for formats that need TZ info
#665
srowen
closed
11 months ago
0
Generated files does not have .xml extension
#664
dolfinus
closed
11 months ago
2
Cannot write dataframe with custom timestampFormat
#663
dolfinus
closed
11 months ago
7
Timestamps not matching format are replaced with nulls
#662
dolfinus
closed
11 months ago
2
Failed to find data source: xml.
#661
luisenriqueramos1977
closed
11 months ago
3
Shortcut common type inference cases to fail fast, speed up inference
#660
srowen
closed
1 year ago
1
Update to test vs Spark 3.4, and tested Spark/Scala/Java configs
#659
srowen
closed
1 year ago
0
Vulnerabilities from dependencies: CVE-2023-22946
#658
sasauz
closed
1 year ago
1
restrict access in hive meta store tables with Unity Catalog single user cluster
#657
ChackoSmitha
closed
1 year ago
2
Problem with extra line breaks inside tags during writing XML file
#656
VladIsLuve
closed
1 year ago
1
Problem with reading cp1251 file
#655
VladIsLuve
closed
1 year ago
7
Note plan to merge spark-xml to Apache Spark 4.0
#654
srowen
closed
1 year ago
0
Document that spark-xml is in maintenance mode
#653
HyukjinKwon
closed
1 year ago
0
strange tag while writing xml with nullValue
#652
groneveld
closed
1 year ago
8
Attribute values of nested fields are lost if option "attributePrefix" has empty value
#651
voban
closed
11 months ago
3
spark.sql.session.timeZone not taken into account while reading XML
#650
BaptistePiron
closed
1 year ago
3
"hidden" _metadata column is not identifying for the XML input file format
#648
ChackoSmitha
closed
1 year ago
3
Can't import XML file
#647
sanyam-dev
closed
1 year ago
1
Using spark-xml to parse nested xml structure in jupyter notebook
#646
Xabitsuki
closed
1 year ago
2
Reader can't read XML file if the rootTag and rowTag are the same
#645
irajhedayati
closed
1 year ago
6
Disallow strings ending in D or F as doubles when inferring schema
#644
srowen
closed
1 year ago
0
Schema for stringvalue not inferred correctly
#643
ShubhamG25
closed
1 year ago
2
fs.azure.account.key error when reading files from Azure and OAuth
#642
DragonEnergy
closed
1 year ago
2
EMRServerless
#640
akash1302
closed
1 year ago
1
ignoreCorruptFiles and GZIP corrupted xml files
#639
slavokx
closed
1 year ago
3
Next