AbsaOSS / cobrix

A COBOL parser and Mainframe/EBCDIC data source for Apache Spark
Apache License 2.0
136 stars 79 forks source link

DataFrame with some columns in EBCDIC #690

Open sfuenteRoot opened 1 month ago

sfuenteRoot commented 1 month ago

I have this code:

val df = spark .read .format("za.co.absa.cobrix.spark.cobol.source") .option("ebcdic_code_page", "common_extended") .option("copybook", copybook) .load("ruta" .select("Campo")

but get this error:

za.co.absa.cobrix.cobol.parser.encoding.EBCDIC$ cannot be cast to za.co.absa.cobrix.cobol.parser.encoding.Encoding

if i read the parquet, with spark reading, the column contains infromation such as "1b64b427", i need convert to ascii

the file copyboock i`ve wirtten like

01 Campo String

And

01 Campo X(76)

But get the same error. What its no correct?

yruslan commented 1 month ago

Hi, I can't reproduce the issue. Could you please, paste the exact code snipped you are using, and the exact copybook?

Also, which Cobrix version are you using?