Closed sergionsk8 closed 5 years ago
Found a simple workaround: project all columns between the two Union operations.
df1.union(df2).select(df1.schema.fieldNames.head, df1.schema.fieldNames.tail: _*).union(df3).select("name").write.mode(SaveMode.Overwrite).text("/tmp/people.out")
Thanks @sergionsk8, good catch! It should be fixed in release/0.3 branch now. Can you give it a try? Will release it in a few days if no other bugs are found.
Hello @wajda, sorry for the late response. This is working for me now. Thank you!
A chain of two union operations on dataframes causes scala.MatchError in DataLineageBuilder.
Simple code to reproduce the issue:
Stacktrace: