dadosfera / Bugsfera

SQL Server pipelines cannot handle the `geography` data type. #41

Open rafaelsantanaep opened 1 year ago

rafaelsantanaep commented 1 year ago

Mandatory information:

There are customers directly impacted by this bug. Which?

Bug Category

Describe the bug

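While collecting data from SQL Server, the pipeline failed because the table contains a `geography` column, a type that Spark's JDBC reader cannot map to a Spark SQL type. The driver reports it as SQL type -158, as shown in the error below.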

An error occurred while calling o42.load.
: java.sql.SQLException: Unrecognized SQL type -158
    at org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils$.getCatalystType(JdbcUtils.scala:251)
    at org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils$.$anonfun$getSchema$1(JdbcUtils.scala:321)
    at scala.Option.getOrElse(Option.scala:189)
    at org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils$.getSchema(JdbcUtils.scala:321)
    at org.apache.spark.sql.execution.datasources.jdbc.JDBCRDD$.resolveTable(JDBCRDD.scala:63)
    at org.apache.spark.sql.execution.datasources.jdbc.JDBCRelation$.getSchema(JDBCRelation.scala:226)
    at org.apache.spark.sql.execution.datasources.jdbc.JdbcRelationProvider.createRelation(JdbcRelationProvider.scala:35)
    at org.apache.spark.sql.execution.datasources.DataSource.resolveRelation(DataSource.scala:339)
    at org.apache.spark.sql.DataFrameReader.loadV1Source(DataFrameReader.scala:279)
    at org.apache.spark.sql.DataFrameReader.$anonfun$load$2(DataFrameReader.scala:268)
    at scala.Option.getOrElse(Option.scala:189)
    at

How to replicate this issue:

  1. Create a pipeline using the following source:
    • database: AdventureWorks
    • schema: Person
    • table_name: Address
  2. Run the Pipeline
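
For the table in the steps above, one possible way to sidestep the failure is to convert the spatial column to text on the SQL Server side, so that Spark only sees types it can map. The sketch below is not Dadosfera's implementation. `build_select` is a hypothetical helper, and the column names are assumed from the standard AdventureWorks sample (`SpatialLocation` being the `geography` column in `Person.Address`). It relies on SQL Server's `STAsText()` method, which returns the WKT representation of a spatial value.

```python
# Hypothetical helper, for illustration only: it is not part of Spark or
# of the Dadosfera pipeline. It builds a SELECT statement that converts
# spatial columns to WKT text so the JDBC result set has no geography type.

def build_select(schema, table, columns, spatial_columns):
    """Return a SELECT that wraps each spatial column in .STAsText()."""
    parts = []
    for col in columns:
        if col in spatial_columns:
            # STAsText() is the SQL Server method that returns WKT text.
            parts.append(f"[{col}].STAsText() AS [{col}]")
        else:
            parts.append(f"[{col}]")
    return f"SELECT {', '.join(parts)} FROM [{schema}].[{table}]"


# Column names assumed from the AdventureWorks sample database.
query = build_select(
    "Person",
    "Address",
    ["AddressID", "City", "SpatialLocation"],
    {"SpatialLocation"},
)
print(query)
```

With plain PySpark, a statement like this can be supplied through the JDBC reader's `query` option instead of a table name. Whether the pipeline exposes a custom query is not stated in this issue, so treat that part as an assumption.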

Possible plan of action:

Does this bug impact any demos or sales?

No

Dadosfera Customer:


Workaround

What environment of software are you using?

When the bug happened: … 2022-12-16

beatrizaantunes commented 1 year ago

Thank you, Santana. This limitation is already documented in the external docs.