moj-analytical-services / splink_demos

Interactive notebooks containing demonstration code of the splink library
38 stars 27 forks source link

linker.profile_columns() not working while running on aws #94

Open shailesh-albanero opened 1 year ago

shailesh-albanero commented 1 year ago

I am trying to run linker.profile_columns() and getting following error.

TypeError: profile_columns() missing 1 required positional argument: 'column_expressions'

while running the same by passing a given column name works fine.

Also multiple cols in dict is not working and giving following error

Py4JJavaError: An error occurred while calling o1941.collectToPython. : java.lang.IllegalStateException: You hit a query analyzer bug. Please report your query to Spark user mailing list.

ThomasHepworth commented 1 year ago

Hey there!

Would you mind making a new issue in our main repo over here - https://github.com/moj-analytical-services/splink/issues.

Could you also detail the version of Splink you're using and give us a code snippet up to the error.

This is just so we have a log of everything internally.

Cheers!