a hive udf method to do fuzzy string matching using Jaro Winkler, Levenstein or NGram distance
9
stars
3
forks
source link
I keep getting this error when I try to run it in HIVE: Error while processing statement: FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask #3
I get this error when I try to run it in Hive Shell: Failed with exception java.io.IOException:org.apache.hadoop.hive.ql.metadata.HiveException: Error evaluating evaluate match of two string
I get this error when I run it in HUE: Error while processing statement: FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask
The following query runs fine when a string is input, but it fails when I try to compare actual fields:
SELECT fuzzy_match('Roberto','Robert', "JW")
FROM my_table
I get this error when I try to run it in Hive Shell: Failed with exception java.io.IOException:org.apache.hadoop.hive.ql.metadata.HiveException: Error evaluating evaluate match of two string
I get this error when I run it in HUE: Error while processing statement: FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask
The following query runs fine when a string is input, but it fails when I try to compare actual fields: SELECT fuzzy_match('Roberto','Robert', "JW") FROM my_table