Humorloos / IE683

0 stars 0 forks source link

The `title` field is not mapped to anything for IMDb dataset. #52

Closed ashishrana160796 closed 2 years ago

ashishrana160796 commented 2 years ago

Working on the mapping fix for this issue to alongside the #50. Wow, how did this one was missed out 😃 !!

ashishrana160796 commented 2 years ago

Also, one-to-one column mappings are missing too on this part. Need some more time on the fix for this issue, probably by afternoon.

ashishrana160796 commented 2 years ago

Column budget data is not correctly mapped into the target_df schema as well.

imdb_df['budget'] = imdb_df['budget'].str.extract('(\d+)', expand=False)
target_df['budget'] = target_df['budget'].astype(float)
ashishrana160796 commented 2 years ago

Also, mapping avg vote to imdb score field as both of them are pointed out to be common in our integrated schema.

Humorloos commented 2 years ago

Thanks for the effort, Ashish!

ashishrana160796 commented 2 years ago

PR #53, completely removes this issue.