achouhan93 / Data-Wrecker

Repository for Data-Wrecker-Framework Project
1 stars 2 forks source link

Rahul: Analysis of Data Profiling tools and Data Preparation Tools #2

Open achouhan93 opened 5 years ago

rahulsarode commented 5 years ago

Analysis of following Data profiling tools Open Source Data Quality and Profiling, Data Cleaner and Talend is done and observations/features useful for our project are shared with the Team.

achouhan93 commented 5 years ago

Rahul: Can you please provide the information which was shared with the team as a comment for future reference.

rahulsarode commented 5 years ago

Below information was shared with the team : -Identification of data type of column -Analyse the column and provide column statistics (pattern, format , range, count etc. ) -Allocate internal index for each record (row) -Data masking can be used to unclean duplicate records