Talend is a tool used for data cleaning. It detects the data type of each column and flags which entries in that column are valid and which are invalid.
In the figure above, green marks the valid entries and yellow marks the invalid ones.
The tool also goes beyond detecting the data type of the entries: it identifies the category the data belongs to.
This suggests a two-step process: the algorithm first reads a column and classifies its data type, and then, based on that data type, predicts what kind of data the column contains.
For example, for a column of countries, the detected data type is String, and the string values are then classified as countries based on the libraries the tool has.
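The two-step process described above can be illustrated with a small Python sketch. This is not Talend's actual implementation or API: the function names, the majority-vote rule, and the tiny `CATEGORY_DICTIONARIES` lookup are all assumptions made for illustration, standing in for whatever libraries the tool really uses.

```python
from collections import Counter
from typing import Dict, List, Optional, Set

# Hypothetical reference dictionary; a real tool would ship much larger ones.
CATEGORY_DICTIONARIES: Dict[str, Set[str]] = {
    "country": {"india", "france", "germany", "japan", "brazil"},
}


def value_type(value: str) -> str:
    """Return the narrowest type a single entry parses as."""
    try:
        int(value)
        return "Integer"
    except ValueError:
        pass
    try:
        float(value)
        return "Double"
    except ValueError:
        return "String"


def detect_datatype(values: List[str]) -> str:
    """Step 1: the column type is the type most of its entries parse as."""
    if not values:
        return "String"
    return Counter(value_type(v) for v in values).most_common(1)[0][0]


def detect_category(values: List[str], datatype: str) -> Optional[str]:
    """Step 2: for String columns, pick the dictionary matching most entries."""
    if datatype != "String":
        return None
    best, best_hits = None, 0
    for name, words in CATEGORY_DICTIONARIES.items():
        hits = sum(v.strip().lower() in words for v in values)
        if hits > best_hits:
            best, best_hits = name, hits
    # Assumption: a category is accepted only if more than half the entries match.
    return best if best_hits * 2 > len(values) else None


def profile_column(values: List[str]) -> dict:
    """Detect type and category, then mark each entry valid (True) or invalid (False)."""
    datatype = detect_datatype(values)
    category = detect_category(values, datatype)
    if category is not None:
        words = CATEGORY_DICTIONARIES[category]
        valid = [v.strip().lower() in words for v in values]
    else:
        valid = [value_type(v) == datatype for v in values]
    return {"datatype": datatype, "category": category, "valid": valid}


if __name__ == "__main__":
    # "Germny" is a deliberate typo, so it should be flagged as invalid.
    print(profile_column(["India", "France", "Germny", "Japan"]))
```

In this sketch the `valid` list plays the role of the green and yellow markers: `True` corresponds to a valid entry and `False` to an invalid one.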
Data Profiling features:
Talend Tool Analysis