Closed by linziyingzz 3 years ago
I see the problem. Thank you! The values in the feature vector span very different scales, so the small-magnitude features are swamped by the large-magnitude ones. I would use standardization to deal with this, and z-score standardization is probably the best fit here. Each feature is centered by subtracting its mean, and the centered feature is then scaled by dividing by its standard deviation. That should make features with different dimensions and orders of magnitude more comparable.
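For reference, here is a minimal sketch of the z-score step described above, using only the Python standard library. The function name `zscore_standardize` and the sample matrix `X` are made up for illustration and are not taken from the project's code. It assumes rows are samples and columns are features, and uses the population standard deviation.

```python
from statistics import fmean, pstdev


def zscore_standardize(rows):
    """Standardize each column: subtract its mean, divide by its std."""
    columns = list(zip(*rows))
    means = [fmean(col) for col in columns]
    stds = [pstdev(col) for col in columns]  # population standard deviation
    return [
        # A constant feature has std 0; map it to 0.0 instead of dividing by zero.
        [(value - m) / s if s > 0 else 0.0 for value, m, s in zip(row, means, stds)]
        for row in rows
    ]


# Hypothetical feature matrix: two features that differ by about six orders of magnitude.
X = [
    [0.001, 1200.0],
    [0.003, 3400.0],
    [0.002, 2300.0],
]
X_std = zscore_standardize(X)
```

After this step every column has mean 0 and standard deviation 1, so neither feature dominates just because of its units.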
Hi, I have a suggestion: data with different dimensions and orders of magnitude are not intuitively comparable when they are put together. Could you improve that?