Open udithac opened 5 months ago
So much input. I just have an idea for the first one, how to encode "Years of Professional Coding Experience": Think about one hot encoding. Create a table such as
sample | 1-2 years | 3-4 years | 5-6 years | ... |
---|---|---|---|---|
1 | 0 | 1 | 0 | 0 |
2 | 1 | 0 | 0 | 0 |
3 | 0 | 0 | 1 | 0 |
So you encode every feature as a sparse vector with 0 in every place except for a 1 with the selection of the user in the survey. This works really well with linear regression and other models.
Question 1 and 2 are similar to https://github.com/novik2713/data-circle-team-C/issues/10 IMXO we have to use just one of three that have the strongest relation/corelation to Salary.
Working on the followibg Future Engineering tasks: