Closed stevehadd closed 4 years ago
The ability to split on cruise ID has now been implemented in the dataset class. The performance of the ML algorithms on unseen cruises has been demonstrated for decision trees and neural networks. Further analysis of the relationship between cruise ID and performance will explored in other issues.
There seems to be a correlation between type and cruise ID. Cruise ID has not previously been included as a feature in classification. Its effect on results should be explored. How cruise data is split between training and test could be important. The following for be tried