A production engineer has downloaded logging information from his machines and understands that the lifetime can, in theory, be predicted from the sensor data. He follows the provided instructions to convert his data into a multivariate time series with a column that contains the time until the next detected failure.
A web shop owner understands that he might be able to predict the number of purchases in his web shop within a month. Following the instructions, he has collected a large amount of data over the past years. He follows the provided instructions to convert his data.
A manufacturer wants to use the package to predict manufacturing times for his products. He has collected the data of the past years and saved it in a CSV file. Some samples have missing information.
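As an illustration of the production engineer's story, the "time until the next failure" column could be derived roughly as follows. This is a minimal sketch, not the package's actual conversion routine: the function name `time_until_next_failure`, the numeric timestamps, and the boolean failure flags are all assumptions made for the example.

```python
from typing import List, Optional


def time_until_next_failure(
    timestamps: List[float], failure_flags: List[bool]
) -> List[Optional[float]]:
    """Return, for each sample, the time until the next failure.

    Hypothetical helper: assumes timestamps are sorted in ascending order
    and failure_flags[i] is True when a failure was detected at sample i.
    Samples with no later failure get None.
    """
    result: List[Optional[float]] = [None] * len(timestamps)
    next_failure_time: Optional[float] = None
    # Walk backwards so the most recently seen failure is always the next one.
    for i in range(len(timestamps) - 1, -1, -1):
        if failure_flags[i]:
            next_failure_time = timestamps[i]
        if next_failure_time is not None:
            result[i] = next_failure_time - timestamps[i]
    return result


# Made-up example data: failures detected at t=2.0 and t=4.0.
timestamps = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
failures = [False, False, True, False, True, False]
print(time_until_next_failure(timestamps, failures))
# [2.0, 1.0, 0.0, 1.0, 0.0, None]
```

The last sample has no failure after it, so its target is left as None here. Whether such samples should be dropped or kept is a decision for the user.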
These situations should be considered:
should provide basic methods for users who have multiple files, especially when the features and targets are not in the same database (this is better done with software than with code).
-- with merge key
-- without merge key
should provide basic methods for users who have other types of data: txt, Excel, JSON, databases (SQL, ...)
should automatically check for potential issues in the data
should propose solutions to the issues to the user and then execute them
should provide methods to deal with large data sets (larger than 10 GB)
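Some of the situations listed above could be sketched as follows, using only the Python standard library. Every name here (`scan_csv_for_missing`, `propose_fixes`, `merge_on_key`, `merge_by_position`, `MISSING_TOKENS`, the 0.5 threshold) is hypothetical and chosen for illustration. None of it is the package's API. The scan reads one row at a time, so it would also work on files too large for memory, while the two merge helpers are in-memory only.

```python
import csv
import io
from collections import Counter

# Assumed markers for a missing value; real data may use others.
MISSING_TOKENS = {"", "na", "nan", "null", "none"}


def scan_csv_for_missing(rows):
    """Count missing values per column while streaming over dict rows."""
    missing = Counter()
    total = 0
    for row in rows:
        total += 1
        for column, value in row.items():
            if column is None:  # extra fields beyond the header; ignored in this sketch
                continue
            if value is None or value.strip().lower() in MISSING_TOKENS:
                missing[column] += 1
    return total, dict(missing)


def propose_fixes(total, missing, drop_threshold=0.5):
    """Suggest one action per affected column (the threshold is an assumption)."""
    proposals = {}
    for column, count in missing.items():
        if count / total > drop_threshold:
            proposals[column] = "drop column"
        else:
            proposals[column] = "drop or impute affected rows"
    return proposals


def merge_on_key(features, targets, key):
    """With a merge key: keep only rows whose key appears in both tables."""
    targets_by_key = {row[key]: row for row in targets}
    return [
        {**f, **targets_by_key[f[key]]} for f in features if f[key] in targets_by_key
    ]


def merge_by_position(features, targets):
    """Without a merge key: pair rows by order, which requires equal length."""
    if len(features) != len(targets):
        raise ValueError("row counts differ; cannot merge by position")
    return [{**f, **t} for f, t in zip(features, targets)]


# Made-up example data standing in for two separate files.
feature_csv = "id,temperature,speed\n1,20.5,100\n2,,110\n3,21.0,NA\n"
target_csv = "id,duration\n1,12.0\n3,15.5\n"

total, missing = scan_csv_for_missing(csv.DictReader(io.StringIO(feature_csv)))
print(total, missing)  # 3 {'temperature': 1, 'speed': 1}
print(propose_fixes(total, missing))

features = list(csv.DictReader(io.StringIO(feature_csv)))
targets = list(csv.DictReader(io.StringIO(target_csv)))
merged = merge_on_key(features, targets, "id")
print([row["id"] for row in merged])  # ['1', '3']
```

Merging by position is only safe when both files are known to list the same samples in the same order, so a real implementation would probably ask the user to confirm that before executing it.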