Closed by ZhihengShen 1 month ago
The data scraping succeeded: it gathered a substantial amount of information from WordPress, covering attributes such as checkboxes, country, date, and descriptions. The dataset still needs thorough cleaning before it can be used. Cleaning should remove irrelevant or redundant entries and resolve inconsistencies that could hurt the model's accuracy during fine-tuning. Once the data is cleaned, fine-tuning can proceed with a split of 70% training data, 20% validation data, and 10% test data.
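As a rough illustration of the cleaning step, here is a minimal sketch. It assumes the scraped records are plain dictionaries keyed by attribute name (for example `country`, `date`, `description`). The function name `clean_records` and the specific rules (trim whitespace, drop rows with no description, drop exact duplicates) are hypothetical choices for illustration, not the pipeline actually used.

```python
def clean_records(records):
    """Trim whitespace, drop rows with an empty description, remove exact duplicates.

    `records` is assumed to be a list of dicts, one per scraped form entry.
    """
    seen = set()
    cleaned = []
    for rec in records:
        # Normalise every value to a stripped string; treat None as empty.
        row = {
            key: "" if value is None else str(value).strip()
            for key, value in rec.items()
        }
        # Assumption: a record without a description is not useful for training.
        if not row.get("description"):
            continue
        # Use the sorted key/value pairs as a fingerprint to detect duplicates.
        fingerprint = tuple(sorted(row.items()))
        if fingerprint in seen:
            continue
        seen.add(fingerprint)
        cleaned.append(row)
    return cleaned
```

Real scraped data would likely also need date and country normalisation, which depends on the formats actually present in the dataset.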
[x] Add the form attributes: checkbox, country, date, description......
[x] Fine-tune with 70% training data, 20% validation data, and 10% test data.
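The 70/20/10 split mentioned above could be sketched as follows. This is only one possible approach, assuming the cleaned dataset fits in memory as a list; the function name `split_dataset`, the fixed seed, and the use of a simple random shuffle are assumptions for illustration.

```python
import random


def split_dataset(records, train_pct=70, val_pct=20, seed=42):
    """Shuffle records and split them into train, validation, and test lists.

    Percentages are integers so the split sizes avoid floating-point rounding;
    whatever remains after train and validation goes to the test set.
    """
    shuffled = list(records)  # copy so the caller's list is left untouched
    random.Random(seed).shuffle(shuffled)  # seeded for a reproducible split

    n_train = len(shuffled) * train_pct // 100
    n_val = len(shuffled) * val_pct // 100

    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]
    return train, val, test
```

Splitting after cleaning and deduplication matters here: if duplicates remain, the same record can land in both the training and test sets and inflate the measured accuracy.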