dotnet / machinelearning-modelbuilder

Simple UI tool to build custom machine learning models.
Creative Commons Attribution 4.0 International

Data classification: The error message is not very clear after changing the "SentimentText" column's Data type from "String" to "Boolean". #2986

Open v-Hailishi opened 3 days ago

v-Hailishi commented 3 days ago

System Information (please complete the following information):

Windows OS: Windows-11-Enterprise-24H2
ML.Net Model Builder 2022: 17.19.0.2456401 (Main Build)
Microsoft Visual Studio Enterprise: 2022(17.11.5)
.Net: 6.0, 8.0

Describe the bug

TestMatrix https://testpass.blob.core.windows.net/test-pass-data/wikipedia-detox-250-line-data.tsv

To Reproduce

Steps to reproduce the behavior:

  1. Select Create a new project from the Visual Studio start window.
  2. Choose the C# Console App (.NET Core) project template.
  3. Add Model Builder by right-clicking the project.
  4. Select the "Data classification" scenario.
  5. On the Data page, choose the data source and column.
  6. Change the "SentimentText" column's Data type from "String" to "Boolean".
  7. On the Train page, click "Start training".
  8. Check the error message.

Expected behavior

The error message should read something like "Schema mismatch for Context column 'SentimentText': expected String, got Boolean".

Screenshot Image

Additional context

After the "SentimentText" column's Data type is changed to "Boolean", none of the data can be parsed, because the "context" column's Data type should be "String". The error message, however, mentions only the data in line 66.
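
To illustrate the difference between the reported and the expected message, here is a minimal Python sketch. It is not Model Builder's implementation: the function names, the set of accepted Boolean literals, and the wording of the partial-failure message are all assumptions made for illustration. The idea is to check every row first, and to report a column-level schema mismatch when no value can be parsed, rather than pointing at a single line.

```python
from typing import List, Optional

# Assumption: the literals a loader would accept as Boolean values.
BOOLEAN_LITERALS = {"true", "false", "0", "1"}


def failing_lines(values: List[str]) -> List[int]:
    """Return the 1-based line numbers whose value cannot be read as a Boolean."""
    return [
        line_number
        for line_number, value in enumerate(values, start=1)
        if value.strip().lower() not in BOOLEAN_LITERALS
    ]


def describe_boolean_parse_failure(
    column: str, values: List[str], inferred_type: str = "String"
) -> Optional[str]:
    """Hypothetical helper: summarize Boolean parse failures for one column.

    Returns None when every value parses. When every value fails, the
    problem is the chosen column type, so a schema-level message is
    returned instead of a per-line one.
    """
    bad = failing_lines(values)
    if not bad:
        return None
    if len(bad) == len(values):
        return (
            f"Schema mismatch for Context column '{column}': "
            f"expected {inferred_type}, got Boolean"
        )
    return (
        f"Column '{column}': could not parse {len(bad)} of {len(values)} "
        f"values as Boolean (first failure at line {bad[0]})"
    )
```

With free-text values such as those in the "SentimentText" column, every row fails, so the helper returns the schema mismatch message rather than a message about one line.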