Open RandomFractals opened 2 years ago
Something is off while trying to load smaller 2022 crimes CSV data file with msft DataFrame:
@colombod from .Net Interactive team suggested to try the latest preview version of .Net ML libs using:
#i "nuget:https://pkgs.dev.azure.com/dnceng/public/_packaging/MachineLearning/nuget/v3/index.json"
#r "nuget:Microsoft.Data.Analysis,0.20.0-preview.22514.1"
This is using a daily build that will be out soon for the Dataframe nuget.
Sample ml project notebook:
Updated .Net Interactive notebooks setup to use new Polyglot Notebooks ext.:
https://marketplace.visualstudio.com/items?itemName=ms-dotnettools.dotnet-interactive-vscode
Changed imports to ML .Net preview nugets listed above.
Still getting load CSV data error, even for the smaller 33Mb file:
ML .net nuget is very beta and can't parse CSV with missing data fields yet.
Devs suggested to try 3rd party parquet library instead:
Use .Net Interactive Notebooks extension: https://marketplace.visualstudio.com/items?itemName=ms-dotnettools.dotnet-interactive-vscode
and
Microsoft.Data.Analysis
api: https://learn.microsoft.com/en-us/dotnet/api/microsoft.data.analysis.dataframe?view=ml-dotnet-preview