dotnet / machinelearning

ML.NET is an open source and cross-platform machine learning framework for .NET.
https://dot.net/ml
MIT License

Samples pointing at wrong resources #6329

Open tarekgh opened 1 year ago

tarekgh commented 1 year ago

The following sample points at https://aka.ms/mlnet-resources/datasets/cifar10.zip, but it looks like this resource is not valid.

https://github.com/dotnet/machinelearning/blob/bca5736c23e093006bb6a5a3f85a789c4a1cdcf2/docs/samples/Microsoft.ML.Samples/Dynamic/Trainers/MulticlassClassification/ImageClassification/LearningRateSchedulingCifarResnetTransferLearning.cs

It looks like we have code in different places trying to get the same resource from other sources: https://github.com/dotnet/machinelearning/blob/bca5736c23e093006bb6a5a3f85a789c4a1cdcf2/docs/samples/Microsoft.ML.AutoML.Samples/Cifar10.cs#L13

Also, there are other samples pointing at https://github.com/onnx/models/tree/master/vision/classification/squeezenet, which exists, but the zipped files there are structured differently from what the sample expects: https://github.com/dotnet/machinelearning/blob/04dda55ab0902982b16309c8e151f13a53e9366d/docs/samples/Microsoft.ML.Samples/Dynamic/Transforms/ApplyONNXModelWithInMemoryImages.cs#L16. The comment even suggests there is a Microsoft.ML.Onnx.TestModels NuGet package, which does not exist either. The correctly structured file can be found at https://s3.amazonaws.com/download.onnx/models/opset_8/squeezenet.tar.gz
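
One way to catch this kind of layout mismatch early might be to compare an archive's member list against the relative paths a sample expects before wiring the URL into the sample. The following is only a rough sketch in Python, not code from this repo: the function names and the example path `squeezenet/model.onnx` are hypothetical, and the demo archive is built in memory instead of being downloaded from any of the URLs above.

```python
from __future__ import annotations

import io
import tarfile
from pathlib import PurePosixPath


def missing_members(archive: tarfile.TarFile, expected: set[str]) -> set[str]:
    """Return the expected relative paths that are absent from the archive."""
    # Normalize names such as "./a/b" to "a/b" and ignore directory entries.
    present = {
        str(PurePosixPath(member.name))
        for member in archive.getmembers()
        if member.isfile()
    }
    return expected - present


def check(archive_bytes: bytes, expected: set[str]) -> set[str]:
    """Open a gzip-compressed tar held in memory and report missing paths."""
    with tarfile.open(fileobj=io.BytesIO(archive_bytes), mode="r:gz") as tar:
        return missing_members(tar, expected)


def build_demo_archive(paths: list[str]) -> bytes:
    """Build a small in-memory .tar.gz with placeholder files (illustration only)."""
    buffer = io.BytesIO()
    with tarfile.open(fileobj=buffer, mode="w:gz") as tar:
        for path in paths:
            data = b"placeholder"
            info = tarfile.TarInfo(name=path)
            info.size = len(data)
            tar.addfile(info, io.BytesIO(data))
    return buffer.getvalue()


if __name__ == "__main__":
    # Hypothetical layout: the sample is assumed to look for this path.
    expected_paths = {"squeezenet/model.onnx"}
    archive = build_demo_archive(["model/squeezenet.onnx"])
    print(check(archive, expected_paths))
```

An empty result would mean every expected path is present; anything else lists the files the sample would fail to find. A real check would read the downloaded archive's bytes and use the paths the sample actually references.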

ericstj commented 1 year ago

First off, the aka.ms link doesn't exist for datasets/cifar10.zip. I took a look at the blob storage under datasets and didn't see cifar10.zip there. @michaelgsharp do we have this archived somewhere?

tarekgh commented 1 year ago

CC @luisquintanilla

michaelgsharp commented 1 year ago

We have all the data archived. I have downloaded it on my computer, but it doesn't appear to be in the format the sample is expecting. Will have to re-order the files/folders for that.