GoogleCloudPlatform / cloudml-samples

Cloud ML Engine repo. Please visit the new Vertex AI samples repo at https://github.com/GoogleCloudPlatform/vertex-ai-samples
https://cloud.google.com/ai-platform/docs/
Apache License 2.0
1.52k stars 859 forks source link

Added cleanup step to remove unsupported characters in labels #450

Closed kweinmeister closed 5 years ago

kweinmeister commented 5 years ago

AutoML Natural Language previously allowed labels such as comp.graphics but would change them to become valid, e.g. comp_graphics. Now, AutoML NL throws an error.

The change in the notebook is to add this step:

# Change newsgroup titles to use underscores rather than periods
df['categories'] = df['categories'].str.replace('.','_')

This change is Reviewable

review-notebook-app[bot] commented 5 years ago

Check out this pull request on ReviewNB: https://app.reviewnb.com/GoogleCloudPlatform/cloudml-samples/pull/450

You'll be able to see notebook diffs and discuss changes. Powered by ReviewNB.