aws-samples / sagemaker-101-workshop

Hands-on demonstrations for data scientists exploring Amazon SageMaker
76 stars 47 forks source link

feat(nlp): migrate TF GloVe to FastText vectors #8

Closed athewsey closed 3 years ago

athewsey commented 3 years ago

Issue #, if available: #1

Description of changes:

In TF.Keras NLP notebooks:

FastText offers multi-lingual pre-trained embeddings (vs English-only) and marginally faster download time (although in current implementation this is offset by increases in downstream processing times)

PyTorch alternatives not yet updated, pending further testing.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

athewsey commented 3 years ago

Cloned to clean environment and ran through - all seems fine