dylanjcastillo / blog_comments

dylancastillo.co comments
1 stars 0 forks source link

posts/nlp-snippets-clean-and-tokenize-text-with-python/ #4

Open utterances-bot opened 3 years ago

utterances-bot commented 3 years ago

NLP Snippets #1: Clean and Tokenize Text With Python

The first step in a Machine Learning project is cleaning the data. In this article, you'll find 20 code snippets to clean and tokenize text data using Python.

https://dylancastillo.co/nlp-snippets-clean-and-tokenize-text-with-python/

Elenamo71 commented 3 years ago

Dear Mr Castillo, I am a poor linguist that has succeeded in analysing her text for the Master Thesis with your tutorial, but I am desperately trying to download the files from Colab. How can do it? I can only download empty files....Thanks for any support you can give me in advance. Cheers. Elena

dylanjcastillo commented 3 years ago

Hi @Elenamo71,

I don’t have enough context to really know what’s the issue. Maybe these links could be useful:

Let me know if that helps!

Best, Dylan

Elenamo71 commented 3 years ago

Hi @dylan! Thank you for your help, I have more or less obtained the files, also thanks to you. I am still learning Python (maybe you can reccomend an easy tutorial online where I can get the jest of it....). Thanks again! Elena

dylanjcastillo commented 3 years ago

HI @Elenamo71,

Great! I'm glad you figured it out.

I'd suggest starting with this one: Automate the boring stuff with Python. It's fun and you'll learn by doing projects.

Best, Dylan

Elenamo71 commented 3 years ago

Great! I will try it right now! Thank you again for all your help. Much appreciated!