sehsanm / embedding-benchmark

Word Embedding benchmark project By Shahid Beheshti University NLP Lab
GNU General Public License v3.0

Find, Upload and Cleanse Persian Wiki Dump #4

Open sehsanm opened 5 years ago

sehsanm commented 5 years ago
FullDataAlchemist commented 5 years ago

How can I pick this one up?

sehsanm commented 5 years ago

Please accept the collaboration invitation: https://github.com/sehsanm/embedding-benchmark/invitations

On Tue, Dec 4, 2018 at 12:24 PM Pouria Nikvand <notifications@github.com> wrote:

How can I pick this one up?


FullDataAlchemist commented 5 years ago

thanks.

FullDataAlchemist commented 5 years ago

Hi. I uploaded the cleaned text of the wiki dump, with the sentences already segmented. The raw text was also uploaded in a separate file by mistake; I think it is unusable, so you can delete that file.

thanks.

sehsanm commented 5 years ago

Can you please update Readme.md, create a corpus section, and place the link there?

On Sun, 16 Dec. 2018, 6:08 pm Pouria Nikvand <notifications@github.com> wrote:

Hi. I uploaded the cleaned text of the wiki dump, with the sentences already segmented. The raw text was also uploaded in a separate file by mistake; I think it is unusable, so you can delete that file.

thanks.


FullDataAlchemist commented 5 years ago

Of course. I sent a pull request.