bangla-tokenizer Search Results

29 results
for bangla-tokenizer


csebuetnlp/banglabert #2

Can I extract word embeddings using BanglaBERT?

Hi, is it possible to extract/generate word embeddings using **BanglaBERT**? I have **tokenized** my Bangla sentence using BanglaBERT. Now I want to generate **word embeddings** from my tokenized s…

MusfiqDehan updated 2 years ago
1
apache/lucene #2451

Add HTMLStripReader and WordDelimiterFilter from SOLR [LUCEN…

SOLR has two classes HTMLStripReader and WordDelimiterFilter which are very useful for a wide variety of use cases. It would be good to place them into core Lucene. --- Migrated from [LUCENE-1377]…

asfimport updated 2 years ago
35
bigscience-workshop/multilingual-modeling #2

Incrementally adding new languages to pre-trained models

# Experiments design Follow discussion [here](https://docs.google.com/document/d/110tlidAcpiNteKnA27tR5KPS_VahNqYKqCeJlu1MWww/edit#heading=h.wmf5tyes1tfk) ## pointers to code and datasets ### …

hadyelsahar updated 2 years ago
9
ManimCommunity/manim #2285

`log_to_file` in configuration gives `'Nonetype' is not iter…

## Description of bug / unexpected behavior trying to set up logging in a config file fails ## Expected behavior logs written to a file ## How to reproduce the issue Code for reproduci…

citrusmunch updated 2 years ago
1
vgaraujov/Question-Answering-Tutorial #2

Error when try this

This gives an error when run in this format with transformers 3.5.1. Since Hugging Face transformers updated their scripts, I found it in their legacy folder, then downloaded it in Colab and ran ```py !export SQUA…

whoafridi updated 3 years ago
4
sagorbrur/bnlp #16

RuntimeError: Internal: /sentencepiece/python/bundled/senten…

- [ ] Training SentencePiece ```python from bnlp import SentencepieceTokenizer bsp = SentencepieceTokenizer() data = "raw_text.txt" model_prefix = "test" vocab_size = 5 bsp.train(data, mode…

rakib06 updated 4 years ago
4
huggingface/transformers #2241

How to load the finetuned model for retraining from checkpoi…

Because of a bad internet connection and computational issues, it's hard for us to train for a large number of epochs. We're trying to use the run_squad.py script for Bangla QA system training. We have traine…

Tahsin-Mayeesha updated 4 years ago
2
scikit-learn/scikit-learn #7379

sklearn.CountVectorizer.get_feature_names() return broken w…

from sklearn.feature_extraction.text import CountVectorizer corpus = ['ভৌতিক গল্প পড়তে চাইলে লাইক দেন','শয়তান সহজে মরেনা ওতো একটা মানুষ রুপী শয়তান','তুমি ছুয়ে দিলে মন'] vec = CountVectorizer() x = v…

sulaimankhan7 updated 8 years ago
4
codelucas/newspaper #34

Other language support.

Can you add a new section describing how to add support for a new language?

mushfiq updated 10 years ago
10
