-
I am building a "rule editor" for NLP purposes. I want to give editors the ability to see "in real time" (after saving) how their rule would work in practice for training sentences they have added to…
-
**Phrase/sentence complexity assessment** is a typical NLP task with a dedicated academic literature.
### Discussion
A difficulty of the project is its multilingualism.
- Script approach : th…
-
### System Info
- `transformers` version: 4.45.0.dev0
- Platform: macOS-14.6.1-arm64-arm-64bit
- Python version: 3.12.4
- Huggingface_hub version: 0.24.6
- Safetensors version: 0.4.5
- Acceler…
-
**Overview**
Extract the binary-format Ko-BERT vocabulary into a text-format vocabulary that the huggingface module can read.
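A minimal sketch of the text side of this extraction, assuming the binary Ko-BERT vocabulary has already been loaded into a plain token-to-id mapping (how to load it is not shown here and depends on the binary format). It writes the BERT-style `vocab.txt` layout, one token per line with the 0-based line number serving as the token id. The helper names `write_vocab_txt` and `read_vocab_txt` and the toy vocabulary are hypothetical, used for illustration only.

```python
import os
import tempfile


def write_vocab_txt(token_to_id: dict[str, int], path: str) -> int:
    """Write tokens one per line, ordered by id; return the number of tokens written."""
    # Sort by id so that the 0-based line number equals the token id.
    ordered = sorted(token_to_id.items(), key=lambda item: item[1])
    ids = [idx for _, idx in ordered]
    if ids != list(range(len(ids))):
        raise ValueError("token ids must be contiguous and start at 0")
    with open(path, "w", encoding="utf-8", newline="\n") as f:
        for token, _ in ordered:
            f.write(token + "\n")
    return len(ordered)


def read_vocab_txt(path: str) -> dict[str, int]:
    """Read a one-token-per-line vocabulary back into a token-to-id mapping."""
    vocab = {}
    with open(path, encoding="utf-8") as f:
        for idx, line in enumerate(f):
            vocab[line.rstrip("\n")] = idx
    return vocab


# Toy vocabulary standing in for the real Ko-BERT one (hypothetical values).
toy_vocab = {"[PAD]": 0, "[UNK]": 1, "[CLS]": 2, "[SEP]": 3, "▁안녕": 4}

with tempfile.TemporaryDirectory() as tmp:
    vocab_path = os.path.join(tmp, "vocab.txt")
    n_written = write_vocab_txt(toy_vocab, vocab_path)
    round_trip = read_vocab_txt(vocab_path)
```

The contiguity check is a deliberate choice: a line-per-token file cannot represent gaps in the id space, so failing early is safer than silently shifting ids.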
-
* [Linked Data at the BBC](http://www.pilod.nl/w/images/2/29/7OliverBartlett_PilotLinkedData_Keynote1.pdf)
* [`schema.org` and data scraping](https://kb.apify.com/tips-and-tricks/scraping-data-from-w…
-
I have gensim installed, but I keep getting errors similar to this one when I import POSPair.
The code I'm using:
`import POSPair`
`sentences = POSPair.POSPairWordEmbeddings(data[0])`
Outpu…
-
### Fun Idea
My Gmail alone has over 10k emails.
The plan is to utilize this data to build a simple smart email reply feature.
The initial MVP will be a crude API that accepts an incoming email pay…
-
Malayalam is a highly inflectional and agglutinative language compared to many other languages, and very few people seem to have applied machine learning and deep learning techniques to it. A lo…
-
Thanks for making this available!
While the text itself is nice to have, some more interesting tasks can be done if the data is split into separate speeches, in some form: e.g. looking at how his rhe…
-
### Initial Checks
- [ ] I have searched GitHub for a duplicate issue and I'm sure this is something new
- [ ] I have read and followed [the docs & demos](https://github.com/modelscope/modelscope-age…