-
I am often frustrated that I can only segment documents into sentences on a corpus, but I came up with an idea to make it possible on tokens with a boundary marker. I would use this often for word-embe…
-
**Background**:
My "2Vec refactor wishlist" (#1623) suggested among other things:
> 7. separating vocabulary-management into explicitly different classes/objects, for more control/customization,…
-
Hello,
I'm trying to fuzz a contract using assert mode with testLimit 50000. It is taking a long time, and after it reaches 6000 trials, echidna suddenly stops.
So in order to debug the proble…
-
Post questions here for this week's orienting readings: Kozlowski, Austin, Matt Taddy, James Evans. 2019. “The Geometry of Culture: Analyzing the Meanings of Class through Word Embeddings.” American …
-
The latest Lucene package has no stemmer for the Latin language. I have a rule-based Latin stemmer built on the grammar and rules of Latin.
---
Migrated from [LUC…
-
First, pose a research question you would like to answer (in one, artfully worded sentence...ending with a question mark). This could be the same question you posed for the first week's assignment, or…
-
### Links
- Paper: https://hal.archives-ouvertes.fr/hal-02437881/document
### Abstract
- Addressing the needs of visually impaired people is of continued interest in Human Computer Interaction (H…
-
This issue is applicable to N'Ko.
Certain constructs in N'Ko text mean 'each and every ....', and they appear with a dash on the baseline with spaces on either side. For example:
This is also used …
-
If we are using the CREDIT taxonomy (https://web.archive.org/web/20200711223353/https://casrai.org/credit/), I'd like to suggest that we start filling in the roles early on.
These are the categories …
-
Does the package provide a utility for reading and integrating data (removing redundancy), or should I handle that myself?