-
Lookahead is too expensive and wastes CPU.
The alternative (a worse one) is an LRU cache for lexemes, but we would still need to lex at least once, and it would also create a lot of strings.
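For illustration, the LRU-cache alternative could look roughly like the sketch below. It is not taken from this project: `CachingLexer`, `token_at`, and the toy token regex are hypothetical names, and Python's `functools.lru_cache` stands in for whatever cache the real lexer would use.

```python
import re
from functools import lru_cache

# Toy token pattern (an assumption): integers, identifiers, or any single
# non-space character, each optionally preceded by whitespace.
TOKEN_RE = re.compile(r"\s*(\d+|[A-Za-z_]\w*|\S)")


class CachingLexer:
    """Hypothetical lexer that memoizes the lexeme found at each offset."""

    def __init__(self, source, cache_size=64):
        self.source = source
        self.lex_calls = 0  # counts real lexing work, for demonstration
        # Wrap per instance so the cache is keyed on the offset alone.
        self.token_at = lru_cache(maxsize=cache_size)(self._lex_at)

    def _lex_at(self, pos):
        """Lex one token starting at pos; return (lexeme, next_pos)."""
        self.lex_calls += 1
        match = TOKEN_RE.match(self.source, pos)
        if match is None:
            return None, len(self.source)
        # group(1) allocates a new string for every lexeme that is lexed.
        return match.group(1), match.end()


lexer = CachingLexer("x = 42 + y")
first = lexer.token_at(0)   # lexes once
again = lexer.token_at(0)   # lookahead at the same offset hits the cache
```

Repeated lookahead at the same offset is served from the cache, but every offset is still lexed at least once, and each cached entry keeps a lexeme string alive, which matches both drawbacks mentioned above.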
-
This would allow us to better capture more complex constructs like [matrix-assisted laser desorption/ionization time-of-flight mass spectrometry](https://www.wikidata.org/w/index.php?sort=relevance&searc…
-
### Terms
- [X] I have searched [open and closed data issues](https://github.com/scribe-org/Scribe-Data/issues?q=is%3Aissue+label%3Adata+)
- [X] I agree to follow Scribe-Data's [Code of Conduct](http…
-
When exporting to LIFT from FLEx, all filled standard and custom fields are exported with each entry.
When exporting the same dictionary from TheCombine, only a subset of fields is exported (lexeme,…
-
Is it possible to mark tokens as case-sensitive or case-insensitive? I need case-insensitive tokens for a SQL-like parser.
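To make the request concrete, here is a minimal sketch of per-token case sensitivity in a regex-based tokenizer. It uses only Python's standard `re` module and says nothing about this library's API; `TOKEN_SPEC`, `tokenize`, and the keyword list are hypothetical. The scoped inline flag `(?i:...)` (Python 3.6+) makes only the keyword pattern case-insensitive.

```python
import re

# Hypothetical token table: only KEYWORD is case-insensitive.
TOKEN_SPEC = [
    ("KEYWORD", r"\b(?i:select|from|where)\b"),
    ("IDENT", r"[A-Za-z_]\w*"),
    ("NUMBER", r"\d+"),
    ("OP", r"[*=,<>]"),
    ("SKIP", r"\s+"),
]
MASTER = re.compile("|".join(f"(?P<{name}>{pat})" for name, pat in TOKEN_SPEC))


def tokenize(text):
    """Return (kind, value) pairs; keywords are normalized to upper case."""
    tokens = []
    pos = 0
    while pos < len(text):
        match = MASTER.match(text, pos)
        if match is None:
            raise ValueError(f"unexpected character {text[pos]!r} at {pos}")
        kind = match.lastgroup
        if kind != "SKIP":
            value = match.group()
            if kind == "KEYWORD":
                value = value.upper()  # canonical spelling for the parser
            tokens.append((kind, value))
        pos = match.end()
    return tokens


tokens = tokenize("SeLeCt name FROM Users where id = 7")
```

Identifiers such as `Users` keep their original case, while `SeLeCt` and `where` both come out as upper-case keywords.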
-
Wikimedia Commons already has roughly nine thousand pronunciation audio files for Tamil words. A bot script is needed to link them in Wikidata. This is an automation that will be used only for Tamil…
-
All error handling still needs to be done.
At the moment, a `Basictype` object is created with the lexeme "ERROR".
Each of these needs to be changed to the matching `output::errorFunction()`.
cbfbl updated
4 years ago
-
Hi, I suggest we remove matching verbs for now. The reason is that we need to first create a good structure in the Qitems to handle the verbs. I'm about to propose a new property "verbform of " or "ac…
-
We're having an issue with pattern==3.6: if there are duplicates, etc. in the model documents, getting the nsmallest fails for vector_space_search:
```python
from pattern.en import lexeme
fro…
-
It seems the config isn't quite right to include token probabilities. I'm not 100% sure of the solution, but this issue should help (https://github.com/explosion/spaCy/discussions/6388#discussioncomme…