-
A desideratum is to enrich the resource's data with keywords extracted from ToCs (Table of Content). See e.g. http://digitool.hbz-nrw.de:1801/webclient/DeliveryManager?pid=7240453&custom_att_2=simple_…
-
I have some feedback for the slides:
Feature extractions:
Size height width type are relevant features, but they are not the most interesting and my intuition not the most relevant. I would rather th…
-
### Title of the resource
Automatic Text Recognition (ATR) - Video 3: Image Optimisation
### Resource type
External Resource
### Authors, editors and contributors
Hippolyte Souvay, Lari…
-
### Title of the resource
Automatic Text Recognition (ATR) - Video 2: Get images
### Resource type
External Resource
### Authors, editors and contributors
Anna Busch, David Lassner, An…
-
I did all the setup as explained, but when I run the python file nothing is printed to the terminal? What did I do wrong.
```
from rake_nltk import Rake
# Uses stopwords for english from NLTK, …
-
## 🐛 Bug
aimrocks.errors.RocksIOError: b'IO error: While open a file for random read: /.aim/meta/chunks/d9b1e76ebed74634b9545c8e/000009.sst: Too many open files'
Made close to 900 API calls to…
-
-
extraction of https://www.eclipse.org/downloads/download.php?file=/oomph/drops/release/1.34.0-a/products/**eclipse-inst-jre-linux64.tar.gz** archive is not working on linux.
tar is giving error messa…
-
* [ ] First, make some designs to explain what the outcomes are
* [ ] Show how it is achieved (from documentation)
* [ ] Explain how this can lead to small, simple search engines in 50+ languages.
eklem updated
3 years ago
-
thanks for sharing! here's the rake.py file edited to use spacy instead of nltk. it removes certain verb types in _get_phrase_list_from_words, which i found to improve performance a bit (in small samp…