-
I am interested in finding/contributing to efforts to create an open source data analysis platform, similar to what Tableau offers. I have found a couple of solid contenders, such as [Apache Superset]…
-
This is a list of terms (in IsiZulu language) related to Data Science that I think should be included in the glossary. I see that two terms already exist for IsiZulu (i.e. **I-algorithm** and **ukumbi…
-
-
Why?
1. Solve issue #8
2. Provide info about all Python versions
3. Make data available from the web for data mining, and 3rd party services / tools
4. Provide a stable data channel for future versio…
-
Scanned PDFs:
References:
1. http://xiaofeima1990.github.io/2016/12/19/extract-text-from-sanned-pdf/
2. https://datascience.blog.wzb.eu/2017/02/16/data-mining-ocr-pdfs-using-pdftabextract-to-liber…
-
Explore the status of open science in Kenya through literature search and data mining. The idea is to analyze how open science tools have been used in research. For example, for a given period of time…
-
In the original paper about the automated topic labeling (Mei, Q., Shen, X., & Zhai, C. 2007, Automatic labeling of multinomial topic models, in: Proceedings of the 13th ACM SIGKDD International Confe…
-
num | name | result | fork | color | tag1 | tag2
-- | -- | -- | -- | -- | -- | --
0 | Bitcoin-and-Cryptocurrency-Technologies | keep | TRUE | | |
1 | CognosTM1-DevKit | keep | TRUE | yellow | …
-
Running pytest (in a venv) on commit `6cd14fb` *sometimes* fails with the following error:
```
(.venv) jfreige@sl-akali-p-cs1:easy-entrez (main)$ pytest
=========================================…
-
Just to be a bit more efficient in Cython implementations (which is often used in conjunction with Numpy for data mining and scientific applications), we should at least create a `.pxd` file which bas…