datasciencecampus
/
pygrams
Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence
https://datasciencecampus.github.io/pygrams
Other
63
stars
23
forks
issues
287 update system requirements section
#288
user624086
closed
5 years ago
1
update System Requirements section
#287
user624086
closed
4 years ago
0
285 data uspto
#286
thanasions
closed
5 years ago
1
cached USPTO data upload
#285
thanasions
closed
5 years ago
0
Changed folders for cached outputs (#281)
#284
IanGrimstead
closed
5 years ago
1
278 move mask
#283
thanasions
closed
5 years ago
1
Warn user that certain arguments are ignored
#282
IanGrimstead
closed
4 years ago
0
Change pickle outputs to put outputs from a run into a single folder
#281
IanGrimstead
closed
5 years ago
1
279 small adjustments
#280
thanasions
closed
5 years ago
1
small code adjustments
#279
thanasions
closed
4 years ago
0
unbias ngrams and unigrams should be done before cache
#278
thanasions
closed
5 years ago
0
273 dates as ints
#277
IanGrimstead
closed
5 years ago
1
exponential like emergence escore
#276
user624086
closed
5 years ago
2
resolves #272
#275
thanasions
closed
5 years ago
0
exponential-emergence
#274
user624086
closed
4 years ago
0
reduce df size: store dates as ints
#273
thanasions
closed
5 years ago
1
reduce df size: cpc dictionary
#272
thanasions
closed
5 years ago
0
257 add nmf code
#271
thanasions
closed
5 years ago
1
save time series to file
#270
user624086
closed
5 years ago
1
cache 2 initial commit!
#269
thanasions
closed
5 years ago
2
save time series to file
#268
user624086
closed
4 years ago
0
nmf topic modelling
#267
user624086
closed
5 years ago
1
reduce size of tfidf matrix using float16
#266
thanasions
closed
5 years ago
0
CPC filter does not produce predictions
#265
IanGrimstead
closed
4 years ago
3
Replace nltk with spacy (or textacy)
#264
IanGrimstead
closed
4 years ago
0
tfidf wrapper can be streamlined
#263
IanGrimstead
closed
4 years ago
0
redundant calculations
#262
thanasions
closed
5 years ago
0
blank line and comment
#261
thanasions
closed
5 years ago
1
list check on cpc codes is misplaced
#260
thanasions
closed
5 years ago
0
parallelize parts of the code using numba
#259
thanasions
closed
4 years ago
0
-pt argument double use
#258
user624086
closed
4 years ago
0
add nmf code
#257
thanasions
closed
5 years ago
0
Technical report
#256
IanGrimstead
closed
5 years ago
1
Convert R scripts
#255
mshodge
closed
4 years ago
0
248 tfidf filter
#254
IanGrimstead
closed
5 years ago
2
upgrade scipy?
#253
thanasions
closed
4 years ago
0
test
#252
thanasions
closed
5 years ago
0
resolves #250
#251
thanasions
closed
5 years ago
1
Bug: List type-check for cpc filtering
#250
thanasions
closed
5 years ago
0
filtering rows now gets rid of corresponding rows in df
#249
thanasions
closed
5 years ago
1
Filter tfidf matrix to n max features using our popularity scores (sum of column tfidfs)
#248
thanasions
closed
5 years ago
1
Bug: dropping tfidf matrix rows was not dropping the corresponding df rows
#247
thanasions
closed
5 years ago
0
added argument for embeddings threshold
#246
emily-barrington
closed
5 years ago
1
add argument for term filter threshold
#245
emily-barrington
closed
4 years ago
0
fixed embeddings threshold
#244
emily-barrington
closed
4 years ago
0
Embeddings threshold not working
#243
emily-barrington
closed
4 years ago
0
various fixes
#242
thanasions
closed
5 years ago
0
Timeseries computations contain a bug
#241
thanasions
closed
5 years ago
3
periods configurable by user as option
#240
thanasions
closed
4 years ago
0
Remove leading zero trimming (#235)
#239
IanGrimstead
closed
5 years ago
1