issues
search
MichaelAquilina
/
Reddit-Recommender-Bot
Indentifying Interesting Documents for Reddit using Recommender Techniques
7
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Bump lxml from 3.3.5 to 4.6.5
#119
dependabot[bot]
opened
2 years ago
0
Bump lxml from 3.3.5 to 4.6.3
#118
dependabot[bot]
closed
2 years ago
1
Bump lxml from 3.3.5 to 4.6.2
#117
dependabot[bot]
closed
3 years ago
1
Bump nltk from 2.0.4 to 3.4.5
#116
dependabot[bot]
opened
4 years ago
0
Bump requests from 2.3.0 to 2.20.0
#115
dependabot[bot]
opened
4 years ago
0
Use JSON Configuration files rather than Command Line Arguments
#114
MichaelAquilina
closed
10 years ago
0
Fast TFIDF lookup may work on the latest second-order-improvement branch
#113
MichaelAquilina
closed
10 years ago
0
Consider implementing PEBL as you now mention its use in your thesis
#112
MichaelAquilina
closed
10 years ago
2
Merge in second order ranking improvements
#111
MichaelAquilina
closed
10 years ago
0
Create Singular main script
#110
MichaelAquilina
closed
10 years ago
0
Would n-grams work?
#109
MichaelAquilina
closed
10 years ago
0
Check out `normalize-incoming-links` branch
#108
MichaelAquilina
closed
10 years ago
1
load_stopwords in wikiindex.py will fail if not in the correct directory
#107
MichaelAquilina
closed
10 years ago
0
Numerous tables are not automatically generated for WikiIndex
#106
MichaelAquilina
closed
10 years ago
0
Why is Python (genus) showing Python (programming language) in its results?
#105
MichaelAquilina
closed
10 years ago
2
Better post_extractor filtering?
#104
MichaelAquilina
closed
10 years ago
0
Prune concepts which occur once or too many times?
#103
MichaelAquilina
closed
10 years ago
1
This url should work, check whats going on
#102
MichaelAquilina
closed
10 years ago
1
Better performance with get_documents using TfidfValues table?
#101
MichaelAquilina
closed
10 years ago
1
Noise often generated by current word_concepts / second_order_ranking
#100
MichaelAquilina
closed
10 years ago
0
Determine which pages are being misclassified in nlptest.py
#99
MichaelAquilina
closed
10 years ago
0
Urls which fail word_concepts
#98
MichaelAquilina
closed
10 years ago
11
Writeup
#97
MichaelAquilina
closed
10 years ago
0
Append multiple instances of tokens in the title to boost their rank
#96
MichaelAquilina
closed
10 years ago
0
WikiIndex should gracefully degrade methods based on table support
#95
MichaelAquilina
closed
10 years ago
0
Goose returns empty text on certain pages
#94
MichaelAquilina
closed
10 years ago
0
Current word_concepts really easily confuses programming languages
#93
MichaelAquilina
closed
10 years ago
2
Add goose to main.py and datasource
#92
MichaelAquilina
closed
10 years ago
0
Consider sending pull request for Unicode fix to Goose
#91
MichaelAquilina
closed
10 years ago
0
Use python-boilerpipe or python-goose to extract the main text body from articles
#90
MichaelAquilina
closed
10 years ago
0
Noise within pages is affecting some results
#89
MichaelAquilina
closed
10 years ago
1
Some form of PageRank procedure could significantly increase accuracy
#88
MichaelAquilina
closed
10 years ago
0
second_order_ranking can be made faster with more numpy operations
#87
MichaelAquilina
closed
10 years ago
0
Create a webcrawler that follows standard conventions
#86
MichaelAquilina
closed
10 years ago
1
Pages should be penalised for having a lot of high weighted tfidf terms
#85
MichaelAquilina
closed
10 years ago
2
Consider Deleting all TermOccurrences with Counter=1
#84
MichaelAquilina
closed
10 years ago
2
Wiki.py should occasionally update DocumentFrequencies table for fast lookup
#83
MichaelAquilina
closed
10 years ago
0
Calculating corpus size takes too long, should be precalculated
#82
MichaelAquilina
closed
10 years ago
2
Consider changing back to InnoDB
#81
MichaelAquilina
closed
10 years ago
2
Delete all unprocessed pages on Index completion
#80
MichaelAquilina
closed
10 years ago
1
Ordering by Counter is slow because of lack of Index
#79
MichaelAquilina
closed
10 years ago
2
Reduce number of terms by filtering tfidf values of query vector
#78
MichaelAquilina
closed
10 years ago
0
SQL Queries slow due to Fetch performance
#77
MichaelAquilina
closed
10 years ago
1
No proper way to resolve page disambiguation over Captilisation
#76
MichaelAquilina
closed
10 years ago
0
Consider creating SQLIndex class which follows same structure of HashedIndex
#75
MichaelAquilina
closed
10 years ago
1
Query expansion using wordnet?
#74
MichaelAquilina
closed
10 years ago
1
Counting the number of links in Wiki pages could improve accuracy
#73
MichaelAquilina
closed
10 years ago
4
Consider more extensive stopword list
#72
MichaelAquilina
closed
10 years ago
0
Word Tokeniser failures
#71
MichaelAquilina
closed
10 years ago
0
Current word_tokenizer handles http urls really badly
#70
MichaelAquilina
closed
10 years ago
0
Next