MichaelAquilina Reddit-Recommender-Bot issues

MichaelAquilina / Reddit-Recommender-Bot

Indentifying Interesting Documents for Reddit using Recommender Techniques

7 stars 0 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Bump lxml from 3.3.5 to 4.6.5

#119 dependabot[bot] opened 2 years ago
0
Bump lxml from 3.3.5 to 4.6.3

#118 dependabot[bot] closed 2 years ago
1
Bump lxml from 3.3.5 to 4.6.2

#117 dependabot[bot] closed 3 years ago
1
Bump nltk from 2.0.4 to 3.4.5

#116 dependabot[bot] opened 3 years ago
0
Bump requests from 2.3.0 to 2.20.0

#115 dependabot[bot] opened 3 years ago
0
Use JSON Configuration files rather than Command Line Arguments

#114 MichaelAquilina closed 10 years ago
0
Fast TFIDF lookup may work on the latest second-order-improvement branch

#113 MichaelAquilina closed 10 years ago
0
Consider implementing PEBL as you now mention its use in your thesis

#112 MichaelAquilina closed 10 years ago
2
Merge in second order ranking improvements

#111 MichaelAquilina closed 10 years ago
0
Create Singular main script

#110 MichaelAquilina closed 10 years ago
0
Would n-grams work?

#109 MichaelAquilina closed 10 years ago
0
Check out `normalize-incoming-links` branch

#108 MichaelAquilina closed 10 years ago
1
load_stopwords in wikiindex.py will fail if not in the correct directory

#107 MichaelAquilina closed 10 years ago
0
Numerous tables are not automatically generated for WikiIndex

#106 MichaelAquilina closed 10 years ago
0
Why is Python (genus) showing Python (programming language) in its results?

#105 MichaelAquilina closed 10 years ago
2
Better post_extractor filtering?

#104 MichaelAquilina closed 10 years ago
0
Prune concepts which occur once or too many times?

#103 MichaelAquilina closed 10 years ago
1
This url should work, check whats going on

#102 MichaelAquilina closed 10 years ago
1
Better performance with get_documents using TfidfValues table?

#101 MichaelAquilina closed 10 years ago
1
Noise often generated by current word_concepts / second_order_ranking

#100 MichaelAquilina closed 10 years ago
0
Determine which pages are being misclassified in nlptest.py

#99 MichaelAquilina closed 10 years ago
0
Urls which fail word_concepts

#98 MichaelAquilina closed 10 years ago
11
Writeup

#97 MichaelAquilina closed 10 years ago
0
Append multiple instances of tokens in the title to boost their rank

#96 MichaelAquilina closed 10 years ago
0
WikiIndex should gracefully degrade methods based on table support

#95 MichaelAquilina closed 10 years ago
0
Goose returns empty text on certain pages

#94 MichaelAquilina closed 10 years ago
0
Current word_concepts really easily confuses programming languages

#93 MichaelAquilina closed 10 years ago
2
Add goose to main.py and datasource

#92 MichaelAquilina closed 10 years ago
0
Consider sending pull request for Unicode fix to Goose

#91 MichaelAquilina closed 10 years ago
0
Use python-boilerpipe or python-goose to extract the main text body from articles

#90 MichaelAquilina closed 10 years ago
0
Noise within pages is affecting some results

#89 MichaelAquilina closed 10 years ago
1
Some form of PageRank procedure could significantly increase accuracy

#88 MichaelAquilina closed 10 years ago
0
second_order_ranking can be made faster with more numpy operations

#87 MichaelAquilina closed 10 years ago
0
Create a webcrawler that follows standard conventions

#86 MichaelAquilina closed 10 years ago
1
Pages should be penalised for having a lot of high weighted tfidf terms

#85 MichaelAquilina closed 10 years ago
2
Consider Deleting all TermOccurrences with Counter=1

#84 MichaelAquilina closed 10 years ago
2
Wiki.py should occasionally update DocumentFrequencies table for fast lookup

#83 MichaelAquilina closed 10 years ago
0
Calculating corpus size takes too long, should be precalculated

#82 MichaelAquilina closed 10 years ago
2
Consider changing back to InnoDB

#81 MichaelAquilina closed 10 years ago
2
Delete all unprocessed pages on Index completion

#80 MichaelAquilina closed 10 years ago
1
Ordering by Counter is slow because of lack of Index

#79 MichaelAquilina closed 10 years ago
2
Reduce number of terms by filtering tfidf values of query vector

#78 MichaelAquilina closed 10 years ago
0
SQL Queries slow due to Fetch performance

#77 MichaelAquilina closed 10 years ago
1
No proper way to resolve page disambiguation over Captilisation

#76 MichaelAquilina closed 10 years ago
0
Consider creating SQLIndex class which follows same structure of HashedIndex

#75 MichaelAquilina closed 10 years ago
1
Query expansion using wordnet?

#74 MichaelAquilina closed 10 years ago
1
Counting the number of links in Wiki pages could improve accuracy

#73 MichaelAquilina closed 10 years ago
4
Consider more extensive stopword list

#72 MichaelAquilina closed 10 years ago
0
Word Tokeniser failures

#71 MichaelAquilina closed 10 years ago
0
Current word_tokenizer handles http urls really badly

#70 MichaelAquilina closed 10 years ago
0