issues
search
Sotera
/
webpageclassifier
Categorizes a website given URL into one of blog|wiki|news|forum|classified|shopping|undecided.
Apache License 2.0
8
stars
3
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
error while executing script
#21
ghost
opened
4 years ago
0
Throws ValueError: Unicode... on some websites
#20
ctwardy
closed
7 years ago
1
Parallel jpl
#19
ctwardy
closed
7 years ago
0
Drop bleach/reload from _score_url()
#18
ctwardy
closed
7 years ago
0
Goldwords files fail on Cyrillic text.
#17
ctwardy
closed
7 years ago
0
Simplify the JPL_Classifier
#16
ctwardy
closed
7 years ago
0
Confusion Matrix labels are wrong
#15
ctwardy
closed
7 years ago
1
Finish integrating ERROR category into scores
#14
ctwardy
closed
7 years ago
1
Merge scikit learn branch back to master.
#13
ctwardy
closed
7 years ago
0
Include errors in results
#12
ctwardy
closed
7 years ago
1
Max scores
#11
ctwardy
closed
7 years ago
0
Merge in max_scores branch
#10
ctwardy
closed
7 years ago
0
Save HTML to file for faster retesting.
#9
ctwardy
closed
7 years ago
0
Improve blog detection
#8
ctwardy
opened
7 years ago
1
Add to web app
#7
ctwardy
opened
7 years ago
0
Test whether max(scores) would outperform sequential rules.
#6
ctwardy
closed
7 years ago
3
Always calculate all 4 cosine scores.
#5
ctwardy
closed
7 years ago
1
ERROR on craigslist.com
#4
ctwardy
closed
7 years ago
6
Improve results
#3
ctwardy
opened
7 years ago
2
forum class_list always blank
#2
ctwardy
closed
7 years ago
1
Handle redirect loops
#1
ctwardy
closed
7 years ago
0