issues
search
DistrictDataLabs
/
baleen
An automated ingestion service for blogs to construct a corpus for NLP research.
MIT License
85
stars
38
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
conect with mongodb
#97
nikolandrush
closed
4 years ago
1
xmlPaths in .opml feed definition files are unescaped
#96
agodbehere
opened
6 years ago
1
Export to directory other than '.' fails
#95
agodbehere
opened
6 years ago
1
Fix code block markup display in README
#94
DanielJohnBenton
closed
6 years ago
2
Issue #89 - add a command line flag to export for sanitize level
#93
janetriley
closed
10 months ago
7
Corrected README code snippet
#92
dmahendrakar
closed
7 years ago
4
Export Compressed Posts
#91
bbengfort
opened
7 years ago
1
Add load from csv
#90
janetriley
opened
7 years ago
3
move sanitize to its own exporter option
#89
janetriley
opened
7 years ago
2
Issue #87 Make sanitization happen in Post.htmlize()
#88
janetriley
closed
7 years ago
6
Move html sanitization to Post
#87
janetriley
closed
7 years ago
2
Issue #83 PEP8 cleanup
#86
janetriley
closed
7 years ago
1
Issue #80
#85
janetriley
closed
7 years ago
4
Change post object in order to avoid duplicate fetch
#84
tmeshorer
opened
7 years ago
0
PEP8 cleanup
#83
janetriley
closed
7 years ago
5
Add tests to sanitize
#82
janetriley
closed
7 years ago
2
Develop
#81
marcocarranza
closed
7 years ago
0
Update baleen github repo url in docs
#80
janetriley
closed
7 years ago
4
Configurable Scheduling
#79
will2041
opened
7 years ago
0
Examples for documentation
#78
rebeccabilbro
opened
7 years ago
1
Baleen add2venv
#77
bbengfort
closed
7 years ago
8
README Markdown messed up
#76
bbengfort
closed
7 years ago
1
Formalize Mongo Schema
#75
will2041
opened
7 years ago
0
Use Timeout Decorator
#74
will2041
opened
7 years ago
0
Unicode decode error
#73
bbengfort
opened
7 years ago
3
55 update docker image to python35
#72
lauralorenz
closed
7 years ago
2
Write tests to make clear which Feed attributes could be changed
#71
olgert
opened
8 years ago
0
Write tests for exporter sanitization functions
#70
echolabstech
closed
7 years ago
3
document exporter commandline options
#69
echolabstech
opened
8 years ago
0
export commandline options
#68
echolabstech
closed
7 years ago
1
Develop
#67
echolabstech
closed
8 years ago
0
NotUniqueError caused by downloading non-changed feed content #52
#66
olgert
closed
8 years ago
2
Make posts.htmlize() smarter
#65
echolabstech
opened
8 years ago
0
More tests for improved code coverage
#64
will2041
closed
8 years ago
1
Additional tests for export
#63
will2041
closed
8 years ago
1
Add timeouts for fetching and wrangling posts
#62
pingihu
closed
8 years ago
1
Additional tests for version for 100% coverage
#61
will2041
closed
8 years ago
1
Add example feeds files
#60
will2041
closed
8 years ago
1
Issue 45: add version to footer
#59
janetriley
closed
8 years ago
2
Updates to Python version numbers and remove unused function
#58
will2041
closed
8 years ago
2
Python 3.X migration of README.md
#57
dongbohu
closed
8 years ago
1
Update README.md for Python 3.x Migration
#56
dongbohu
closed
8 years ago
2
Update Docker image to Python 3.5
#55
bbengfort
closed
7 years ago
3
Python3 support for application and tests
#54
will2041
closed
8 years ago
2
Docker image is empty
#53
janetriley
closed
7 years ago
2
NotUniqueError caused by downloading non-changed feed content
#52
olgert
closed
8 years ago
0
Python 3.5 Support
#51
bbengfort
closed
8 years ago
1
commit seed file to /fixtures
#50
echolabstech
closed
8 years ago
6
Update Quickstart documentation
#49
janetriley
opened
8 years ago
4
Update to use Python 3.5
#48
janetriley
opened
8 years ago
20
Next