Leibniz-HBI/newsfeedback
Tool for extracting and saving news article metadata (and optionally content) at regular intervals.
MIT License
3 stars
0 forks
issues
ImportError: Update the lxml dependency
#51
rwinterschlaf
opened
5 days ago
0
CSV splitting issue
#50
rwinterschlaf
opened
5 days ago
1
Make newsfeedback functional without having the repository installed
#49
rwinterschlaf
opened
5 days ago
2
Missing backslash at end of URL causes pipeline-picker error
#48
rwinterschlaf
opened
5 days ago
0
Add in empty dataframe warning for non-pur abo platforms / Rework infos/warnings
#47
rwinterschlaf
opened
1 year ago
2
Adapted browser language and added further waits
#46
rwinterschlaf
closed
1 year ago
0
Pur abo related fixes
#45
rwinterschlaf
closed
1 year ago
1
Multiple changes to WebDriver and added mocking
#44
rwinterschlaf
closed
1 year ago
1
Adjustments to minimize issues caused by timeouts at start
#43
rwinterschlaf
closed
1 year ago
1
updated page load time out and further timeoutexception circumvention
#42
rwinterschlaf
closed
1 year ago
0
Pur abo related fixes
#41
rwinterschlaf
closed
1 year ago
0
Pur abo related fixes
#40
rwinterschlaf
closed
1 year ago
0
Tackle TimeOut Errors by mocking them
#39
rwinterschlaf
opened
1 year ago
0
Added waits and changed the default page load timeout
#38
rwinterschlaf
closed
1 year ago
2
Publish on PyPI
#37
FlxVctr
closed
1 year ago
2
Moved the driver.quit() command around
#36
rwinterschlaf
closed
1 year ago
0
Remove cookies upon start of new run
#35
rwinterschlaf
closed
1 year ago
0
Set up email notification feature if data extraction failed
#34
rwinterschlaf
opened
1 year ago
0
Added formatting rules for comments and texts
#33
rwinterschlaf
closed
1 year ago
0
Fix issues with text and comment extraction
#32
rwinterschlaf
opened
1 year ago
2
Config-writing and picking tests were added
#31
rwinterschlaf
closed
1 year ago
0
Improve tests and functions dealing with config reading/writing
#30
rwinterschlaf
opened
1 year ago
6
test pull request for github actions
#29
rwinterschlaf
closed
1 year ago
3
Refactored, tidied and now featuring scheduling!
#28
rwinterschlaf
closed
1 year ago
0
Add and test scheduling function
#27
rwinterschlaf
closed
1 year ago
2
Interactive file writing to add new URLs to homepage config
#26
rwinterschlaf
closed
1 year ago
1
Config file implementation issue 23
#25
rwinterschlaf
closed
1 year ago
0
Create pipelines by chaining functions
#24
rwinterschlaf
closed
1 year ago
1
Implement config files for ease of usage and customization
#23
rwinterschlaf
closed
1 year ago
2
Cleaned up code, all tests up and running
#22
rwinterschlaf
closed
1 year ago
0
Example for not-so-weird function calls
#21
FlxVctr
opened
1 year ago
0
Consolidate, refactor and/or reorganize functions/tests to tackle the duplicates
#20
rwinterschlaf
closed
1 year ago
1
Add tests for various metadata configurations
#19
rwinterschlaf
closed
1 year ago
1
Proofread, finalize and streamline documentation and help texts
#18
rwinterschlaf
closed
1 year ago
2
Rename Best and Worst Case Pipelines
#17
rwinterschlaf
closed
1 year ago
2
Try headless Selenium browser automation
#16
FlxVctr
closed
1 year ago
0
Pur abo bypass in pipelines issue 13
#15
rwinterschlaf
closed
1 year ago
0
Make filter whitelist configurable
#14
rwinterschlaf
opened
1 year ago
0
Add Pur Abo bypass to existing extraction pipelines
#13
rwinterschlaf
closed
1 year ago
2
Successful bypass of pur abo barrier
#12
rwinterschlaf
closed
1 year ago
0
Filtration based on URL structure + relevant tests
#11
rwinterschlaf
closed
1 year ago
0
Bring tool to the command line with click
#10
rwinterschlaf
closed
1 year ago
2
Bypass "Pur Abo" Barriers by clicking the accept button
#9
rwinterschlaf
closed
1 year ago
0
9 tests passing - best case and worst case pipeline
#8
rwinterschlaf
closed
1 year ago
1
Set up GitHub Actions to test on Unix system
#7
FlxVctr
closed
1 year ago
4
Six successful tests
#6
rwinterschlaf
closed
1 year ago
0
Allow users to filter post-extraction
#5
rwinterschlaf
closed
1 year ago
1
Resolve date_parser/pytz warning
#4
rwinterschlaf
closed
1 year ago
2
Ensure tool functionality with "big" German news media outlets
#3
rwinterschlaf
closed
1 year ago
3
Translate user stories into functional tests
#2
rwinterschlaf
closed
1 year ago
1