issues
search
jakopako
/
goskyr
A configurable command-line web scraper written in go with auto configuration capability
GNU General Public License v3.0
33
stars
5
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
auto date extraction sets date_location to CEST but this can't be parsed
#209
jakopako
closed
1 year ago
1
auto config extract doesn't properly work for winterthurer musikfestwochen
#208
jakopako
closed
3 months ago
2
Add dutch to date auto format extraction + other fix
#207
jakopako
closed
1 year ago
0
Add dutch to date auto format extraction
#206
jakopako
closed
1 year ago
0
auto extract bug for https://www.bimhuis.nl/?agenda=true
#205
jakopako
closed
3 months ago
1
Jakopako/issue200
#204
jakopako
closed
1 year ago
0
Add paging auto detect to auto config extraction
#203
jakopako
opened
1 year ago
0
Add json interpreter
#202
jakopako
opened
1 year ago
1
Jakopako/issue193
#201
jakopako
closed
1 year ago
0
Auto config extraction doesn't properly when nth-child() would be needed
#200
jakopako
closed
1 year ago
2
Bump github.com/goodsign/monday from 1.0.0 to 1.0.1
#199
dependabot[bot]
closed
1 year ago
0
Bump github.com/chromedp/chromedp from 0.8.8 to 0.9.1
#198
dependabot[bot]
closed
1 year ago
0
Jakopako/issue188
#197
jakopako
closed
1 year ago
0
Auto config bug with url https://www.leuvenjazz.be/nl/programma?discipline=3
#196
jakopako
closed
3 months ago
2
Try https://github.com/jszwec/csvutil for feature extraction
#195
jakopako
closed
5 months ago
1
Document ML capability
#194
jakopako
closed
1 year ago
0
Improve auto extract: automatically find date format of extracted date fields
#193
jakopako
closed
1 year ago
1
Bump golang.org/x/net from 0.7.0 to 0.8.0
#192
dependabot[bot]
closed
1 year ago
0
Bump github.com/chromedp/chromedp from 0.8.7 to 0.8.8
#191
dependabot[bot]
closed
1 year ago
0
Document ML capability
#190
jakopako
closed
1 year ago
0
Ml 1 - first attempt of using machine learning to further ease the process of auto generating a config
#189
jakopako
closed
1 year ago
0
Add concept of 'multi-fields'
#188
jakopako
closed
1 year ago
0
why is itemsChannel of length len(config.Scrapers)?
#187
jakopako
closed
5 months ago
1
autoextract config for https://www.heldenbar.ch/programm/ doesn't generate working config
#186
jakopako
closed
1 year ago
0
allow configuring a default for the date components.
#185
jakopako
opened
1 year ago
0
autoextract config for https://www.heldenbar.ch/programm/ doesn't generate working config
#184
jakopako
closed
1 year ago
1
Enable extracting text for fields that have the same selector as the list item
#183
jakopako
closed
1 year ago
0
contributors & new contributors mention in releases
#182
jakopako
opened
1 year ago
0
auto release new version on tagging
#181
jakopako
opened
1 year ago
0
Add configuration and functionality for transforming date components before futher processing
#180
MarkJaroski
closed
1 year ago
1
Update README.md
#179
laerm
closed
1 year ago
1
Bump github.com/gdamore/tcell/v2 from 2.5.4 to 2.6.0
#178
dependabot[bot]
closed
1 year ago
0
Add description of `entire_subtree` param to readme
#177
jakopako
closed
1 year ago
0
Bump github.com/PuerkitoBio/goquery from 1.8.0 to 1.8.1
#176
dependabot[bot]
closed
1 year ago
0
Enable extracting text for fields that have the same selector as the list item
#175
jakopako
closed
1 year ago
0
Find systemic solution for parsing dates that https://github.com/goodsign/monday doesn't parse correctly
#174
jakopako
closed
1 year ago
1
Hack solution for #172 three char abbrs. at Pole Sud,
#173
MarkJaroski
closed
1 year ago
1
Pôle Sud events in February not processed correctly
#172
MarkJaroski
closed
1 year ago
1
Documented entire_subtree
#171
MarkJaroski
closed
1 year ago
0
Autoextract bug with url https://www.koko.co.uk/whats-on
#170
jakopako
closed
1 year ago
0
Add description of `entire_subtree` param to readme
#169
jakopako
closed
1 year ago
0
Support times with only the hour part
#168
MarkJaroski
closed
1 year ago
2
Autoextract bug with url https://gaskessel.ch/programm/
#167
jakopako
closed
1 year ago
0
Autoextract bug with url https://www.koko.co.uk/whats-on
#166
jakopako
closed
1 year ago
3
automatic retry when no fields found
#165
jakopako
opened
1 year ago
1
upgrade to codeql v2
#164
jakopako
closed
5 months ago
0
make field names of auto extract more meaningful
#163
jakopako
opened
1 year ago
1
Bump github.com/chromedp/chromedp from 0.8.6 to 0.8.7
#162
dependabot[bot]
closed
1 year ago
0
make wait time of dynamic fetcher configurable
#161
jakopako
opened
1 year ago
1
Bump github.com/gdamore/tcell/v2 from 2.5.3 to 2.5.4
#160
dependabot[bot]
closed
1 year ago
0
Previous
Next