issues
search
rgarner
/
cma-tna-crawlers
Scraping old cases from TNA for CMA, no TLAs.
0
stars
3
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
There are some duplicate cases in the output
#30
rgarner
closed
9 years ago
1
Some newer merger cases have no body copy *or* PDF
#29
rgarner
closed
9 years ago
2
CA98 cases not getting body
#28
rgarner
closed
9 years ago
0
After all crawlers run, these cases referenced in sheets are not in the index
#27
rgarner
opened
9 years ago
3
"Criminal cartels - no charges" is not a schema outcome
#26
rgarner
closed
9 years ago
1
Mergers 10-11 has three rows with no Archive URL
#25
rgarner
closed
9 years ago
1
Work out why so many cases missing from competition/cartels
#24
rgarner
closed
9 years ago
2
Wire up competition/cartels sheets
#23
rgarner
closed
9 years ago
0
Allow `case_type` to be overridden by spreadsheet
#22
rgarner
closed
9 years ago
1
Link mergers sheets to augmenter
#21
rgarner
closed
9 years ago
0
Mergers crawler: no case for ASSET
#20
rgarner
closed
9 years ago
1
Let sheet titles be authoritative
#19
rgarner
closed
9 years ago
0
Create body generator for old-style Mergers cases and Markets
#18
rgarner
closed
9 years ago
2
Fill in mergers summary automatically from title
#17
rgarner
closed
9 years ago
0
Restrict markets crawler to just one URL
#16
rgarner
closed
9 years ago
2
Not sure about full sitemap
#15
rjc123
closed
9 years ago
0
Super-complaints - how to treat?
#14
rgarner
closed
9 years ago
2
Write a crawler for each completed case page
#13
rgarner
closed
9 years ago
2
Stericycle breaking case
#12
rgarner
closed
9 years ago
0
CC crawler not saving subpages
#11
rgarner
closed
9 years ago
0
Link the remaining OFT sheets to the Current Cases crawler
#10
rgarner
closed
9 years ago
1
Link the CC sheet to the crawler
#9
rgarner
closed
9 years ago
2
Bring across body generation code
#8
rgarner
closed
9 years ago
4
OFT current crawlers occasionally truncate summary
#7
rgarner
closed
9 years ago
0
Fill in market_sector
#6
rgarner
closed
9 years ago
1
Fill in case type
#5
rgarner
closed
9 years ago
1
Write a mergers crawler
#4
rgarner
closed
9 years ago
5
Find out if OFT URLs we're explicitly ignoring are important
#3
rgarner
closed
9 years ago
6
Prevent overwriting of some cases
#2
rgarner
closed
9 years ago
1
Filter cases we've already seen
#1
rgarner
closed
9 years ago
2