issues
search
Letractively
/
abot
Automatically exported from code.google.com/p/abot
Apache License 2.0
0
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Work on IThreadManagers
#80
GoogleCodeExporter
closed
9 years ago
8
compare mono vs windows performance on apples to apples hardware
#79
GoogleCodeExporter
opened
9 years ago
3
CsQuery blows up on double encoding
#78
GoogleCodeExporter
closed
9 years ago
1
HtmlAgilityPack throws StackOverflowException on pages with lots of nested tags
#77
GoogleCodeExporter
closed
9 years ago
9
Add IEnumerable<Uri> PageLinks
#76
GoogleCodeExporter
closed
9 years ago
6
Implement robots no follow
#75
GoogleCodeExporter
closed
9 years ago
7
Add automatic throttling
#74
GoogleCodeExporter
opened
9 years ago
1
Create BulkCrawler that manages multiple instance of the IWebCrawler
#73
GoogleCodeExporter
closed
9 years ago
1
Make CrawledPage.CsQueryDocument & CrawledPage.HtmlDocument ILazy<T>
#72
GoogleCodeExporter
closed
9 years ago
1
add a dynamic crawlbag so users may pass in a custom object that will be added to the crawl context
#71
GoogleCodeExporter
closed
9 years ago
0
[deleted issue]
#70
GoogleCodeExporter
closed
9 years ago
0
[deleted issue]
#69
GoogleCodeExporter
closed
9 years ago
0
Make crawlconfiguration modifiable after it is loaded from app.config file
#68
GoogleCodeExporter
closed
9 years ago
1
Completely remove ilmerge due to several issues
#67
GoogleCodeExporter
closed
9 years ago
3
Remove log4net from ilmerge command
#66
GoogleCodeExporter
closed
9 years ago
2
Extract postbuild commands into bat and bash files
#65
GoogleCodeExporter
closed
9 years ago
2
Add IsDecisionPermanent property to CrawlDecision
#64
GoogleCodeExporter
closed
9 years ago
1
CrawlResult.ErrorOccurred and CrawlResult.ErrorMessage are never set outside of unit tests
#63
GoogleCodeExporter
closed
9 years ago
2
Add crawl context to event args to make them available to event subscribers
#62
GoogleCodeExporter
closed
9 years ago
2
Create new log file with website name on every crawl
#61
GoogleCodeExporter
closed
9 years ago
1
Add IScheduler to the crawl context so people can add urls during the crawl.
#60
GoogleCodeExporter
closed
9 years ago
2
Add htmlagiliypack loaded html document to crawled page so more parsing can take place
#59
GoogleCodeExporter
closed
9 years ago
2
Crawler crawls over MaxPagesToCrawl by up to X pages. X being the number of MaxConcurrentThreads
#58
GoogleCodeExporter
closed
9 years ago
1
Update documentation to reflect 1.1 changes
#57
GoogleCodeExporter
closed
9 years ago
12
Reconsider targeting .net 4.0 so VS 2010 users can work with the source code.
#56
GoogleCodeExporter
closed
9 years ago
2
Limit memory usage for the process running Abot
#55
GoogleCodeExporter
closed
9 years ago
5
Add SimulateUserClicks config value
#54
GoogleCodeExporter
opened
9 years ago
7
[deleted issue]
#53
GoogleCodeExporter
closed
9 years ago
0
Abot.Tests.Integration is not logging all library log statements
#52
GoogleCodeExporter
closed
9 years ago
3
Add config value for MaxPagesToCrawlPerDomain
#51
GoogleCodeExporter
opened
9 years ago
3
Make Abot check its version an if less than the latest "featured" version log a message suggesting an update
#50
GoogleCodeExporter
closed
9 years ago
2
Add lic text to each page
#49
GoogleCodeExporter
closed
9 years ago
2
[deleted issue]
#48
GoogleCodeExporter
closed
9 years ago
0
Add page for custom crawler work by hour
#47
GoogleCodeExporter
closed
9 years ago
2
Create google groups discussion
#46
GoogleCodeExporter
closed
9 years ago
2
Add constructor to webcrawler that takes only ICrawlDecisionMaker and both ICrawlDecisionMaker and CrawlConfiguration
#45
GoogleCodeExporter
closed
9 years ago
2
Add abot version dynamically to user agent string
#44
GoogleCodeExporter
closed
9 years ago
2
Think about moving unique uri crawling check/logic to IScheduler
#43
GoogleCodeExporter
closed
9 years ago
1
Use concurrent collections for Scheduler and CrawlContext.CrawledUris
#42
GoogleCodeExporter
closed
9 years ago
3
Implement use of isUriRecrawlingEnabled
#41
GoogleCodeExporter
closed
9 years ago
2
Implement use of downloadableContentTypes config value
#40
GoogleCodeExporter
closed
9 years ago
2
Implement manual crawl delay
#39
GoogleCodeExporter
closed
9 years ago
4
Implement crawl timeout
#38
GoogleCodeExporter
closed
9 years ago
1
Implement crawl depth
#37
GoogleCodeExporter
closed
9 years ago
4
Update all assemblies to 4.5
#36
GoogleCodeExporter
closed
9 years ago
2
Update documentation/Downloads
#35
GoogleCodeExporter
closed
9 years ago
4
Use Vs fakes to raise code coverage on untestable code
#34
GoogleCodeExporter
closed
9 years ago
6
Consider using CsQuery as the parser
#33
GoogleCodeExporter
closed
9 years ago
4
Spread the word
#32
GoogleCodeExporter
closed
9 years ago
2
Use ILMerge to create a single Abot.dll with all dependent dlls
#31
GoogleCodeExporter
closed
9 years ago
6
Previous
Next