Letractively / flaxcrawler

Automatically exported from code.google.com/p/flaxcrawler
0 stars 0 forks source link

How to create custom parser #3

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
I think that it would be useful if you could provide a short documentation on 
how to create custom parser. You could write an example where parser extracts 
the images or whatever you like from the Page that it parses.

Original issue reported on code.google.com by g.anagnostaros on 3 May 2011 at 7:11

GoogleCodeExporter commented 8 years ago
Note: It is not defect, but I don't know how to report it properly.

Original comment by g.anagnostaros on 3 May 2011 at 7:12

GoogleCodeExporter commented 8 years ago
Ok, I'll try to make it this week.

Original comment by ay.mesh...@gmail.com on 13 May 2011 at 12:13

GoogleCodeExporter commented 8 years ago
Just reviewed code, it is really not so obvious. In few words: you should 
inherit DefaultParser, set this new class as a generic parser in parser 
controller and implement your own ParserCallback class (that will extract 
images, etc.). I'll create simple sample.

Original comment by ay.mesh...@gmail.com on 13 May 2011 at 12:16