jesbin / crawler4j

Automatically exported from code.google.com/p/crawler4j
0 stars 0 forks source link

JVM heap size keeps increasing considerably #343

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
What steps will reproduce the problem?
1.a while loop ( creating crawlController and crawling a website ) at each 
iteration
2. the crawlController have been slightly modified so it could have something 
to do with it
3.

What is the expected output? What do you see instead?

After each crawling operation, the heap size is supposed to go back the way it 
was before it, seeing how it's a series of independent operations.

but instead, the heap size keeps increasing with no end until it crashes.

What version of the product are you using?

3.5

Please provide any additional information below.

After using yourKit Java profiler to locate what lines of code were exactly 
consuming this memory, I ended up at this : 

Environment env = new Environment(envHome, envConfig);

this line from the crawlController, was consuming over 620 MB from a total of 
650 MB.

Now I don't know what's exactly in this env object so I can't diagose it, but I 
hope you guys can help me with that.

I will include a screenshot from the profiler's result and the new 
crawlController I have wrote.
(I think you should check the constructor and the start function) 

Original issue reported on code.google.com by feuoo...@gmail.com on 27 Jul 2015 at 11:43

Attachments: