Open slowkow opened 6 years ago
This issue is not specific for htmltab, it occurs within the XML package that htmltab uses internally. This package in turn calls the system library libxml2. This library may be outdated and updating it might solve the issue.
To check the currently installed version of libxml2 from R:
> library(XML)
> libxmlVersion()
$major
[1] "2"
$minor
[1] "09"
$patch
[1] "03"
In this example the version is 2.9.3, the most recent is 2.9.7. RPMs are available on ftp://xmlsoft.org/libxml2/.
I encountered a similar memory leak. Finally found that the issue was caused in a table within a table html files. Since this was there just once in each file, I simply used sed to remove that table within table, and the memory leak was gone. Not sure if this can be helped somehow.
Problem existed both with 2.9.4 and 2.9.9 of libxml2.
Hi Christian,
Thanks for the great package! It is a life saver.
Unfortunately, I ran into a problem. I hope you can help me fix this.
Here is the file I'm reading. I get a segfault when I try to extract the second table:
motif1.info.html.txt
(I had to add .txt to upload to this issue.)
Anyway, I think I'll have to find another way to extract the information out of the file. It turns out that the information I want is actually spread across multiple tables, not just one table. If you have any tips, I'd appreciate it!
Session info