mikavilpas / link_title

Irssi plugin for manipulating links sent by others

idea: use HTML::TokeParser::Simple to get title from document #32

Open mikavilpas opened 14 years ago

mikavilpas commented 14 years ago

Idea from here, seems good:

http://stackoverflow.com/questions/29869/regex-to-match-all-html-tags-except-p-and-p/29985#29985

mikavilpas commented 14 years ago

HTML::Parser is already loaded (via HTML::Entities, which ships in the same distribution), so it would be better to use that than to load a completely new module.

mikavilpas commented 14 years ago

HTML::HeadParser looks even better, and it's included in the HTML-Parser distribution as well:

http://search.cpan.org/~gaas/HTML-Parser-3.65/lib/HTML/HeadParser.pm

mikavilpas commented 14 years ago

Opened a new branch, titleParser, to use HTML::HeadParser for getting the page title.
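A minimal sketch of what the HTML::HeadParser approach could look like, assuming the page body has already been fetched and decoded into a Perl string. The helper name `title_from_html` is hypothetical and not taken from the plugin or the titleParser branch. HTML::HeadParser also needs HTTP::Headers to be installed.

```perl
use strict;
use warnings;
use HTML::HeadParser;

# Hypothetical helper (not from the plugin): return the <title> text of an
# HTML document, or undef when no title is found.
sub title_from_html {
    my ($html) = @_;

    my $parser = HTML::HeadParser->new;

    # parse() returns false once the end of <head> (or body content) is
    # reached; only signal eof when the parser is still expecting input.
    $parser->parse($html) and $parser->eof;

    my $title = $parser->header('Title');
    return undef unless defined $title;

    # Collapse runs of whitespace so the title fits on one IRC line.
    $title =~ s/\s+/ /g;
    $title =~ s/^\s+|\s+$//g;
    return $title;
}

my $sample = '<html><head><title>Example Page</title></head>'
           . '<body><p>hello</p></body></html>';
print title_from_html($sample), "\n";
```

Because the parser stops at the end of the head section, only the start of a page needs to be fed to it, which would suit a plugin that only wants the title.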