- bump NEED_VERSION
--
HTML::TokeParser's get_tag() method now takes multiple
tags to match. Hopefully the documentation is also a bit clearer.
#define PERL_NO_GET_CONTEXT: Should speed up things for thread
enabled versions of perl.
Quote some more entities that also happens to be perl keywords.
This avoids warnings on perl-5.004.
Unicode entities only triggered for perl-5.7.0 or higher.
guess what, ok'ed by brad@ !
--
This is a collection of modules that parse and extract information
from HTML documents. Bug reports and discussions about these modules
can be sent to the <libwww@perl.org> mailing list. Remember to
also look at the HTML-Tree package that creates and extracts
information from HTML syntax trees.
The modules present in this collection are:
HTML::Parser - The parser base class. It receives arbitrary sized
chunks of the HTML text, recognizes markup elements, and
separates them from the plain text. As different kinds of
markup and text are recognized, the corresponding event
handlers are invoked.
HTML::Entities - Provides functions to encode and decode text
with embedded HTML >entities>.
HTML::HeadParser - A lightweight HTML::Parser subclass that
extractsinformation from the <HEAD> section of an HTML document.
HTML::LinkExtor - An HTML::Parser subclass that extracts links
from an HTML document.
HTML::TokeParser - An alternative interface to the basic parser
that does not require event driven programming. Most simple
parsing needs are probably best attacked with this module.