Commit Graph

10 Commits

Author SHA1 Message Date
avsm
c9b288fc3a update to p5-HTML-Parser-3.23 2001-05-06 19:32:00 +00:00
avsm
ba7e9726e2 update to p5-HTML-Parser-3.22 2001-04-18 21:46:14 +00:00
avsm
9e50bc376e update to p5-HTML-Parser-3.21 2001-04-11 15:23:59 +00:00
avsm
3aef40a3c9 update to p5-HTML-Parser-3.20 and integrate COMMENT 2001-04-10 00:15:51 +00:00
avsm
e6105e2375 Update to p5-HTML-Parser-3.17
Fixes the somewhat obscure perl multiplicity builds
2001-02-24 09:02:07 +00:00
avsm
8787baa606 Upgrade to p5-HTML-Parser-3.16
Relevant changes:

The unbroken_text option now works across ignored tags.
Expose internal decode_entities() function at the Perl level.
2001-02-24 01:24:49 +00:00
avsm
3fdc6aad7e cull .packlist from the PLIST 2001-01-08 21:07:39 +00:00
avsm
05866963a8 - upgrade to p5-HTML-Parser-3.15
- bump NEED_VERSION

--
 HTML::TokeParser's get_tag() method now takes multiple
 tags to match.  Hopefully the documentation is also a bit clearer.

 #define PERL_NO_GET_CONTEXT: Should speed up things for thread
 enabled versions of perl.

 Quote some more entities that also happen to be perl keywords.
 This avoids warnings on perl-5.004.

 Unicode entities only triggered for perl-5.7.0 or higher.
2000-12-27 21:52:07 +00:00
avsm
325c73ba3d - upgrade to HTML-Parser-3.14
- bump NEED_VERSION
- deactivate experimental unicode support since it doesn't work yet on our
  perl setup (make tests all fail with missing symbols)
2000-12-05 14:05:05 +00:00
avsm
6c45943bf6 import HTML-Parser-3.13 from CPAN
guess what, ok'ed by brad@ !
--

This is a collection of modules that parse and extract information
from HTML documents.  Bug reports and discussions about these modules
can be sent to the <libwww@perl.org> mailing list.  Remember to
also look at the HTML-Tree package that creates and extracts
information from HTML syntax trees.

The modules present in this collection are:

  HTML::Parser - The parser base class.  It receives arbitrarily sized
  chunks of the HTML text, recognizes markup elements, and
  separates them from the plain text.  As different kinds of
  markup and text are recognized, the corresponding event
  handlers are invoked.

  HTML::Entities - Provides functions to encode and decode text
  with embedded HTML &lt;entities&gt;.

  HTML::HeadParser - A lightweight HTML::Parser subclass that
  extracts information from the <HEAD> section of an HTML document.

  HTML::LinkExtor - An HTML::Parser subclass that extracts links
  from an HTML document.

  HTML::TokeParser - An alternative interface to the basic parser
  that does not require event driven programming.  Most simple
  parsing needs are probably best attacked with this module.
2000-09-29 10:50:53 +00:00