import HTML-Parser-3.13 from CPAN

guess what, ok'ed by brad@ !
--

This is a collection of modules that parse and extract information
from HTML documents.  Bug reports and discussions about these modules
can be sent to the <libwww@perl.org> mailing list.  Remember to
also look at the HTML-Tree package that creates and extracts
information from HTML syntax trees.

The modules present in this collection are:

  HTML::Parser - The parser base class.  It receives arbitrary sized
  chunks of the HTML text, recognizes markup elements, and
  separates them from the plain text.  As different kinds of
  markup and text are recognized, the corresponding event
  handlers are invoked.

  HTML::Entities - Provides functions to encode and decode text
  with embedded HTML &gt;entities&gt;.

  HTML::HeadParser - A lightweight HTML::Parser subclass that
  extractsinformation from the <HEAD> section of an HTML document.

  HTML::LinkExtor - An HTML::Parser subclass that extracts links
  from an HTML document.

  HTML::TokeParser - An alternative interface to the basic parser
  that does not require event driven programming.  Most simple
  parsing needs are probably best attacked with this module.
This commit is contained in:
avsm 2000-09-29 10:50:53 +00:00
parent f0d5e44dce
commit 6c45943bf6
6 changed files with 82 additions and 0 deletions

View File

@ -0,0 +1,22 @@
# $OpenBSD: Makefile,v 1.1.1.1 2000/09/29 10:50:53 avsm Exp $
DISTNAME= HTML-Parser-3.13
PKGNAME= p5-${DISTNAME}
CATEGORIES= www perl5
NEED_VERSION= 1.336
MASTER_SITES= ${MASTER_SITE_PERL_CPAN}
MASTER_SITE_SUBDIR= HTML
BUILD_DEPENDS= ${LOCALBASE}/libdata/perl5/site_perl/HTML/Tagset.pm::www/p5-HTML-Tagset
RUN_DEPENDS= ${BUILD_DEPENDS}
MAINTAINER= Anil Madhavapeddy <avsm@openbsd.org>
PERMIT_PACKAGE_CDROM= Yes
PERMIT_PACKAGE_FTP= Yes
PERMIT_DISTFILES_CDROM= Yes
PERMIT_DISTFILES_FTP= Yes
CONFIGURE_STYLE= perl
.include <bsd.port.mk>

View File

@ -0,0 +1,3 @@
MD5 (HTML-Parser-3.13.tar.gz) = 75a61c3207923774d028bebbadee35aa
RMD160 (HTML-Parser-3.13.tar.gz) = fa7da552fa015c5299bf1cc654ee36d0be4d75be
SHA1 (HTML-Parser-3.13.tar.gz) = 36ce14f754fb750e8832380cbbb89cf3a10feff9

View File

@ -0,0 +1,13 @@
$OpenBSD: patch-Makefile.PL,v 1.1.1.1 2000/09/29 10:50:53 avsm Exp $
--- Makefile.PL.orig Tue Sep 26 18:36:25 2000
+++ Makefile.PL Tue Sep 26 18:38:07 2000
@@ -19,7 +19,8 @@ select no to the question below such ent
only entities in the Latin-1 range is decoded.
EOT
- my $ans = prompt("Do you want decoding on unicode entities?", "no");
+# my $ans = prompt("Do you want decoding on unicode entities?", "no");
+ my $ans = 'yes';
if ($ans =~ /^y(es)?$/i) {
push(@define, "-DUNICODE_ENTITIES");
}

View File

@ -0,0 +1 @@
modules to parse and extract information from HTML

View File

@ -0,0 +1,26 @@
This is a collection of modules that parse and extract information
from HTML documents. Bug reports and discussions about these modules
can be sent to the <libwww@perl.org> mailing list. Remember to
also look at the HTML-Tree package that creates and extracts
information from HTML syntax trees.
The modules present in this collection are:
HTML::Parser - The parser base class. It receives arbitrary sized
chunks of the HTML text, recognizes markup elements, and
separates them from the plain text. As different kinds of
markup and text are recognized, the corresponding event
handlers are invoked.
HTML::Entities - Provides functions to encode and decode text
with embedded HTML &gt;entities&gt;.
HTML::HeadParser - A lightweight HTML::Parser subclass that
extractsinformation from the <HEAD> section of an HTML document.
HTML::LinkExtor - An HTML::Parser subclass that extracts links
from an HTML document.
HTML::TokeParser - An alternative interface to the basic parser
that does not require event driven programming. Most simple
parsing needs are probably best attacked with this module.

View File

@ -0,0 +1,17 @@
@comment $OpenBSD: PLIST,v 1.1.1.1 2000/09/29 10:50:53 avsm Exp $
libdata/perl5/site_perl/${ARCH}-openbsd/HTML/Entities.pm
libdata/perl5/site_perl/${ARCH}-openbsd/HTML/Filter.pm
libdata/perl5/site_perl/${ARCH}-openbsd/HTML/HeadParser.pm
libdata/perl5/site_perl/${ARCH}-openbsd/HTML/LinkExtor.pm
libdata/perl5/site_perl/${ARCH}-openbsd/HTML/Parser.pm
libdata/perl5/site_perl/${ARCH}-openbsd/HTML/TokeParser.pm
libdata/perl5/site_perl/${ARCH}-openbsd/auto/HTML/Parser/.packlist
libdata/perl5/site_perl/${ARCH}-openbsd/auto/HTML/Parser/Parser.bs
libdata/perl5/site_perl/${ARCH}-openbsd/auto/HTML/Parser/Parser.so
man/man3/HTML::Entities.3p
man/man3/HTML::Filter.3p
man/man3/HTML::HeadParser.3p
man/man3/HTML::LinkExtor.3p
man/man3/HTML::Parser.3p
man/man3/HTML::TokeParser.3p
@dirrm libdata/perl5/site_perl/${ARCH}-openbsd/auto/HTML/Parser