The stem function takes a scalar as a parameter and stems the word
according to Martin Porters Danish stemming algorithm, which can be
found at the Snowball website: <http://snowball.tartarus.org/>.
This module implements a Portuguese stemming algorithm proposed in the
paper A Stemming Algorithm for the Portuguese Language by Moreira, V.
and Huyck, C.
This module provides simple word wrapping. It breaks long lines, but
does not alter spacing or remove existing line breaks. If you're
looking for more sophisticated text formatting, try the Text::Format
module.
In short, Text::Wrapper is the object-oriented equivalent of Text::Wrap,
but with fewer bugs (I hope).
Text::Quoted examines the structure of some text which
may contain multiple different levels of quoting, and
turns the text into a nested data structure.
By default, this module exports a single hash (`%RE') that stores or
generates commonly needed regular expressions. Patterns currently
provided include:
* balanced parentheses and brackets
* delimited text (with escapes)
* integers and floating-point numbers in any base (up to 36)
* comments in C, C++, Perl, and shell
* offensive language
* lists of any pattern
* IPv4 addresses
XML::RAI is an object-oriented layer that maps
overlapping and alternate tags in RSS to one common
simplified interface.
from Sam Smith <s at msmith.net>
XML::RSS::Parser is a lightweight liberal parser of RSS
feeds. This parser is "liberal" in that it does not
demand compliance of a specific RSS version and will
attempt to gracefully handle tags it does not expect or
understand.
from Sam Smith <s at msmith.net>
Universal Feed Parser is a Python module for downloading and parsing
syndicated feeds. It can handle RSS 0.90, Netscape RSS 0.91, Userland
RSS 0.91, RSS 0.92, RSS 0.93, RSS 0.94, RSS 1.0, RSS 2.0, Atom, and CDF
feeds.
Universal Feed Parser is easy to use; the module is self-contained in a
single file, feedparser.py, and it has only one public function, parse.
parse takes a number of arguments, but only one is required, and it can
be a URL, a local filename, or a raw string containing feed data in any
format.