--
This module converts strings to and from the 2-byte Unicode UCS2 format.
All mappings go through 2-byte UTF16 encodings, not the 1-byte UTF8
encoding. To convert between UTF8 and UTF16, use Unicode::String.
For historical reasons this module coexists with Unicode::Map8. Please use
Unicode::Map8 unless you need to handle two-byte character sets, e.g. Chinese
GB2312. If you stick to the basic functionality (see documentation), you
can use both modules interchangeably.
In practice this module will disappear sooner or later, as Unicode mapping
support needs to get into Perl's core somehow. If you would like to work in
this field, please don't hesitate to contact Gisle Aas!
This module can't deal with UTF8 directly. Use Unicode::String to convert
UTF8 to UTF16 and vice versa.
Characters are mapped according to the binary mapfiles in the Unicode::Map
hierarchy. This module can also create binary mapfiles, so you can install
your own character sets. Refer to mkmapfile or the REGISTRY file in the
Unicode::Map hierarchy.
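
To make the "2-byte" point concrete, here is a minimal sketch in Python.
It is not the Unicode::Map API: it uses Python's standard codecs as a
stand-in for the binary mapfiles. The helper names to_ucs2 and from_ucs2
are hypothetical, and big-endian byte order is an assumption.

```python
def to_ucs2(data: bytes, charset: str) -> bytes:
    """Decode bytes in `charset` and re-encode them as 2-byte units.

    Hypothetical helper; big-endian byte order is an assumption.
    """
    text = data.decode(charset)
    # UCS2 has no surrogate pairs, so reject anything outside the
    # Basic Multilingual Plane instead of silently emitting 4 bytes.
    if any(ord(ch) > 0xFFFF for ch in text):
        raise ValueError("character outside the Basic Multilingual Plane")
    return text.encode("utf-16-be")


def from_ucs2(data: bytes, charset: str) -> bytes:
    """Map 2-byte big-endian units back to bytes in `charset`."""
    return data.decode("utf-16-be").encode(charset)


# Two Chinese characters (U+4E2D, U+6587) in the two-byte GB2312 set.
gb_bytes = "\u4e2d\u6587".encode("gb2312")
ucs2_bytes = to_ucs2(gb_bytes, "gb2312")
```

Each character occupies exactly two bytes in ucs2_bytes, whereas the
same text in UTF8 would take three bytes per character.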
--
Provides an adapter layer between core routines for converting between
UTF8 and other encodings. In essence, it gives multiple existing Unicode
modules a single common interface, so you don't have to know the
underlying implementations to do simple conversions between UTF8 and
other character set encodings. As such, it wraps the
Unicode::String, Unicode::Map8, Unicode::Map and Jcode modules in a
standardized and simple API.
It also provides general character set conversion based on UTF8: any
two compatible, supported character sets can be converted into each
other via a simple two-step chain of conversions.
As with most things Perlish - if you give it a few big chunks of text to
chew on instead of lots of small ones it will handle many more
characters per second.
By design, it can be easily extended to encompass any new charset
encoding conversion modules that arrive on the scene.
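
The two-step chaining described above can be sketched as follows. This
is an illustration in Python using its standard codecs, not the module's
Perl interface; the function names and signatures here are hypothetical.

```python
def to_utf8(data: bytes, charset: str) -> bytes:
    """Step 1: convert bytes in `charset` to UTF8."""
    return data.decode(charset).encode("utf-8")


def from_utf8(data: bytes, charset: str) -> bytes:
    """Step 2: convert UTF8 bytes to `charset`."""
    return data.decode("utf-8").encode(charset)


def convert(data: bytes, source: str, target: str) -> bytes:
    """Chain both steps, using UTF8 as the common intermediate form."""
    return from_utf8(to_utf8(data, source), target)


# "cafe" with an accented e: 0xE9 in ISO-8859-1, 0x82 in CP437.
latin1_bytes = b"caf\xe9"
cp437_bytes = convert(latin1_bytes, "iso-8859-1", "cp437")
```

Because every charset only needs a mapping to and from UTF8, adding a
new charset does not require a dedicated converter for each pair.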
* Fix buffer overflow. (Obtained from Debian)
* Strip leading "../" when creating new files. (Obtained from Debian)
Clean-up by yours truly:
* Parameter parsing fix. (Obtained from Debian)
* Prefer native getopt() and MD5 functions over the included ones.
* Make munpack synopsis agree with actual parameters.
* Respect CC.
- Regen patch
- use distinfo
- Add p5-XML-LibXML to DEPENDS
Dependencies are not yet complete: it needs XML::Checker, which
needs a more recent version of p5-XML-DOM and p5-XML-Parser.
The Text::Iconv module provides a Perl interface to the iconv()
function as defined by the Single UNIX Specification. The convert()
method converts the encoding of characters in the input string from
the fromcode codeset to the tocode codeset, and returns the result.
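
As a rough illustration of that interface shape, the sketch below builds
a converter object once for a fromcode/tocode pair and reuses it for
several strings. It is written in Python on top of the standard codecs
module, not iconv(), and the Converter class is a hypothetical stand-in
rather than the Text::Iconv API.

```python
import codecs


class Converter:
    """Hypothetical converter bound to one fromcode/tocode pair."""

    def __init__(self, fromcode: str, tocode: str) -> None:
        # Fail early (LookupError) if either codeset name is unknown.
        codecs.lookup(fromcode)
        codecs.lookup(tocode)
        self.fromcode = fromcode
        self.tocode = tocode

    def convert(self, data: bytes) -> bytes:
        """Convert `data` from fromcode to tocode and return the result."""
        return data.decode(self.fromcode).encode(self.tocode)


converter = Converter("utf-8", "iso-8859-1")
# U+00FC (u with diaeresis) is two bytes in UTF-8, one in ISO-8859-1.
converted = converter.convert(b"\xc3\xbc")
```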
--
wv is a library which allows access to Microsoft Word files. It can
load and parse the Word 2000, 97, 95 and 6 file formats. These are
the file formats known internally as Word 9, 8, 7 and 6. Work is
underway to support reading earlier formats as well: Word 2 docs
are converted to plaintext.
--
xlHtml is an Excel 95 and later file converter. Its HTML output can
be used as a Netscape Plugin to let you view xls e-mail attachments.
It can also extract regions of a spreadsheet and convert the
spreadsheet to pure text rather than HTML.