Text files are nowadays usually encoded in Unicode, and may consist of
very different scripts - from Latin letters to Chinese Hanzi -, with
many kinds of special characters - accents, right-to-left writing marks,
hyphens, Roman numbers, and much more. But the POSIX platform APIs for
text do not contain adequate functions for dealing with particular
properties of many Unicode characters. In fact, the POSIX APIs for text
have several assumptions at their base which don't hold for Unicode
text.
This library provides functions for manipulating Unicode strings and for
manipulating C strings according to the Unicode standard.
small tweaks and ok jasper@
Convert text files with DOS or Mac line breaks to Unix line breaks and
vice versa. Features:
* Automatically skips binary and non-regular files.
* In-place, paired, or stdio mode conversion.
* Keep original file dates option.
* 7-bit and iso conversion modes like SunOS dos2unix.
* Conversion of Windows UTF-16 files to Unix UTF-8.
ok/reminder about /usr/local (now subst'ed in do-configure) ajacoutot@
ok pirofti
64-bit fixes for multi-volume format. From SASANO Takayoshi.
Make a few minor changes while there: Honour CC flag, and use a do-install:
target rather than upstream's makefile which always strips the binaries,
even when DEBUG is set.
Lots of Ruby libraries utilize JSON parsing in some form, and everyone
has their favorite JSON library. In order to best support multiple JSON
parsers and libraries, multi_json is a general-purpose swappable JSON
backend library.
MultiJSON tries to have intelligent defaulting. That is, if you have any
of the supported engines already loaded, it will utilize them before
attempting to load any. When loading, libraries are ordered by speed.
First Yajl-Ruby, then the JSON gem, then JSON pure. If no JSON library
is available, MultiJSON falls back to a bundled version of OkJson.
OK claudio@
Enca is an Extremely Naive Charset Analyser. It detects character set
and encoding of text files and can also convert them to other encodings
using either a built-in converter or external libraries and tools like
libiconv, librecode, etc.
ok aja@
This module provides functions to convert string from/to RFC4648
Base32 encoding, designed to encode non-ASCII characters in
DNS-compatible host name parts (A..Z, 2..7).
This package provide the following perl modules:
* Text::LineFold - Line Folding for Plain Text
* Unicode::GCString - String as Sequence of UAX #29 Grapheme Clusters
* Unicode::LineBreak - UAX #14 Unicode Line Breaking Algorithm
Previously, we were using ruby->=1.8,<=1.9, instead of
ruby->=1.8,<1.9. While this wouldn't cause an issue, since
our ruby-1.9.2 package isn't included in ruby->=1.8,<=1.9,
it's still wrong and should be fixed. This also fixes the
following minor issues:
Switch from using FLAVOR to MODRUBY_FLAVOR for *_DEPENDS.
Currently we don't have a ruby port that uses FLAVORs that
would differ from MODRUBY_FLAVOR, but it's possible we will
in the future.
Switch from BASE_PKGPATH to BUILD_PKGPATH in a few cases in
REGRESS_DEPENDS. This probably is not strictly necessary, but
BUILD_PKGPATH is used in more cases, so it is good for
consistency.
Switch to new style *_DEPENDS, with the version specification
at the end. The remaining cases where this is not done is
because a specific version is used.
Some FULLPKGNAME added to REGRESS_DEPENDS, to make sure that if
the old version is installed when you run a regress test, it
will install the new version first.
Some conversion of spaces to tabs for consistency.
OK landry@
into MODRUBY_WANTLIB and using CONFIGURE_STYLE = ruby gem ext. Use the
lang/ruby module for all dependent ports, setting
MODRUBY_{BUILD,RUN}DEP=No if necessary.
ok landry, phessler, sthen
All ruby .gem files are now hosted on rubygems.org in the same
directory. If the ruby gem CONFIGURE_STYLE is used, make the
default MASTER_SITES that directory.
There are still a few uses of MASTER_SITE_RUBYFORGE in the tree, for
some ports that aren't gems, or where the .gem file isn't hosted on
rubygems.org, or where the hashes don't match. Most of these will be
dealt with in the near future.
OK landry@
In the upgrade from ruby 1.8.6 to 1.8.7, the PLISTs changed
due to differences in how RDoc processes files.
This also has a number of changes to the regress tests to
work with the changes to devel/ruby-rake. It moves most of
the regress tests to use MODRUBY_REGRESS.
OK jcs@, landry@, jasper@, sthen@
This module provides an easy-to-use interface for encoding and decoding
Internationalized Domain Names (IDNs), as described in RFC 3490.
(This module replaces converters/p5-IDN-Punycode)
ok sthen@
This module implements the stringprep framework for preparing Unicode
text strings in order to increase the likelihood that string input and
string comparison work in ways that make sense for typical users
throughout the world.
ok sthen@
breaking cd /usr/ports && SUBDIR=some/path make something for
category makefiles. While there, also put spaces around += uniformously.
okay naddy@, jasper@
This port comes in 2 packages now: -main and -python. That is to keep
the -main dependencies lighter.
Successfully tested by jasper@ and myself.
ok jasper@
From the changelog:
Security Fix for JSON::Pure::Parser. A specially designed string could
cause catastrophic backtracking in one of the parser's regular expressions
in earlier 1.1.x versions. JSON::Ext::Parser isn't affected by this issue.
IMAP mailbox names are encoded in a modified UTF7 when names contain
international characters outside of the printable ASCII range. The
modified UTF-7 encoding is defined in RFC2060 (section 5.1.3).