Treetop is a Ruby-based DSL for text parsing and interpretation.
It facilitates an extension of the object-oriented paradigm called
syntax-oriented programming.
ok bernd@
Namazu is a full-text indexer/search engine intended for easy use.
Not only does it work as a small or medium scale Web search engine,
but also as a personal search system for email or other files.
It provides a CGI interface for web searches, and a command-line
search tool. Third-party frontends are available such as namazu.el
and Wanderlust on Emacs and Tknamazu on X Window System.
Filters enable namazu to index various formats of files. Some are
standalone (e.g. Mail/News); others require external dependencies.
Libtextcat is a library with functions that implement the classification
technique described in Cavnar & Trenkle, "N-Gram-Based Text
Categorization". It was primarily developed for language guessing, a
task on which it is known to perform with near-perfect accuracy.
Based on the FreeBSD port.
This is a prerequisite for pinot.
Hunspell is a spell checker and morphological analyzer library and
program designed for languages with rich morphology and complex word
compounding or character encoding.
Note that this is not to be considered as an aspell replacement just
yet. We install no hunspell dictionnaries for now but use the ones from
mozilla.
Reworked from an original port by Edd Barrett (maintainer).
Tested by sthen@ in a bulk, thanks!
ok sthen@
This module parses a query string into a data structure to be handled
by external search engines. For examples of such engines, see
File::Tabular and Search::Indexer.
The query string can contain simple terms, "exact phrases", field
names and comparison operators, '+/-' prefixes, parentheses,
and boolean connectors.
from Ian Mcwilliam (MAINTAINER)
Catfish is a handy file searching tool for linux and unix. Basically it
is a frontend for different search engines (daemons) which provides a
unified interface. The interface is intentionally lightweight and
simple, using only GTK+2. You can configure it to your needs by using
several command line options.
ok ajacoutot@
Meld is a visual diff and merge tool. You can compare two or three files
and edit them in place (diffs update dynamically). You can compare two
or three folders and launch file comparisons. You can browse and view a
working copy from popular version control systems such such as CVS,
Subversion, Bazaar-ng and Mercurial if the corresponding commands are
installed.
ok ajacoutot@ wcmaier@