lttoolbox is a toolbox for lexical processing, morphological analysis
and generation of words. Analysis is the process of splitting a word
(e.g., cats) into its lemma 'cat' and the grammatical information
<n><pl>. Generation is the opposite process.
The package is split into three programs: lt-comp, the compiler;
lt-proc, the processor; and lt-expand, which generates all possible
mappings between surface forms and lexical forms in the dictionary.
ok jasper@
apertium-lex-tools provides a module for compiling lexical selection rules
and processing them in the pipeline. Rules can be manually written, or
learnt from monolingual or parallel corpora.
ok jasper@
Apertium is a toolbox to build open-source shallow-transfer machine
translation systems, especially suitable for related language pairs: it
includes the engine, maintenance tools, and open linguistic data for
several language pairs.
ok jasper@
TinyXML-2 is a simple, small, efficient, C++ XML parser that can be easily
integrated into other programs.
In brief, TinyXML-2 parses an XML document, and builds from that a Document
Object Model (DOM) that can be read, modified, and saved.
tweak + ok sthen@
pugixml is a portable and light-weight C++ XML processing library.
It features:
- DOM-like interface with rich traversal/modification capabilities
- Extremely fast non-validating XML parser which constructs the DOM tree
from an XML file/buffer
- XPath 1.0 implementation for complex data-driven tree queries
- Full Unicode support with Unicode interface variants and automatic
encoding conversions