Sphinx is a full-text search engine.
Generally, it's a standalone search engine, meant to provide fast,
size-efficient and relevant fulltext search functions to other
applications. Sphinx was specially designed to integrate well with SQL
databases and scripting languages. Currently built-in data sources
support fetching data either via direct connection to MySQL, or from
an XML pipe.
As for the name, Sphinx is an acronym which is officially decoded as
SQL Phrase Index.
Submitted and maintained by Rama McIntosh.
resulted in pointer truncation on LP64 arch.
patch-debugXML_c - string.h is wrapped in #ifdef, but the macro is
never set by autoconf. Most other source here unconditionally includes
string.h, so follow suit - fgsch@ would prefer mending configure.in.
Explanatory notes added to patch. ok martynas@.
patch-xmlmemory_h - stdlib.h missing. ok fgsch@ martynas@.
based on their docstrings. A lightweight markup language called
epytext can be used to format docstrings, and to add information about
specific fields, such as parameters and instance variables. Epydoc
also understands docstrings written in reStructuredText, Javadoc, and
plaintext.
from Jason Meltzer
tweaks and ok ajacoutot@
- new license (BSD).
- merges in and deprecates textproc/p5-Text-MultMarkdown.
- install {Multi,}Markdown scripts for command line use.
ok merdely, "looks good" sanity check espie@
gtkaml is an XML parser that extends the Vala.Parser (literally) and
transforms all your tags into a valid Gtk+ UI class described in the
Vala language.
ok ajacoutot@
Haml and Sass are templating engines for the two most common types of
documents on the web: HTML and CSS, respectively.
They are designed to make it both easier and more pleasant to code
HTML and CSS documents, by eliminating redundancy, reflecting the
underlying structure that the document represents, and providing
elegant, easily understandable, and powerful syntax.
A framework to allow Ruby applications to generate file/folder stubs
(like the `rails` command does for Ruby on Rails, and the 'script/generate'
command within a Rails application during development).
Hpricot is a fast, flexible HTML parser written in C. It's designed
to be very accommodating (like Tanaka Akira's HTree) and to have a
very helpful library (like some JavaScript libs -- JQuery, Prototype
-- give you.) The XPath and CSS parser, in fact, is based on John
Resig's JQuery.
Also, Hpricot can be handy for reading broken XML files, since many of
the same techniques can be used. If a quote is missing, Hpricot tries
to figure it out. If tags overlap, Hpricot works on sorting them out.
- install egg-info file so that we don't end up with errors like
"pkg_resources.DistributionNotFound: PyXML..." when using some apps that
need this module
lxml is a Pythonic, mature binding for the libxml2 and libxslt libraries.
It provides safe and convenient access to these libraries using the
ElementTree API.
It extends the ElementTree API significantly to offer support for XPath,
RelaxNG, XML Schema, XSLT, C14N and much more.
from Benoit Chesneau <benoitc at metavers dot net>
parts of xpdf, and xpdf -utils provides non-x-depending parts (replacing the
no_x11 flavor). Appropriate @conflict marker makes upgrade flawless.
While here, remove dependency on a specific version of auto* (prompted by naddy@)
(and remember me to never _ever_ touch xpdf again)
Change requested by naddy@ espie@
ok sthen@ naddy@
Markdown is a text-to-HTML filter; it translates an easy-to-read /
easy-to-write structured text format into HTML. Markdown's text format
is most similar to that of plain text email, and supports features such
as headers, *emphasis*, code blocks, blockquotes, and links.
This module implements MultiMarkdown, which is an extension to Markdown.
ok merdely@
libical is an Open Source (MPL/LGPL) implementation of the IETF's
iCalendar Calendaring and Scheduling protocols. (RFC 2445, 2446, and
2447). It parses iCal components and provides a C API for manipulating
the component properties, parameters, and subcomponents.
feedback and ok merdely@ and simon@
documents and output them in various formats, such as HTML or LaTeX. For
details on reStructuredText, see http://docutils.sourceforge.net/rst.html.
From Okan Demirmen
ok simon@
Markdown is a text-to-HTML filter; it translates an easy-to-read /
easy-to-write structured text format into HTML. Markdown's text format
is most similar to that of plain text email, and supports features such
as headers, *emphasis*, code blocks, blockquotes, and links.
from okan at demirmen.com
CGI::FastTemplate manages templates and parses templates replacing
variable names with values. It was designed for mid to large scale web
applications (CGI, mod_perl) where there are great benefits to
separating the logic of an application from the specific implementation
details.
ok jasper@
In general, once you have an XML::Generator object, you then simply call
methods on that object named for each XML tag you wish to generate.
feedback & ok jasper@
AsciiDoc is a text document format for writing short documents, articles, books
and UNIX man pages. AsciiDoc files can be translated to HTML and DocBook
markups using the asciidoc(1) command.
XML::RSS::LibXML uses XML::LibXML (libxml2) for parsing RSS instead of
XML::RSS' XML::Parser (expat), while trying to keep interface
compatibility with XML::RSS.
ok jasper@
XML::Feed is a syndication feed parser for both RSS and Atom feeds. It
also implements feed auto-discovery for finding feeds, given a URI.
XML::Feed supports the following syndication feed formats:
- RSS 0.91
- RSS 1.0
- RSS 2.0
- Atom
The goal of XML::Feed is to provide a unified API for parsing and using
the various syndication formats.
ok jasper@
Atom is a syndication, API, and archiving format for weblogs and other
data. XML::Atom implements the feed format as well as a client for the
API.
ok jasper@
This module provides the user with methods to connect to a remote
server, send a stream of XML to the server, and receive/parse an XML
stream from the server.
ok simon@
Mini-XML is a small XML parsing library that you can use to read XML and
XML-like data files in your application without requiring large
non-standard libraries.
okay steven@
FastRI is an alternative to the ri command-line tool. It is *much* faster,
and also allows you to offer RI lookup services over DRb. FastRI is a bit
smarter than ri, and can find classes anywhere in the hierarchy without
specifying the "full path". It also knows about gems, and can tell you e.g.
which extensions to a core class were added by a specific gem.
RedCloth is a module for using Textile in Ruby. Textile is a text format.
A very simple text format. Another stab at making readable text that can
be converted to HTML.
library. This can be used to embed the swish-e search code into your
perl program avoiding the need to run the swish-e binary for searching.
From simon@
ok sturm@, steven@
collections of Web pages or other files. Swish-e is ideally suited for
collections of a million documents or smaller. Using the GNOME libxml2
parser and a collection of filters, Swish-e can index plain text,
e-mail, PDF, HTML, XML, Microsoft Word/PowerPoint/Excel and just about
any file that can be converted to XML or HTML text. Swish-e is also
often used to supplement databases like the MySQL DBMS for very fast
full-text searching.
help from simon, ok steven@, sturm@
there is only one way to make sure you don't have to bump PKGNAME, build
the package with PLIST_DB and ONLY if it doesn't complain, you don't have
to bump
like grep, aimed at programmers with large trees of heterogeneous source
code. ack is written purely in Perl, and takes advantage of the power
of Perl's regular expressions.
ok simon@