042280b96b
collections of Web pages or other files. Swish-e is ideally suited for collections of a million documents or smaller. Using the GNOME libxml2 parser and a collection of filters, Swish-e can index plain text, e-mail, PDF, HTML, XML, Microsoft Word/PowerPoint/Excel and just about any file that can be converted to XML or HTML text. Swish-e is also often used to supplement databases like the MySQL DBMS for very fast full-text searching.

help from simon, ok steven@, sturm@
9 lines
310 B
Plaintext
See ${PREFIX}/share/doc/swish-e/INSTALL for more setup information.

Additional indexing functionality:

* For PDF documents, install the xpdf package.
* For MS Word documents, install the catdoc package.
* For MP3 ID3 tags, install the p5-MP3-Tag package.
* For MS Excel files, install the p5-Spreadsheet-ParseExcel package.
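
The list above amounts to a lookup from document type to optional helper package. A minimal sketch of that lookup follows; the package names come from the list, while the file extensions used as keys and the `packages_for` helper are illustrative assumptions, not part of Swish-e or the port.

```python
# Optional helper packages, taken from the list above.
# The file extensions used as keys are an illustrative assumption.
HELPER_PACKAGES = {
    ".pdf": "xpdf",
    ".doc": "catdoc",
    ".mp3": "p5-MP3-Tag",
    ".xls": "p5-Spreadsheet-ParseExcel",
}


def packages_for(extensions):
    """Return the sorted, de-duplicated helper packages for the given extensions.

    Extensions with no entry in HELPER_PACKAGES are ignored.
    """
    wanted = set()
    for ext in extensions:
        package = HELPER_PACKAGES.get(ext.lower())
        if package is not None:
            wanted.add(package)
    return sorted(wanted)


if __name__ == "__main__":
    # Print the packages needed to index PDF and Excel files.
    print(" ".join(packages_for([".pdf", ".xls"])))
```

The printed names could then be passed to the system's package installer; which installer and options to use depends on the local setup.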