3 Commits

Author SHA1 Message Date
landry
cc22a424d3 bump ports depending on python3 in textproc 2017-01-03 19:27:51 +00:00
sthen
be8dfb30fb sync WANTLIB 2016-07-09 08:46:24 +00:00
shadchin
d381682891 Import py-stemmer 1.3.0, ok sthen@
PyStemmer provides access to efficient algorithms for calculating a
"stemmed" form of a word. This is a form with most of the common
morphological endings removed; hopefully representing a common
linguistic base form. This is most useful in building search engines
and information retrieval software; for example, a search with stemming
enabled should be able to find a document containing "cycling" given the
query "cycles".

PyStemmer provides algorithms for several (mainly european) languages,
by wrapping the libstemmer library from the Snowball project in a Python
module. It also provides access to the classic Porter stemming algorithm
for english: although this has been superceded by an improved algorithm,
the original algorithm may be of interest to information retrieval
researchers wishing to reproduce results of earlier experiments
2016-01-20 05:09:38 +00:00