4 Commits

Author SHA1 Message Date
naddy
8c80a751f8 drop RCS Ids 2022-03-11 20:02:12 +00:00
sthen
df7beffe1b update to libexttextcat-3.4.6 which has some more classifiers
- set USE_GMAKE so the upstream install target works properly, which
means we don't need to handroll installing the lm files in post-install,
which means they get installed to the correct directory. tracked
down from a problem with Dovecot reported by Johan Huldtgren.

- build/install the testtextcat program that's used in regression tests,
as a simple way to see what the library is doing (run it with a path
to fpdb.conf, feed on stdin, it tries to identify the language.
looks like you need to be in share/libexttextcat for it to pick up
the lm files correctly)
2021-11-15 00:29:04 +00:00
sthen
c06f261229 libexttextcat: rename fpdb.conf to fbdb.conf.dist and @sample 2021-02-08 15:41:12 +00:00
sthen
b2fac56fd1 import libexttextcat, ok tb@ benoit@ (tb spotted a test dep on bash and
suggested adding TDEP, I fixed it differently)

libexttextcat is an N-Gram-Based Text Categorization library primarily
intended for language guessing. It is an adaption of wiseguys libtextcat
extended to be UTF-8 aware.
2021-02-08 09:53:58 +00:00