2a813080bc
Help from William Yodlowsky <bsd () openbsd.rutgers.edu>
Help and ok steven@, simon@
FuzzyOcr is a plugin for SpamAssassin aimed at unsolicited bulk mail
(also known as "Spam") that uses images as its main content carrier.
Using several methods, it analyzes the content and properties of images
to distinguish between normal mail (Ham) and spam mail. The main
methods are:

* Optical Character Recognition using different engines and settings
* Fuzzy word matching algorithm applied to OCR results
* Image hashing system to learn unique properties of known spam images
* Dimension, size and integrity checking of images
* Content-Type verification for the containing email
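
To illustrate the fuzzy word matching step listed above, here is a
minimal sketch in Python. It is not FuzzyOcr's own code or algorithm:
the function name fuzzy_hits, the word list and the 0.75 threshold are
all assumptions chosen for illustration, and the similarity measure is
the ratio from Python's standard difflib module. The idea is only that
OCR output such as "v1agra" should still match a listed word such as
"viagra".

```python
import re
from difflib import SequenceMatcher


def fuzzy_hits(ocr_text, spam_words, threshold=0.75):
    """Return the spam words that approximately occur in ocr_text.

    Hypothetical helper for illustration only; the threshold is an
    assumed value, not a FuzzyOcr default.
    """
    # Split the OCR output into lowercase alphanumeric tokens.
    tokens = re.findall(r"[a-z0-9]+", ocr_text.lower())
    hits = []
    for word in spam_words:
        for token in tokens:
            # ratio() is 1.0 for identical strings and falls toward
            # 0.0 as the two strings share fewer characters in order.
            score = SequenceMatcher(None, word.lower(), token).ratio()
            if score >= threshold:
                hits.append(word)
                break  # one approximate match per word is enough
    return hits


if __name__ == "__main__":
    words = ["viagra", "cialis", "mortgage"]
    print(fuzzy_hits("Buy cheap v1agra and c1alis now", words))
```

A lower threshold tolerates more OCR noise and deliberate obfuscation
but raises the chance of matching ordinary words in legitimate mail, so
in practice the value would need tuning against real Ham and Spam.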