| Email Security Products |
|
Bayesian FilteringImplementing Bayesian Classifier with GEE WhizBayesian Filtering is the most sophisticated spam detection heuristic algorithm available. Bayesian Filtering is based on mathematical algorithms that project the "probability" of an email being spam by comparing a known corpus of SPAM (bad email) and HAM (good email). What makes Bayesian Filtering so effective and valuable is the fact that Bayesian Classifier Filtering can be customised to use your specific email libraries rather than a "general" library. There is a wealth of information about Bayesian Filtering on the Internet. For more information on Bayesian filtering, do a search on Google or go to: http://www.paulgraham.com/ Using the Default Set of Bayesian TokensGEE Whiz ships a default set of Bayesian tokens based on a library of known SPAM and HAM used by Spam Assassin. By enabling the "Use Default Bayes Token" option, you can immediately take advantage of enhanced spam detection. When you enable the default Bayesian option, GEE Whiz applies the information in the DBAYES.DAT file from the SYS:GEE\TMPLTS directory. To read the DBAYES.DAT file, open it with Internet Explorer. You can revert to the Default Tokens at any time by selecting the "Use Default Bayes Token" link.
Customising Advanced Textual ClassifierTo customise Bayesian Classifier Filtering, you will need to have a library of known good mail (HAM) and known bad mail (SPAM). This mail must be in raw MIME format. The best way to gather these emails is to create shared folders called SPAM and HAM and ask a limited number of "trusted" users to move emails into the appropriate folders. The users need to be "trusted" in that you want them to put the right types of emails into the right shared folders! You don't what them putting SPAM in the HAM folder and vice versa. Another way to gather the emails is to provide them with one of the freeware utilities that are available to export emails from GroupWise and have your users export their emails directly to a shared SPAM and HAM directory on your server. Following are links to two GroupWise email export programmes. ExportSpam was developed by Michael Bell, the developer of Guinevere. The second programme, GWMime822, was sent to us by email without acknowledging the name of the developer. After gathering a sufficient number of SPAM/HAM emails (500 each)
Each time you select the "Teach Bayesian Classifier" link, GEE Whiz replaces the previous set of tokens, reads the SPAM and HAM emails and creates a new BAYES.DAT file that contains the token information (SYS:GEE\TMPLTS\BAYES.DAT). To read the file, open it with Internet Explorer. You can update the tokens by going through steps 2 and 3 above. Note: When you "Teach Bayesian Classifier", you replace the previous Bayesian Tokens (either the default tokens or the previous "taught" tokens). Do not "Teach Classifier" if you do not have emails in the GEE\BAYES\SPAM and HAM directories. You can always go back to the default set of Bayesian Tokens by selecting the "Use Default Bayes Tokens" link. This will replace your Custom Bayes tokens with the default set that ships with GEE Whiz. Enable Auto-BayesDo NOT use Auto-Bayes if you are using the Default Bayes Tokens or have implemented the Teach Classifier process. The Auto-Bayes option has been removed from GEE Whiz 2.0. Although Auto-Bayes might be a good way to "prime" your Bayesian filtering, be aware that when you apply Auto-Bayes, it will replace the default tokens or the "taught" tokens. Auto-Bayes automatically schedules the "Teach Bayesian Classifier" to add the new emails to the tokens once a night. |