SpamBayes

From Wikipedia, the free encyclopedia

SpamBayes
Original author	Tim Peters
Initial release	September 2002

Stable release	1.0.4 / March 2005
Preview release	1.1a6 / December 6, 2008^[1]

Written in	Python
Platform	Cross-platform
Available in	English only
Type	E-mail filtering
License	PSFL
Website	spambayes.sourceforge.net

SpamBayes is a Bayesian spam filter written in Python which uses techniques laid out by Paul Graham in his essay "A Plan for Spam". It has subsequently been improved by Gary Robinson and Tim Peters, among others.^[2]

The most notable difference between a conventional Bayesian filter and the filter used by SpamBayes is that there are three classifications rather than two: spam, non-spam (called ham in SpamBayes), and unsure. The user trains the filter by marking messages as either ham or spam; when filtering a message, the filter generates one score for ham and another for spam.

If the spam score is high and the ham score is low, the message will be classified as spam. If the spam score is low and the ham score is high, the message will be classified as ham. If the scores are both high or both low, the message will be classified as unsure.
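The three-way decision described above can be illustrated with a minimal Python sketch. It is not taken from the SpamBayes source: the function names `combine` and `classify`, the way the two scores are folded into a single value, and the cutoff values 0.2 and 0.9 are all assumptions chosen for illustration. Both scores are assumed to lie between 0 and 1.

```python
def combine(spam_score: float, ham_score: float) -> float:
    """Fold the two scores into one value in [0, 1].

    Assumed formula for illustration: when both scores are high or
    both are low they cancel out and the result sits near 0.5.
    """
    return (spam_score - ham_score + 1.0) / 2.0


def classify(spam_score: float, ham_score: float,
             ham_cutoff: float = 0.2, spam_cutoff: float = 0.9) -> str:
    """Return "spam", "ham", or "unsure" (cutoffs are hypothetical)."""
    combined = combine(spam_score, ham_score)
    if combined >= spam_cutoff:
        return "spam"      # strong spam evidence, weak ham evidence
    if combined <= ham_cutoff:
        return "ham"       # strong ham evidence, weak spam evidence
    return "unsure"        # conflicting or insufficient evidence


print(classify(0.99, 0.01))
print(classify(0.01, 0.99))
print(classify(0.95, 0.95))
print(classify(0.05, 0.05))
```

In this sketch, any message whose combined value falls between the two cutoffs is treated as unsure, which covers the "both high" and "both low" cases as well as intermediate scores that the description above does not address.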

This approach leads to a low number of false positives and false negatives, but it may result in a number of messages classified as unsure, which require a human decision.