Bogofilter is a mail filter that classifies e-mail as spam or ham (non-spam) by statistical analysis of the message's header and content (body). The program is able to learn from the user's classifications and corrections. It was originally written by Eric S. Raymond, and is now maintained by David Relson, Matthias Andree and Greg Louis, together with a group of contributors.
The statistical technique used is known as Bayesian filtering, and its use for spam was first described by Paul Graham in his article "A Plan for Spam". Gary Robinson, in his weblog Rants, suggests some refinements for improved discrimination between spam and ham. Bogofilter's primary algorithm uses the f(w) parameter and the Fisher inverse chi-square technique that he describes.
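The f(w) smoothing and the Fisher inverse chi-square combination can be sketched roughly as follows. This is a minimal illustration based on Robinson's published description, not bogofilter's actual C code; the function names, the token counts passed in, and the default values of the prior strength `s` and the unknown-word probability `x` are assumptions chosen for the example.

```python
import math

def robinson_f(spam_count, ham_count, n_spam_msgs, n_ham_msgs, s=1.0, x=0.5):
    """Smoothed spam probability f(w) for one token.

    s (prior strength) and x (probability assumed for an unseen token)
    are illustrative defaults, not bogofilter's configured values.
    """
    spam_freq = spam_count / n_spam_msgs if n_spam_msgs else 0.0
    ham_freq = ham_count / n_ham_msgs if n_ham_msgs else 0.0
    if spam_freq + ham_freq == 0.0:
        return x  # token never seen: fall back to the prior
    p = spam_freq / (spam_freq + ham_freq)  # raw per-token estimate p(w)
    n = spam_count + ham_count              # amount of evidence for w
    return (s * x + n * p) / (s + n)

def chi2q(x2, v):
    """Chi-square survival function for an even number v of degrees of freedom.

    Naive series; exp(-m) can underflow for very large inputs.
    """
    m = x2 / 2.0
    term = total = math.exp(-m)
    for i in range(1, v // 2):
        term *= m / i
        total += term
    return min(total, 1.0)

def fisher_score(probs):
    """Combine per-token f(w) values into one indicator in [0, 1]."""
    if not probs:
        return 0.5
    eps = 1e-12  # keep log() finite if a caller passes exactly 0 or 1
    probs = [min(max(f, eps), 1.0 - eps) for f in probs]
    n = len(probs)
    # q_f is close to 1 when the f(w) values are close to 1 (spam-like tokens)
    q_f = chi2q(-2.0 * sum(math.log(f) for f in probs), 2 * n)
    # q_1mf is close to 1 when the f(w) values are close to 0 (ham-like tokens)
    q_1mf = chi2q(-2.0 * sum(math.log(1.0 - f) for f in probs), 2 * n)
    return (1.0 + q_f - q_1mf) / 2.0
```

In this sketch a score near 1 suggests spam, a score near 0 suggests ham, and a score near 0.5 means the evidence is weak or conflicting.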
Bogofilter is run by an MDA script to classify an incoming message as spam or ham (using wordlists stored by BerkeleyDB, SQLite3 or QDBM). Bogofilter provides processing for plain text and HTML. It supports multipart MIME messages with decoding of base64, quoted-printable, and uuencoded text, and ignores attachments, such as images.
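The kind of preprocessing described above (walking a multipart MIME message, decoding the text parts and skipping attachments) can be illustrated with Python's standard `email` package. This is only a sketch of the general idea, not how bogofilter itself is implemented; the function name `extract_text_parts` is hypothetical.

```python
from email import message_from_string, policy

def extract_text_parts(raw_message):
    """Return the decoded text/plain and text/html parts of a raw message.

    Parts marked as attachments and non-text parts such as images are
    skipped. Transfer encodings like base64 and quoted-printable are
    decoded by the email package when get_content() is called.
    """
    msg = message_from_string(raw_message, policy=policy.default)
    texts = []
    for part in msg.walk():
        if part.is_multipart():
            continue  # container part, holds no text of its own
        if part.get_content_disposition() == "attachment":
            continue  # ignore attachments
        if part.get_content_type() in ("text/plain", "text/html"):
            texts.append(part.get_content())
    return texts
```

The returned strings would then be split into tokens and looked up in the wordlists by the classifier.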
Standard tests at TREC 2005 show that Bogofilter compares well to its competitors SpamBayes, CRM114 and DSPAM. Other competitors include, but are not limited to, SpamProbe and QSF.
Bogofilter is written in C, and runs on Linux, FreeBSD, NetBSD, OpenBSD, Solaris, Mac OS X, HP-UX, AIX and other platforms.
This article, or an earlier revision of it, was edited from bogofilter's homepage.