The Information here is somewhat outdated, due to a change in the listmaster team and the NiH-Syndrom, the prior effort is currently discontinued, and a new mechanism is in the process of being set up. Please watch this space.

Spam in the Debian List Archive

Note that all this is very preliminary. Comments and suggestions are very welcome.

Status quo

It has been claimed that the Debian list archives contain spam email messages.

There is a "report as spam" button in on the list archive page of each message, but presently, spam is by and large not removed from the archives. The submissions seem to help (more or less) with finding spam but need manual review before they could be acted upon.

Towards a spam removal policy

Policy corner stones

Ad hoc policy

Review standards should be set after seeing how things pan out, I am aiming at three reviewers, including one experienced one (after some bootstrapping). I hope this would minimize the risk of unwarrented removal. A rigorous standard seems to be necessary to obtain consensus with the project. As such, the three reviewers is only a guideline, not a rule. Of course, more reviewers doing shorter reviews would help tremendously. Ultimately, guaranteeing the integrety of the list archives currently falls in the realm of the Debian listmaster.

Practical matters

About using newspamclassify.py:

Any suggestions on the above and/or the program are of course welcome.

Suggested Improvements

People doing this

If you want to jump in, add yourself here and contact CordBeermann for coordination. Your help is appreciated.

Works in progress

Our goal is to have at least three reports before removing anything. For the following lists, we have some, but not enough review reports. The people mentioned already sent in reports. Your help can most immediately used if you review lists which already have some, but not enough names listed. Please add your name after you sent in your report. Lines with no names mean that the report is ready, but no one have elaborated it (yet).

List

1st Report

2nd Report

3rd Report

debian-devel

Y Giridhar Appaji Nag

SandroTosi

debian-devel-italian

SandroTosi

debian-l10n-italian

SandroTosi

debian-italian

SandroTosi

debian-python (2nd round)

SandroTosi

debian-amd64

SandroTosi

debian-security

SandroTosi

debian-68k

SandroTosi (ready-to-report)

debian-accessibility

SandroTosi

debian-alpha

SandroTosi (ready-to-report)

debian-apache

SandroTosi (ready-to-report)

debian-arm

SandroTosi (ready-to-report)

debian-firewall

SandroTosi (ready-to-report)

People

Success stories

List

Stats: reported/spam_removed

Thank goes to...

debian-project

839/436

hecker, pabs, tviehmann, wijnen

debian-python

250/205

bzed, SandroTosi, tomv

debian-vote

315 spam messages removed

debian-java

489/313

CordBeermann, man-di, SandroTosi

debian-user-german

1688/135

bzed, CordBeermann, man-di

debian-release

631/380

LukClaes, AdamBarratt, MadCoder

debian-qa

706/430

CordBeermann, SandroTosi, LukClaes

debian-newmaint

751/558

AdamBarratt, SandroTosi, LukClaes

debian-www

4139/3110

CordBeermann, SandroTosi, LukClaes

Getting program and data


CategoryTeams