Difference between revisions of "MediaWiki/fighting spam"

from HTYP, the free directory anyone can edit if they can prove to me that they're not a spambot
(updated links to be interwiki; Wiki Spam list)
(→‎Notes: monitor RSS)

Revision as of 19:36, 11 July 2006



Overview

This page relates to fighting spam postings, otherwise known as wikispam, in MediaWiki.

Notes

There appear to be two main methods for preventing spam. Both match submitted edits against a regex and reject any edit that matches. One method is built in (the $wgSpamRegex setting) and allows only a single regex string; the other requires an extension ("ambiguously licensed") and allows blacklist data to be pulled from remote sites.
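The matching step both methods share can be sketched in a few lines. This is only an illustration of the idea, not MediaWiki's actual code: the pattern, the domain names, and the function name edit_is_rejected are all made up for the example.

```python
import re

# Hypothetical blacklist pattern, for illustration only. A real one would be
# tuned to the spam a given site actually receives.
SPAM_PATTERN = re.compile(
    r"cheap-pills\.example|casino-bonus\.example",
    re.IGNORECASE,
)


def edit_is_rejected(submitted_text: str, pattern: re.Pattern = SPAM_PATTERN) -> bool:
    """Return True if the submitted edit text matches the blacklist regex."""
    return pattern.search(submitted_text) is not None


if __name__ == "__main__":
    print(edit_is_rejected("Visit http://CHEAP-PILLS.example/ today"))
    print(edit_is_rejected("Fixed a typo in the regex section."))
```

A single alternation like this corresponds to the "single regex string" approach; a remote blacklist would in effect supply many such alternatives from an external list.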

A tip for small sites: always monitor your site's recent-changes RSS feed to see what changes others are making. A lot of spam does not show up when you look at the page as displayed in a browser; the content is hidden using CSS so that search bots pick it up but editors do not notice it.
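As a rough sketch of how feed monitoring could be partly automated, the following scans RSS items for inline styles commonly used to hide content. Everything here is an assumption for illustration: the sample feed is invented, the function name suspicious_items is hypothetical, and the two CSS patterns checked are far from a complete list of hiding tricks.

```python
import re
import xml.etree.ElementTree as ET
from typing import List

# Inline-CSS markers that often indicate hidden content (not exhaustive).
HIDDEN_STYLE = re.compile(
    r"display\s*:\s*none|visibility\s*:\s*hidden",
    re.IGNORECASE,
)


def suspicious_items(rss_xml: str) -> List[str]:
    """Return titles of feed items whose description contains hidden-content CSS."""
    root = ET.fromstring(rss_xml)
    flagged = []
    for item in root.iter("item"):
        title = item.findtext("title", default="")
        description = item.findtext("description", default="")
        if HIDDEN_STYLE.search(description):
            flagged.append(title)
    return flagged


# Invented sample feed; a real one would be fetched from the wiki.
SAMPLE_FEED = """<rss version="2.0"><channel>
<item><title>Main Page</title><description>&lt;div style="display:none"&gt;&lt;a href="http://spam.example/"&gt;pills&lt;/a&gt;&lt;/div&gt;</description></item>
<item><title>Sandbox</title><description>Fixed a typo.</description></item>
</channel></rss>"""

if __name__ == "__main__":
    print(suspicious_items(SAMPLE_FEED))
```

A check like this only narrows down which changes to look at first; reading the feed by hand is still the point of the tip.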

Links

  • MediaWiki documentation
    • Wiki Spam: a detailed explanation of the problem
    • Anti-spam Features: simple built-in regex blacklist
    • SpamBlacklist extension: more powerful than the built-in regex blacklist. The README file explains most of the setup, but does not make it clear that you need to install two files: SpamBlacklist.php and SpamBlacklist_body.php