The current version of the DCC Reputation source is version 2.3.155, August 30, 2014.
The Distributed Checksum Clearinghouses or DCC is an anti-spam content filter that is based on distributed database that collects real time reports about mail from a global network of servers. Each report consists of several checksums. The most important checksums are of message bodies, the IP address of the SMTP client or mail sender, bits of envelope, and so forth. Each time a DCC client sends a report to a DCC server, the server records the report (modulo data reduction/compression) and answers with how many times it has heard of each checksum in the report. A DCC client can take that answer and decide "this message is bulk because the DCC network has heard of it more than X times."
click for more graphs
The commercial version of the DCC server also computes reputations by counting the total number of mail messages sent from particularly active IP addresses, and the number of bulk messages each sends. The percentage of bulk mail seen from an IP address is its DCC reputation. For example, a DCC Reputation of 50% means that 50% of the mail that the global DCC Reputation network has seen from an IP address has been bulk. Because the DCC network is not omniscient, the DCC Reputation of an IP address tends to understate the probability that the next message from an IP address will be bulk.
Reputations are flooded among DCC reputation servers along with DCC checksums. Thus DCC Reputations are like DCC counts, more reliable as more systems participate. Free DCC servers exchange floods of classic DCC checksums with DCC reputation servers.
DCC reputation servers detect bulk mail to compute reputations using counts of DCC body checksums reported to all DCC servers in the global network of DCC servers. Mail messages rejected because of a DCC reputation are reported a second time to the global network with counts of MANY. This ensures that the messages will be rejected if sent from some other IP address to any mail system using a DCC client, including free DCC clients. Free DCC users can view the reputation systems like automated "trojan proxy spam traps." Thus, the commercial and free DCC servers benefit mutually.
Like any reputation system, DCC reputations can have false positives. They can react more quickly than manual DNS blacklists to the appearance of new "trojan proxies" and "cracked" PHP-Nuke sites. DCC Reputations are less effective than greylisting, but many sites are unable to use greylisting. It is profitable to use both greylisting and DCC Reputations.
To minimize false positives in DCC reputations, they are computed using only reports from authenticated DCC clients. The reputation mechanism is present only in the commercial version of the DCC. That ensures that operators of DCC Reputation servers know that they must not allow indiscriminate or sloppy use of DCC clients, such as dccproc -cMANY. It would otherwise be too easy for a confused or malicious user to sully the DCC reputation of an IP address. Spam traps involving dccproc -cMANY are useful to DCC reputation servers.
Commercial DCC clients cannot use the public DCC servers, and so may as well be configured with non-anonymous DCC client-IDs. It is possible to run the commercial version of the DCC server code on public DCC servers. Their anonymous clients do not contribute to DCC reputations, but their authenticated clients using non-anonymous DCC client-IDs do.
DCC clients add the string bulk rep to X-DCC headers of mail messages that are not bulk mail but that come from IP addresses with reputations worse than a local configured threshold. There are two thresholds for DCC reputations. Unless they are set, bulk rep is not added by DCC clients and mail is not rejected, but commercial DCC servers accumulate data. Rep-total is the minimum number of mail messages, good as well as bulk, that must have been seen to compute a DCC reputation. This threshold is needed to avoid computing a reputation based on only a few mail messages. Rep is the percentage of bulk mail that gives a DCC Reputation for sending bulk mail.
Because reputations can involve more false positives, dccm and dccifd do not reject mail unless allowed by DCC-reps-on in the client whitelist file. That setting is normally in per-user whiteclnt files, but can be in the system's global /var/dcc/whiteclnt file. The proof of concept CGI scripts in the source include support for turning DCC reputations on and off and setting the Rep threshold by individual end users for their own mail.
DCC Reputations are about IP addresses, and so it is important for the system to recognize an installation's MX servers or mail systems receive mail from the Internet and forward it internally and not blame them for spam. Their IP addresses must be listed in the global whiteclnt file with MX or MXDCC entries.
DCC Reputations are implemented with proprietary programs. To protect the quality of the data and minimize false positives, there are no public servers providing DCC Reputation data to anonymous clients. The DCC Reputation software proprietary is unlike the open DCC software. It is distributed under a license.
Some of the public DCC servers are running DCC Reputation servers that answer ordinary DCC requests from anonymous clients as well as DCC Reputation requests from authenticated clients.
Some DCC Reputation logos that be displayed on web pages are available.
Because any sort of a bad reputation is not a guarantee of bad behavior, rejecting, discarding, or segregating mail from IP addresses with bad reputations (not just DCC Reputations) in "junk folders" can result in false positives. So DCC Reputations must be explicitly turned on for all mailboxes that should have DCC Reputation filtering on DCC clients, or mail systems using dccproc, dccifd, or dccm. To enable DCC Reputations:
SERVER=http://USER:PASSWD@www.rhyolite.com/anti-spam/dccrep/src/ export SERVER /var/dcc/libexec/updatedcc
If all mailboxes on a mail system should use DCC Reputation filtering and have the same thresholds, then lines like the following should be added to main /var/dcc/whiteclnt file:
option DCC-reps-on option threshold rep,20%If the system has whiteclnt files for individual mailboxes, suitable lines should be added to them. The settings for each mailbox are derived by first applying dccm -t or dccifd -t command line settings from REP_ARGS value in the /var/dcc/dcc_conf file, overriding those values with settings from the main /var/dcc/whiteclnt file, and finally applying any settings found in a per-user whiteclnt file, if any.
To choose a reputation threshold, consider the Mail with DCC Reputations graphs. The narrow band of yellow, pink, blue and red shows that most IP addresses have DCC Reputations that are either <1% or >60%. It makes little sense to use DCC Reputation filtering with a threshold below about 10% or above 60%. 40% or 50% is conservative threshold and 20% is somewhat aggressive.
cdcc "add dcc.example.com RTT-1000 32768 passwd1" cdcc "add com-dcc.rhyolite.com XXXXX passwd2"where 32768 and passwd1 are a local client-ID and matching password in the /var/dcc/ids file on your DCC Reputation servers and where XXXXXX and passwd2 are the client-ID and password for use with the backup DCC Reputation servers. passwd2 is usually the same as the PASSWD used with /var/dcc/libexec/updatedcc.
/var/dcc/libexec/updatedcc takes care of this when replacing the free version with the DCC Reputation software.