hopefully im allowed to post about this, i just stumble on a cool website that offers spam archives to download and train SA with. i dont know the quality of spam messages it has but for a new install it should be better than nothing, right?
Besides a starter database you can download (to replace a new empty existing one) they also have a daily update archive you can use to update it daily with a neat script to download the daily archive of spam then feeds it to sa-learn.
An important note on the website, i didnt know this
Important! To achieve the best results, train your filter regularly with ham (useful) emails as well! Ham and spam count should be nearly equal, but on a working system that will not be a problem. We cannot provide a ham collection because those are valid and good emails. If you run a mail server, you will have enough ham samples for training.
It makes little sense to me to use spam archives.
Just take some time to train Bayes. If you really want spam to train Bayes, create email addresses for wife and kids.
“We are stuck with technology when what we really want is just stuff that works.” -Douglas Adams
henk wrote: ↑17 Feb 2019 23:55
It makes little sense to me to use spam archives.
Just take some time to train Bayes. If you really want spam to train Bayes, create email addresses for wife and kids.