new install, empty bayes - training
Posted: 17 Feb 2019 23:12
Hopefully I'm allowed to post about this. I just stumbled on a cool website that offers spam archives you can download and train SA with. I don't know the quality of the spam messages it has, but for a new install it should be better than nothing, right?
Besides a starter database you can download (to replace a new, empty one), they also have a daily update archive, along with a neat script that downloads the daily spam archive and feeds it to sa-learn.
http://artinvoice.hu/spams/
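In case anyone wants to see roughly what the "feed it to sa-learn" step looks like, here is a minimal Python sketch. It is not the script from the website. It assumes you have already downloaded and unpacked an archive into a local directory (the path in the usage note is made up), and that sa-learn is on your PATH. The function names are my own.

```python
import subprocess
from pathlib import Path


def sa_learn_command(kind, source):
    """Build the sa-learn command line for a file or directory of messages.

    kind must be "spam" or "ham"; source is a path you supply yourself.
    """
    if kind not in ("spam", "ham"):
        raise ValueError("kind must be 'spam' or 'ham'")
    return ["sa-learn", f"--{kind}", str(source)]


def learn(kind, source):
    """Run sa-learn on source and return whatever it printed."""
    source = Path(source)
    if not source.exists():
        raise FileNotFoundError(source)
    result = subprocess.run(
        sa_learn_command(kind, source),
        capture_output=True,
        text=True,
        check=True,  # raise if sa-learn exits with an error
    )
    return result.stdout.strip()
```

Usage would be something like `print(learn("spam", "/tmp/daily-spam"))`, run as the user whose Bayes database you want to train.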
An important note from the website that I didn't know:
Important! To achieve the best results, train your filter regularly with ham (useful) emails as well! Ham and spam count should be nearly equal, but on a working system that will not be a problem. We cannot provide a ham collection because those are valid and good emails. If you run a mail server, you will have enough ham samples for training.
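To keep an eye on the ham/spam balance that note mentions, you can read the counts from `sa-learn --dump magic`. Below is a rough Python sketch, assuming the dump prints the `nspam` and `nham` lines with the count in the third column (check against your own output first). The 25% tolerance is an arbitrary number I picked for "nearly equal", not something from the website, and the function names are my own.

```python
import subprocess


def parse_magic_counts(dump_text):
    """Pull the nspam and nham counts out of `sa-learn --dump magic` output."""
    counts = {}
    for line in dump_text.splitlines():
        fields = line.split()
        # Assumed line shape: "0.000  0  <count>  0  non-token data: nspam"
        if len(fields) >= 4 and fields[-1] in ("nspam", "nham"):
            counts[fields[-1]] = int(fields[2])
    return counts


def is_balanced(nspam, nham, tolerance=0.25):
    """True if the two counts differ by at most `tolerance` of the larger one."""
    larger = max(nspam, nham)
    if larger == 0:
        return True  # nothing trained yet, so trivially equal
    return abs(nspam - nham) / larger <= tolerance


def current_counts():
    """Run sa-learn and return the parsed counts (needs sa-learn on PATH)."""
    result = subprocess.run(
        ["sa-learn", "--dump", "magic"],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_magic_counts(result.stdout)
```

Calling `current_counts()` and passing the two numbers to `is_balanced` tells you whether it is time to train more ham.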