Educause Security Discussion mailing list archives

Re: Detecting phishing messages


From: Joseph Tam <tam () MATH UBC CA>
Date: Fri, 5 Jan 2018 23:17:06 -0800

On Fri, 5 Jan 2018, Erik D Evans wrote:

One thing we are considering is setting up a dictionary containing
common words we see in phishing messages such as the one I have
included below.  We regularly see words such as kindly, verify,
validate, important, urgent, account, etc...  What we would like to do
with this is if we see a message that has more than one of these words,
AND a link to an external web site - prepend a warning to the message
and make the URL unclickable.  However, we have some concern about how
many false positives we will get with this approach.

This sounds like an ad-hoc approach to Bayesian analysis.  If you already
have a corpus of this type of phishing (and a corpus of legitimate mail
including real notices sent by your IT staff), you can you teach the
Bayesian system to classify mail based on these corpuses.

The benefits:

        - categorizes based on both positive *and* negative correlation of
        tokens.

        - you can retrain on error without having to scrap keywords

Joseph Tam <tam () math ubc ca>


Current thread: