More specifically we love massive enterprise email datasets. Email doesn't have the best reputation with engineers - the protocol is ancient, poorly defined, even more poorly secured and email isn’t Slack. As a Data Science team we don’t think of email in terms of SMTP, but rather a beautiful, dynamic and pretty-huge JSON dataset that captures the intricacies of human-to-human communication. Email knows who you communicate with, what you communicate about, what clients you’re pitching, what projects you’ve just completed, who your team members are, your company hierarchy, (excitingly) the list goes on.
“Rule-based security systems are ineffective at detecting advanced threats on email. This is because the most advanced threats are either caused by - or exploit human relationships, which by their nature are dynamic and constantly in flux. This is a very interesting challenge for us in the Data Science team, one that requires using advanced NLP and training models to detect deviations from a user’s normal behavior.” - AMINE SALEM