Benkovich, Nikita and Dedenok, Roman and Golubev, Dmitry (2019): Deep Quarantine for Suspicious Mail. Published in: CEUR Workshop Proceedings , Vol. 2479, (26 September 2019): pp. 68-76.
Preview |
PDF
paper6.pdf Download (461kB) | Preview |
Abstract
In this paper, we introduce DeepQuarantine (DQ), a cloudtechnology to detect and quarantine potential spam messages. Spam at-tacks are becoming more diverse and can potentially be harmful to emailusers. Despite the high quality and performance of spam filtering sys-tems, detection of a spam campaign can take some time. Unfortunately,in this case some unwanted messages get delivered to users. To solve thisproblem, we created DQ, which detects potential spam and keeps it ina special Quarantine folder for a while. The time gained allows us todouble-check the messages to improve the reliability of the anti-spam so-lution. Due to high precision of the technology, most of the quarantinedmail is spam, which allows clients to use email without delay. Our solutionis based on applying Convolutional Neural Networks on MIME headersto extract deep features from large-scale historical data. We evaluatedthe proposed method on real-world data and showed that DQ enhancesthe quality of spam detection.
Item Type: | MPRA Paper |
---|---|
Original Title: | Deep Quarantine for Suspicious Mail |
English Title: | Deep Quarantine for Suspicious Mail |
Language: | English |
Keywords: | spam filtering; spam detection; machine learning; deeplearning; cloud technology |
Subjects: | C - Mathematical and Quantitative Methods > C4 - Econometric and Statistical Methods: Special Topics > C45 - Neural Networks and Related Topics M - Business Administration and Business Economics ; Marketing ; Accounting ; Personnel Economics > M1 - Business Administration > M15 - IT Management |
Item ID: | 97311 |
Depositing User: | Dr. Rustam Tagiew |
Date Deposited: | 09 Dec 2019 15:51 |
Last Modified: | 09 Dec 2019 15:51 |
References: | Bhowmick, A., Hazarika, S. M.: Machine Learning for E-mail Spam Filtering: Re-view Techniques and Trends. arXiv preprint arXiv:1606.01042 (2016) Biggio, B., Corona, I., Fumera, G., Giacinto, G., Roli, F.: Bagging Classifiers forFighting Poisoning Attacks in Adversarial Classification Tasks. In: MCS 2011,LNCS, vol. 6713, pp. 350-359, Springer,Verlag (2011) Boureau, Y.-L., Bach, F., LeCun, Y., Ponce, J.: Learning mid-level features forrecognition. In: Computer Vision and Pattern Recognition (CVPR), IEEE Confer-ence on, pp. 2559-2566, IEEE, Finland (2010) Dhamani, N., Azunre, P., Gleason, J. L., Corcoran, C., Honke, G., Kramer, S.,Morgan J.: Using Deep Networks and Transfer Learning to Address Disinformation.arXiv preprint arXiv:1905.10412 (2019) GooglePatentsDelaytechniqueine-mailfilteringsystem,https://patents.google.com/patent/US20090157708A1/en.Lastaccessed15Sep 2019 Halgas, L., Agrafiotis, I., Nurse, J.: Catching the Phish: Detecting Phishing Attacksusing Recurrent Neural Networks (RNNs). In: 20th World Conference on Informa-tion Security Applications, Korea (2019) Hinton, G. E., Srivastava, N., Krizhevsky, A., Sutskever, I., Salakhutdinov, R. R.:Improving neural networks by preventing coadaptation of feature detectors. arXivpreprint arXiv:1207.0580 (2012) Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8),1735-1780 (1997) Hu, Y., Guo, C., Ngai, E., Liu, M., Chen, S.: A Scalable Intelligent Non-content-based Spam-filtering Framework. Expert Systems with Applications 37(12), 8557-8565 (2010) Klimt, B., Yang, Y.: The Enron corpus: A new dataset for email classificationresearch. In: Proceedings of the 15th European Conference on Machine Learning,pp. 217-226, Pisa, Italy. (2004) Qaroush, A., Khater, I. M., Washaha, M.: Identifying spam e-mail based-on sta-tistical header features and sender behavior. In: CUBE International InformationTechnology Conference, pp. 771-778, ACM, USA (2012) RFC 1521 Mechanisms for Specifying and Describing the Format of Internet Mes-sage Bodies, https://tools.ietf.org/html/rfc1521. Last accessed 15 Sep 2019 Schmidhuber, J.: Deep Learning in Neural Networks: An Overview. Neural Net-works 61, 85-117 (2015) Securelist Spam and phishing in Q2 2019, https://securelist.com/spam-and-phishing-in-q2-2019/92379/. Last accessed 15 Sep 2019 Zhang, L., Zhu, J., Yao, T.: An Evaluation of Statistical Spam Filtering TechniquesSpam Filtering as Text Categorization. ACM Transactions on Asian Language In-formation Processing (TALIP) 3(4), 243-269 (2004) Zhang, X., Zhao, J., LeCun, Y.: Character-level convolutional networks for textclassification. In: NIPS, pp. 649-657, Montreal, Canada (2015) |
URI: | https://mpra.ub.uni-muenchen.de/id/eprint/97311 |