2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

IEEE Signal Processing Society

Institute of Electrical and Electronics Engineers (IEEE)

2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

Technical Program

Paper Detail

Paper ID	IFS-7.4
Paper Title	INTEGRATING DEEP LEARNING WITH FIRST-ORDER LOGIC PROGRAMMED CONSTRAINTS FOR ZERO-DAY PHISHING ATTACK DETECTION
Authors	Seok-Jun Bu, Sung-Bae Cho, Yonsei University, South Korea
Session	IFS-7: Information Hiding, Cryptography and Cybersecurity
Location	Gather.Town
Session Time:	Friday, 11 June, 11:30 - 12:15
Presentation Time:	Friday, 11 June, 11:30 - 12:15
Presentation	Poster
Topic	Information Forensics and Security: [CYB] Cybersecurity
IEEE Xplore Open Preview	Click here to view in IEEE Xplore
Virtual Presentation	Click here to watch in the Virtual Conference
Abstract	Considering the fatality of phishing attacks that are emphasized by many organizations, the inductive learning approach using reported malicious URLs has been verified in the field of deep learning. However, the deep learning-based method mainly focused on the fitting of a classification task via historical URL observation shows a limitation of recall due to the characteristics of zero-day attack. In order to model the nature of a zero-day phishing attack in which URL addresses are generated and discarded immediately, an approach that utilizes the expert knowledge is promising. We introduce the integration method of deep learning and logic programmed domain knowledge to inject the real-world constraints. We design neural and logic classifiers and propose the joint learning method of each component based on the traditional neuro-symbolic integration. Extensive experiments on three real-world datasets consisting of 222,541 URLs showed the highest recall among the latest deep learning methods, despite the hostile class-imbalanced condition. We demonstrate that the optimized weighting between neural and logic component has an effect of improving the recall over 3% compared to the existing methods.