Applied Sciences, Vol. 13, Pages 4061: Illegal Domain Name Generation Algorithm Based on Character Similarity of Domain Name Structure

Applied Sciences doi: 10.3390/app13064061

Authors: Yuchen Liang, Yanan Cheng, Zhaoxin Zhang, Tingting Chai, Chao Li

Detecting and controlling illegal websites (such as gambling and pornography sites) through their domain names remains an unsolved problem, so mining and discovering potential illegal domain names in advance has become a research hotspot. This paper studies a method for generating illegal domain names based on the character similarity of domain name structure. First, the K-means algorithm is used to cluster illegal domain names with similar structures. The resulting clusters are then used to train a generative adversarial network. Finally, using a specific result verification method, the experiments show that the generation algorithm achieves an average concentration of 23.82%, an effective concentration of 63.54%, and an expansion rate of 7.5. Compared with the enumeration algorithm, the generation algorithm greatly improves both generation efficiency and accuracy.
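
To make the clustering step more concrete, the following is a minimal sketch of grouping domain names by character structure with K-means. It is not the authors' implementation. The feature set (label length, digit ratio, letter ratio, hyphen ratio), the farthest-point initialisation, the function names `structure_features` and `kmeans`, and the sample domains under the reserved `.example` suffix are all assumptions chosen for illustration. The generative adversarial network stage is not shown.

```python
from math import dist


def structure_features(domain):
    """Map a domain name to a small vector describing its character structure.

    The four features are illustrative assumptions, not the paper's features.
    """
    label = domain.lower().split(".")[0]  # only the leftmost label is examined
    n = len(label) or 1                   # avoid division by zero on empty labels
    digits = sum(c.isdigit() for c in label)
    letters = sum(c.isalpha() for c in label)
    hyphens = label.count("-")
    return (len(label) / 10.0, digits / n, letters / n, hyphens / n)


def kmeans(points, k, iterations=50):
    """Plain K-means (Lloyd's algorithm) with deterministic farthest-point seeding."""
    centroids = [points[0]]
    while len(centroids) < k:
        # Next seed: the point farthest from its nearest existing centroid.
        centroids.append(
            max(points, key=lambda p: min(dist(p, c) for c in centroids))
        )
    labels = None
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: dist(p, centroids[j])) for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:
                centroids[j] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels


# Hypothetical sample domains, invented for this sketch.
domains = [
    "win888a.example", "win888b.example",
    "90012345.example", "90054321.example",
    "lucky-spin-club.example", "happy-spin-club.example",
]
labels = kmeans([structure_features(d) for d in domains], k=3)
for domain, label in zip(domains, labels):
    print(label, domain)
```

In this sketch, names that share a structural pattern (letters followed by digits, all digits, or hyphenated words) fall into the same cluster. Under the pipeline described in the abstract, each such cluster would then serve as a training set for the generative model.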
