%0 Journal Article
%A Du Donggao
%A Ma Yan
%T Malware variants detection based on ensemble learning
%D 2020
%R 10.19682/j.cnki.1005-8885.2020.1010
%J 中国邮电高校学报(英文)
%P 82-90
%V 27
%N 2
%X An application programming interface (API) is a procedure call interface to operating system resources. API-based behavior features can capture the malicious behaviors of malware variants. However, existing malware detection approaches require a great deal of complex construction and matching operations. Furthermore, many approaches adopt graph matching, which is a nondeterministic polynomial (NP)-complete problem and therefore computationally expensive. To address these problems, a novel approach is proposed to detect malware variants. First, the APIs used by the malware are divided according to their functions and parameters. Then, a classified behavior graph (CBG) is constructed from the API call sequences. Finally, a signature based on CBGs is generated for each malware family, and the malware variants are classified by an ensemble learning algorithm. Experiments on 1 220 malware samples show that ensemble learning achieves a true positive rate (TPR) of up to 89.0% with a low false positive rate (FPR) of 3.7%.
%U https://jcupt.bupt.edu.cn/CN/10.19682/j.cnki.1005-8885.2020.1010