TY - JOUR
T1 - Effective rule mining of sparse data based on transfer learning
AU - Sun, Yongjiao
AU - Guo, Jiancheng
AU - Li, Boyang
AU - Haldar, Nur Al Hasan
PY - 2023/1
Y1 - 2023/1
N2 - Rule mining is an important and challenging task in data mining. Although many state-of-art algorithms have been proposed on dense data, they are not effectively adaptive for sparse data, such as sparse heterogeneous networks. Transfer learning improves the performance of algorithms in the target domain by transferring knowledge from a similar source domain, which provides a feasible and effective method to solve the above challenge. In this paper, we propose a transfer learning-based algorithm to mine rules on sparse data effectively, named TL-ERMSD. The algorithm is capable of detecting the knowledge of a common structure as well as the rules and logics between the source and target domains. Then, rule transfer is carried out by establishing the mapping mechanism between the two domains. We conducted experiments over the heterogeneous network datasets, including the source domain dataset FB15K and the target domain dataset Yago2Sample. The results demonstrate that the proposed TL-ERMSD for rule mining has a significant advantage over the existing algorithms.
AB - Rule mining is an important and challenging task in data mining. Although many state-of-art algorithms have been proposed on dense data, they are not effectively adaptive for sparse data, such as sparse heterogeneous networks. Transfer learning improves the performance of algorithms in the target domain by transferring knowledge from a similar source domain, which provides a feasible and effective method to solve the above challenge. In this paper, we propose a transfer learning-based algorithm to mine rules on sparse data effectively, named TL-ERMSD. The algorithm is capable of detecting the knowledge of a common structure as well as the rules and logics between the source and target domains. Then, rule transfer is carried out by establishing the mapping mechanism between the two domains. We conducted experiments over the heterogeneous network datasets, including the source domain dataset FB15K and the target domain dataset Yago2Sample. The results demonstrate that the proposed TL-ERMSD for rule mining has a significant advantage over the existing algorithms.
KW - Heterogeneous network
KW - Rule mining
KW - Sparse data
KW - Transfer learning
UR - http://www.scopus.com/inward/record.url?scp=85126771370&partnerID=8YFLogxK
U2 - 10.1007/s11280-022-01042-1
DO - 10.1007/s11280-022-01042-1
M3 - Article
AN - SCOPUS:85126771370
SN - 1386-145X
VL - 26
SP - 461
EP - 480
JO - World Wide Web
JF - World Wide Web
IS - 1
ER -