TY - JOUR
T1 - DFraud³: Multi-Component Fraud Detection Free of Cold-Start.
T2 - Multi-Component Fraud Detection free of Cold-start
AU - Shehnepoor, Saeedreza
AU - Togneri, Roberto
AU - Liu, Wei
AU - Bennamoun, Mohammed
PY - 2021/5/19
Y1 - 2021/5/19
N2 - Fraud review detection is a hot research topic in recent years. The Cold-start is a particularly new but significant problem referring to the failure of a detection system to recognize the authenticity of a new user. State-of-the-art solutions employ a translational knowledge graph embedding approach (TransE) to model the interaction of the components of a review system. However, these approaches suffer from the limitation of TransE in handling N-1 relations and the narrow scope of a single classification task, i.e., detecting fraudsters only. In this paper, we model a review system as a Heterogeneous Information Network (HIN) which enables a unique representation to every component and performs graph inductive learning on the review data through aggregating features of nearby nodes. HIN with graph induction helps to address the camouflage issue (fraudsters with genuine reviews) which has shown to be more severe when it is coupled with cold-start, i.e., new fraudsters with genuine first reviews. In this research, instead of focusing only on one component, detecting either fraud reviews or fraud users (fraudsters), vector representations are learned for each component, enabling multi-component classification. In other words, we can detect fraud reviews, fraudsters, and fraud-targeted items, thus the name of our approach DFraud3. DFraud3 demonstrates a significant accuracy increase of 13% over the state of the art on Yelp.
AB - Fraud review detection is a hot research topic in recent years. The Cold-start is a particularly new but significant problem referring to the failure of a detection system to recognize the authenticity of a new user. State-of-the-art solutions employ a translational knowledge graph embedding approach (TransE) to model the interaction of the components of a review system. However, these approaches suffer from the limitation of TransE in handling N-1 relations and the narrow scope of a single classification task, i.e., detecting fraudsters only. In this paper, we model a review system as a Heterogeneous Information Network (HIN) which enables a unique representation to every component and performs graph inductive learning on the review data through aggregating features of nearby nodes. HIN with graph induction helps to address the camouflage issue (fraudsters with genuine reviews) which has shown to be more severe when it is coupled with cold-start, i.e., new fraudsters with genuine first reviews. In this research, instead of focusing only on one component, detecting either fraud reviews or fraud users (fraudsters), vector representations are learned for each component, enabling multi-component classification. In other words, we can detect fraud reviews, fraudsters, and fraud-targeted items, thus the name of our approach DFraud3. DFraud3 demonstrates a significant accuracy increase of 13% over the state of the art on Yelp.
KW - Australia
KW - Camouflage
KW - cold-start
KW - Feature extraction
KW - Fraud
KW - Inductive Learning
KW - Linear programming
KW - Multi-Component Classification
KW - Performance gain
KW - Social Media
KW - Surges
KW - Training
KW - Writing
UR - http://www.scopus.com/inward/record.url?scp=85106686143&partnerID=8YFLogxK
U2 - 10.1109/TIFS.2021.3081258
DO - 10.1109/TIFS.2021.3081258
M3 - Article
AN - SCOPUS:85106686143
SN - 1556-6013
VL - 16
SP - 3456
EP - 3468
JO - IEEE Transactions on Information Forensics and Security
JF - IEEE Transactions on Information Forensics and Security
M1 - 9435380
ER -