TY - JOUR
T1 - Finding smallest k-Compact tree set for keyword queries on graphs using Map Reduce.
AU - Liu, Chengfei
AU - Yao, Liang
AU - Li, Jianxin
AU - Zhou, Rui
AU - He, Zhenying
PY - 2015
Y1 - 2015
N2 - Keyword search is integrated in many applications on account of the convenience to convey users’ query intention. Most existing works in keyword search on graphs modeled the query results as individual minimal connected trees or connected graphs that contain the keywords. We observe that significant overlap may exist among those query results, which would affect the result diversification. Besides, most solutions required accessing graph data and pre-built indexes in memory, which is not suitable to process big dataset. In this paper, we define the smallest k-compact tree set as the keyword query result, where no shared graph node exists between any two compact trees. We then develop a progressive A* based scalable solution using MapReduce to compute the smallest k-compact tree set, where the computation process could be stopped once the generated compact tree set is sufficient to compute the keyword query result. We conduct experiments to show the efficiency of our proposed algorithm. © 2015, Springer Science+Business Media New York.
AB - Keyword search is integrated in many applications on account of the convenience to convey users’ query intention. Most existing works in keyword search on graphs modeled the query results as individual minimal connected trees or connected graphs that contain the keywords. We observe that significant overlap may exist among those query results, which would affect the result diversification. Besides, most solutions required accessing graph data and pre-built indexes in memory, which is not suitable to process big dataset. In this paper, we define the smallest k-compact tree set as the keyword query result, where no shared graph node exists between any two compact trees. We then develop a progressive A* based scalable solution using MapReduce to compute the smallest k-compact tree set, where the computation process could be stopped once the generated compact tree set is sufficient to compute the keyword query result. We conduct experiments to show the efficiency of our proposed algorithm. © 2015, Springer Science+Business Media New York.
UR - http://www.scopus.com/inward/record.url?scp=84961201338&partnerID=8YFLogxK
U2 - 10.1007/s11280-015-0337-1
DO - 10.1007/s11280-015-0337-1
M3 - Article
SN - 1386-145X
VL - 19
SP - 499
EP - 518
JO - World Wide Web
JF - World Wide Web
IS - 3
ER -