Scientific report: "Clustering Technique in Multi-Document Personal Name Disambiguation"
Focusing on multi-document personal name disambiguation, this paper develops an agglomerative clustering approach to the problem. We start from an analysis of the pointwise mutual information between a feature and the ambiguous name, which leads to a novel method for computing feature weights in clustering. We then propose a trade-off measure between within-cluster compactness and among-cluster separation as the criterion for stopping clustering. Finally, we apply a labeling method to find representative features for each cluster. ...
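The abstract does not give the paper's actual weighting formula, so the following is only a minimal sketch of the standard pointwise mutual information, PMI(f, n) = log2(P(f, n) / (P(f) P(n))), estimated from document-level co-occurrence counts. The function name `pmi_weights`, the token-list input format, and the use of document frequency as the probability estimate are all assumptions made for illustration, not the authors' method.

```python
import math
from collections import Counter


def pmi_weights(docs, name):
    """Return {feature: PMI(feature, name)} from document co-occurrence.

    docs: list of token lists (hypothetical input format).
    name: the ambiguous name token.
    Probabilities are estimated as document frequencies (an assumption).
    """
    n_docs = len(docs)
    doc_sets = [set(d) for d in docs]

    # Document frequency of every token.
    df = Counter(tok for s in doc_sets for tok in s)
    # Number of documents in which a feature co-occurs with the name.
    co = Counter(
        tok for s in doc_sets if name in s for tok in s if tok != name
    )

    p_name = df[name] / n_docs
    weights = {}
    for feat, joint in co.items():
        p_feat = df[feat] / n_docs
        p_joint = joint / n_docs
        weights[feat] = math.log2(p_joint / (p_feat * p_name))
    return weights


docs = [
    ["smith", "doctor"],
    ["smith", "doctor"],
    ["jones", "doctor"],
    ["jones", "lawyer"],
]
weights = pmi_weights(docs, "smith")
```

Features that never co-occur with the name are simply left out of the result, which avoids taking the logarithm of zero; a real system would probably also smooth the counts.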
Proceedings of the ACL-IJCNLP 2009 Student Research Workshop, pages 88–95,
Suntec, Singapore, 4 August 2009. ©2009 ACL and AFNLP