سال انتشار: ۱۳۹۱

محل انتشار: چهارمین کنفرانس مهندسی برق و الکترونیک ایران

تعداد صفحات: ۶

نویسنده(ها):

Nasrin Malakooti – School of Electrical & Computer EngineeringShiraz UniversityShiraz, Iran
Ali Hamzeh – School of Electrical & Computer EngineeringShiraz UniversityShiraz, Iran

چکیده:

Document similarity has formed an important area in information retrieval and the document mining domains. In other words, having an accurate document similarity measure has a great influence on our performance in these areas. Although many proposed document similaritymethods suffer from several shortcomings. Therefore, this paper focuses on the problem of text document clusteringand has proposed a weighted graph algorithm to improve previous proposed methods. In this study, a weighted method which is based on the semantic relation of document’s words is introduced. Using this relation helps us to capture the meaning of documents more accurately. Final results confirm that our method outperforms othercompared methods