How to identify which cluster belongs to which emotion

Time: 2013-04-25 04:37:16

Tags: weka

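I am new to Weka. I need to identify a set of emotions in blog documents using the clustering methods in the Weka tool. For emotion detection I use a set of feature values (my features are represented as attributes). For example, my dataset looks like this: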

@relation emotion

% pos: total number of times each part of speech (noun, verb, adjective, adverb) occurs in the document / total number of words in the document
@attribute pos real
% Positive_Words: count of positive words in the document / total number of words in the document
@attribute Positive_Words real
% Negative_Words: count of negative words in the document / total number of words in the document
@attribute Negative_Words real
% Emotion_Words: count of emotion words in the document / total number of words in the document
@attribute Emotion_Words real
% First_Sent_Weight: weight given to the first sentence of each blog / total number of sentences in the document
@attribute First_Sent_Weight real

@data
0.4, 0.24, 0.43, 0.32, 0.65
0.32, 0.5, 0.74, 0.8, 0.43
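
As a rough sketch of how the lexicon-based ratio attributes above could be computed for one document, here is a small Python example. The word lists `POSITIVE_WORDS`, `NEGATIVE_WORDS` and `EMOTION_WORDS` and the helper names are hypothetical placeholders, not part of the dataset described here. `pos` and `First_Sent_Weight` are left out because they depend on a part-of-speech tagger and a weighting scheme that are not specified.

```python
import re

# Hypothetical toy lexicons; a real setup would load much larger word lists.
POSITIVE_WORDS = {"good", "great", "love", "happy"}
NEGATIVE_WORDS = {"bad", "awful", "hate", "sad"}
EMOTION_WORDS = {"love", "happy", "hate", "sad", "angry", "afraid"}


def lexicon_ratio(tokens, lexicon):
    """Return the share of tokens that appear in the given lexicon."""
    if not tokens:
        return 0.0
    return sum(1 for t in tokens if t in lexicon) / len(tokens)


def document_features(text):
    """Compute the three lexicon-based ratios for one document."""
    # Very simple tokenizer: lowercase runs of letters and apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    return {
        "Positive_Words": lexicon_ratio(tokens, POSITIVE_WORDS),
        "Negative_Words": lexicon_ratio(tokens, NEGATIVE_WORDS),
        "Emotion_Words": lexicon_ratio(tokens, EMOTION_WORDS),
    }


features = document_features("I love this great day but I hate the rain")
print(features)
```

Each value is a count divided by the total number of words, so every attribute stays between 0 and 1, which matches the sample rows in the `@data` section.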

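I have 5000 instances (I created them by computing each feature for each of 5000 blog documents). I pass these instances to the K-means clustering algorithm in the Weka tool to generate 6 clusters. My question is how to identify which cluster belongs to which emotion. Please suggest any ideas. Thanks in advance.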
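
To make the problem concrete: K-means output carries only numeric cluster ids, so naming a cluster needs some outside information. The Python sketch below shows one possible approach, assuming a small hand-labeled sample of documents is available. The function `label_clusters`, the cluster assignments and the emotion labels are all hypothetical illustration data, not results from the 5000 instances.

```python
from collections import Counter, defaultdict


def label_clusters(assignments, hand_labels):
    """Map each cluster id to the most common hand-assigned emotion
    among the labeled documents that fell into that cluster."""
    votes = defaultdict(Counter)
    for doc_id, emotion in hand_labels.items():
        votes[assignments[doc_id]][emotion] += 1
    return {cluster: counts.most_common(1)[0][0]
            for cluster, counts in votes.items()}


# Hypothetical clustering output: document index -> cluster id.
assignments = {0: 0, 1: 0, 2: 1, 3: 1, 4: 1, 5: 2}
# Hypothetical hand labels for a small subset of the documents.
hand_labels = {0: "joy", 1: "joy", 2: "anger", 3: "sadness", 4: "anger"}

cluster_to_emotion = label_clusters(assignments, hand_labels)
print(cluster_to_emotion)
```

A cluster with no hand-labeled members (cluster 2 here) gets no name under this scheme, and nothing guarantees that each of the 6 clusters lines up with exactly one emotion.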

0 answers:

No answers