Question

我英语不好。对于那个很抱歉。我想可视化word2vec模型的最相似结果。

我用文字训练了word2vec模型，并带来了topn = 100 most_similar结果

我只想用 TSNE （不是我模型的所有语音）来可视化结果，但是我不确定该怎么做。我的代码在这里。

"X_tsne = tsne.fit_transform(X[:50, :])"处的错误

我认为结果也是列表，但是没有用。

vocab = list(result)
X = vocab


tsne = TSNE(n_components = 2)
X_tsne = tsne.fit_transform(X[:50, :]) #Error

df = pd.DataFrame(X_tsne, index = vocab[:50], columns = ['x', 'y'])
df.shape

df.head(10)

fig = plt.figure()
fig.set_size_inches(40, 20)
ax = fig.add_subplot(1, 1, 1)

ax.scatter(df['x'], df['y'])

for word, posi in df.iterrows():
    ax.annotate(word, posi, fontsize = 30)
plt.show()

如何仅对word2vec model.most_like使用TSNE？

0 个答案: