'train'和'class'具有不同的长度错误

时间:2019-02-04 20:53:01

标签: r dataframe training-data

尝试在数据集中使用knn模型时,我不断收到“班级和训练的长度不同”错误。

newDF<- newDF[c(14, 1:13)]
newDF
str(newDF)

newDF1 <- newDF[c(2:11, 14)]
newDF1

df_train = newDF1[1:47385,]
dim(df_train)

df_test = newDF1[47386:59231,]
dim(df_test)


train_lbl <- newDF[1:47385,1]
test_lbl <- newDF[47386:59231,1]
dim(train_lbl)

install.packages("class")
library(class)

newDF_pred <- knn(train = df_train, test = df_test, cl = train_lbl, k = 245)

CrossTable(x = test_lbl, y=newDF_pred, propchisq=FALSE)

newDF是我的整个数据集,而newDF1仅包含数据类型“ num”

问题出在哪里,我该如何解决?

这是数据: -10lgP质量长度ppm m/z RT起始终止Intensity Sample 9 Precursor Id的范围                                                   1 0.543 0.234 0.245 0.348 0.0310 0.543 0.234 0.245 0.348 0.0310 0.0254  2 0.198 0.476 0.499 0.348 0.588 0.198 0.476 0.499 0.348 0.588 0.0256  3 0.234 0.245 0.348 0.0310 0.543 0.234 0.245 0.348 0.0310 0.543 0.0255  4 0.476 0.499 0.348 0.588 0.198 0.476 0.499 0.348 0.588 0.198 0.0254  5 0.245 0.348 0.0310 0.543 0.234 0.245 0.348 0.0310 0.543 0.234 0.0254  6 0.499 0.348 0.588 0.198 0.476 0.499 0.348 0.588 0.198 0.476 0.0256  7 0.348 0.0310 0.543 0.234 0.245 0.348 0.0310 0.543 0.234 0.245 0.0255  8 0.348 0.588 0.198 0.476 0.499 0.348 0.588 0.198 0.476 0.499 0.0254  9 0.0310 0.543 0.234 0.245 0.348 0.0310 0.543 0.234 0.245 0.348 0.0254 10 0.588 0.198 0.476 0.499 0.348 0.588 0.198 0.476 0.499 0.348 0.0256

...还有59,221行

班级和火车的规模如下: 昏暗(train_lbl) [1] 47385 1

  

dim(df_train)   [1] 47385 11

0 个答案:

没有答案
相关问题