When I try to use a knn model on my dataset, I keep getting the error "'train' and 'class' have different lengths".
newDF<- newDF[c(14, 1:13)]
newDF
str(newDF)
newDF1 <- newDF[c(2:11, 14)]
newDF1
df_train = newDF1[1:47385,]
dim(df_train)
df_test = newDF1[47386:59231,]
dim(df_test)
train_lbl <- newDF[1:47385,1]
test_lbl <- newDF[47386:59231,1]
dim(train_lbl)
install.packages("class")
library(class)
newDF_pred <- knn(train = df_train, test = df_test, cl = train_lbl, k = 245)
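library(gmodels)  # provides CrossTable()
CrossTable(x = test_lbl, y = newDF_pred, prop.chisq = FALSE)
newDF is my whole dataset, and newDF1 contains only the columns of data type "num".
Where is the problem, and how can I fix it?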
Here is the data:
-10lgP  Mass  Length  ppm  m/z  RT  Start  End  Intensity Sample 9  Precursor Id  Range
1 0.543 0.234 0.245 0.348 0.0310 0.543 0.234 0.245 0.348 0.0310 0.0254
2 0.198 0.476 0.499 0.348 0.588 0.198 0.476 0.499 0.348 0.588 0.0256
3 0.234 0.245 0.348 0.0310 0.543 0.234 0.245 0.348 0.0310 0.543 0.0255
4 0.476 0.499 0.348 0.588 0.198 0.476 0.499 0.348 0.588 0.198 0.0254
5 0.245 0.348 0.0310 0.543 0.234 0.245 0.348 0.0310 0.543 0.234 0.0254
6 0.499 0.348 0.588 0.198 0.476 0.499 0.348 0.588 0.198 0.476 0.0256
7 0.348 0.0310 0.543 0.234 0.245 0.348 0.0310 0.543 0.234 0.245 0.0255
8 0.348 0.588 0.198 0.476 0.499 0.348 0.588 0.198 0.476 0.499 0.0254
9 0.0310 0.543 0.234 0.245 0.348 0.0310 0.543 0.234 0.245 0.348 0.0254
10 0.588 0.198 0.476 0.499 0.348 0.588 0.198 0.476 0.499 0.348 0.0256
The dimensions of the class labels and the training set are as follows:
dim(train_lbl) [1] 47385 1
dim(df_train) [1] 47385 11
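
A possible reading of the dimensions above: `dim(train_lbl)` returning two numbers means `train_lbl` is a one-column table rather than a plain vector. That is what single-bracket indexing returns when `newDF` is a tibble, which is an assumption here. `knn()` compares `length(cl)` with the number of training rows, and the length of a one-column table is 1, not 47385. The sketch below reproduces this on a small made-up data frame (`toy`, `train_idx`, `test_idx`, `lbl_df`, and `lbl_vec` are hypothetical names, not from the question) and shows one way to pull the labels out as a vector.

```r
library(class)  # provides knn()

set.seed(1)
# Hypothetical stand-in for newDF: a label column followed by two numeric columns
toy <- data.frame(label = rep(c("a", "b"), each = 10),
                  x1 = c(rnorm(10, 0), rnorm(10, 3)),
                  x2 = c(rnorm(10, 0), rnorm(10, 3)))

train_idx <- c(1:7, 11:17)
test_idx  <- c(8:10, 18:20)

# drop = FALSE keeps a one-column data frame, mimicking how a tibble behaves
lbl_df <- toy[train_idx, 1, drop = FALSE]
dim(lbl_df)     # two-dimensional, like the 47385 x 1 result in the question
length(lbl_df)  # counts columns, not rows, so it cannot match nrow(train)

# Extracting the column with [[ ]] gives a plain vector, one label per training row
lbl_vec <- factor(toy[["label"]][train_idx])

pred <- knn(train = toy[train_idx, 2:3], test = toy[test_idx, 2:3],
            cl = lbl_vec, k = 3)
```

Applied to the code in the question, the equivalent change would be something like `train_lbl <- newDF[[1]][1:47385]` and `test_lbl <- newDF[[1]][47386:59231]`, assuming the first column of `newDF` holds the class labels.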