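I have a problem with the preProc argument of the train function in R's caret package. I want to center and scale my predictors but ignore the factor columns. When I preprocess outside of train it works fine, but I would like to do the preprocessing inside the train function. Am I missing something?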
Here is an example in which the factor predictor is ignored when preProcess is used outside of train:
library(caret)

df <- data.frame(
  score = runif(1000, 80, 110),
  var1 = as.factor(sample(0:1, 1000, replace = TRUE)),
  var2 = runif(1000, 5, 25)
)
preProcess(df[-1], method = c('center', 'scale'))
Created from 1000 samples and 2 variables
Pre-processing:
- centered (1)
- ignored (1)
- scaled (1)
Here is what happens when I use preProc inside train:
df <- data.frame(
  score = runif(1000, 80, 110),
  var1 = as.factor(sample(0:1, 1000, replace = TRUE)),
  var2 = runif(1000, 5, 25)
)
mod <- train(score ~ ., data = df,
             method = "lm",
             preProc = c("center", "scale"))
mod$preProcess
Created from 1000 samples and 2 variables
Pre-processing:
- centered (2)
- ignored (0)
- scaled (2)
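
A possible explanation for the difference, offered as a sketch rather than something confirmed here from caret's source: the formula interface of train appears to expand the predictors with model.matrix() before any preprocessing, so the factor var1 reaches preProcess as a numeric 0/1 dummy column and is no longer recognized as a factor. The snippet below uses base R only (no caret) to show that expansion on the same data; the set.seed call and the variable name x are additions for illustration.

```r
# Illustration only: base R, no caret required.
set.seed(1)  # assumption: seed added so the example is reproducible

df <- data.frame(
  score = runif(1000, 80, 110),
  var1 = as.factor(sample(0:1, 1000, replace = TRUE)),
  var2 = runif(1000, 5, 25)
)

# In the raw data frame, var1 is a factor and var2 is numeric.
sapply(df[-1], is.numeric)
#  var1  var2
# FALSE  TRUE

# model.matrix() is what a formula such as score ~ . expands to.
# It converts the factor into a numeric dummy column; the intercept is dropped here.
x <- model.matrix(score ~ ., data = df)[, -1, drop = FALSE]

colnames(x)
# "var11" "var2"
is.numeric(x)
# TRUE
```

If that is what happens inside train, both columns look numeric by the time centering and scaling are applied, which would match the "centered (2)" and "ignored (0)" lines above.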