你的位置：首页>programmer>r - How to find optimal split of train and test to return the minimum RMSE for Boston housing data set without looping - Stack O

r - How to find optimal split of train and test to return the minimum RMSE for Boston housing data set without looping - Stack O

programmeradmin2025-02-052浏览0评论

I'm working to minimize the RMSE for the Boston housing data set. This is a very basic result:

library(Metrics)
df <- MASS::Boston
train <- df[1:400, ]
test <- df[401:506, ]
Boston_lm <- lm(medv ~., data = train)
Boston_lm_RMSE <- Metrics::rmse(actual = test$medv,
predicted = predict(object = Boston_lm, newdata = test))
# 6.155792

However, if the amount of train and test is changed, the RMSE is very different:

df <- MASS::Boston
train <- df[1:300, ]
test <- df[301:506, ]
Boston_lm <- lm(medv ~., data = train)
Boston_lm_RMSE <- Metrics::rmse(actual = test$medv,
predicted = predict(object = Boston_lm, newdata = test))
# 19.13284

Is there a way to determine the train and test amounts that return the lowest RMSE on the test data set without looping through a range of possible values?

与本文相关的文章

评论列表(0)

暂无评论

科技改变生活-雨落星辰 - 所有的伟大,都源于一个勇敢的开始

与本文相关的文章

评论列表(0)