Run a Super Learner for the provided subset of features
Run a Super Learner for the provided subset of features
run_sl(
Y = NULL,
X = NULL,
V = 5,
SL.library = "SL.glm",
univariate_SL.library = NULL,
s = 1,
cv_folds = NULL,
sample_splitting = TRUE,
ss_folds = NULL,
split = 1,
verbose = FALSE,
progress_bar = NULL,
indx = 1,
weights = rep(1, nrow(X)),
cross_fitted_se = TRUE,
full = NULL,
vector = TRUE,
...
)
run_sl(
Y = NULL,
X = NULL,
V = 5,
SL.library = "SL.glm",
univariate_SL.library = NULL,
s = 1,
cv_folds = NULL,
sample_splitting = TRUE,
ss_folds = NULL,
split = 1,
verbose = FALSE,
progress_bar = NULL,
indx = 1,
weights = rep(1, nrow(X)),
cross_fitted_se = TRUE,
full = NULL,
vector = TRUE,
...
)
the outcome
the covariates
the number of folds
the library of candidate learners
the library of candidate learners for single-covariate regressions
the subset of interest
the CV folds
logical; should we use sample-splitting for predictiveness estimation?
the sample-splitting folds; only used if
sample_splitting = TRUE
the split to use for sample-splitting; only used if
sample_splitting = TRUE
should we print progress? defaults to FALSE
the progress bar to print to (only if verbose = TRUE)
the index to pass to progress bar (only if verbose = TRUE)
weights to pass to estimation procedure
if TRUE
, uses a cross-fitted estimator of
the standard error; otherwise, uses the entire dataset
should this be considered a "full" or "reduced" regression?
If NULL
(the default), this is determined automatically; a full
regression corresponds to s
being equal to the full covariate vector.
For SPVIMs, can be entered manually.
should we return a vector (TRUE
) or a list (FALSE
)?
other arguments to Super Learner
a list of length V, with the results of predicting on the hold-out data for each v in 1 through V
a list of length V, with the results of predicting on the hold-out data for each v in 1 through V