validate#

tpcp.validate.validate(pipeline: Pipeline, dataset: Dataset, *, scoring: Callable | Scorer | None = None, n_jobs: int | None = None, verbose: int = 0, pre_dispatch: str | int = '2*n_jobs', progress_bar: bool = True)[source]#

Evaluate a pipeline on a dataset without any optimization.

Parameters:
pipeline

A Pipeline to evaluate on the given data.

dataset

A Dataset containing all information.

scoring

A callable that can score a single data point given a pipeline. This function should return either a single score or a dictionary of scores. If scoring is None the default score method of the optimizable is used instead.

n_jobs

Number of jobs to run in parallel. One job is created per datapoint. The default (None) means 1 job at the time, hence, no parallel computing.

verbose

The verbosity level (larger number -> higher verbosity). At the moment this only effects Parallel.

pre_dispatch

The number of jobs that should be pre dispatched. For an explanation see the documentation of Parallel.

progress_bar

True/False to enable/disable a tqdm progress bar.

Examples using tpcp.validate.validate#

Validation

Validation