sdf_sample

Randomly Sample Rows from a Spark DataFrame

Description

Draw a random sample of rows (with or without replacement) from a Spark DataFrame.

Usage

sdf_sample(x, fraction = 1, replacement = TRUE, seed = NULL)

Arguments

Argument Description
x An object coercable to a Spark DataFrame.
fraction The fraction to sample.
replacement Boolean; sample with replacement?
seed An (optional) integer seed.

See Also

Other Spark data frames: sdf_copy_to(), sdf_distinct(), sdf_random_split(), sdf_register(), sdf_sort(), sdf_weighted_sample()