ml_corr

Compute correlation matrix

Description

Compute correlation matrix

Usage

ml_corr(x, columns = NULL, method = c("pearson", "spearman"))

Arguments

Argument Description
x A tbl_spark.
columns The names of the columns to calculate correlations of. If only one

column is specified, it must be a vector column (for example, assembled using ft_vector_assember()). method | The method to use, either "pearson" or "spearman".

Value

A correlation matrix organized as a data frame.

Examples


sc <- spark_connect(master = "local")
iris_tbl <- sdf_copy_to(sc, iris, name = "iris_tbl", overwrite = TRUE)

features <- c("Petal_Width", "Petal_Length", "Sepal_Length", "Sepal_Width")

ml_corr(iris_tbl, columns = features, method = "pearson")