sdf_schema

Read the Schema of a Spark DataFrame

Description

Read the schema of a Spark DataFrame.

Usage

sdf_schema(x, expand_nested_cols = FALSE, expand_struct_cols = FALSE)

Arguments

Argument Description
x A spark_connection, ml_pipeline, or a tbl_spark.
expand_nested_cols Whether to expand columns containing nested arrays of structs (which are usually created by tidyr::nest on a Spark data frame).
expand_struct_cols Whether to expand columns containing structs.

Details

The type field returned for each column gives the string representation of the underlying Spark type for that column; for example, a column of numeric (double) values is reported with the type "DoubleType". Please see the Spark Scala API Documentation (https://spark.apache.org/docs/latest/api/scala/index.html) for information on what types are available and exposed by Spark.

Value

A list, with each list element describing the name and type of a column.
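
The returned list is often easier to inspect as a data frame. The following is a minimal sketch of one way to flatten it, not part of sdf_schema itself. To stay runnable without a Spark connection it uses a hand-built list, assumed to mirror the documented shape (one element per column, each with a name and a type); the column names id, price, and label are hypothetical. With a live connection, the list would instead come from a call such as sdf_schema(my_tbl), where my_tbl is a placeholder for a tbl_spark.

```r
# Hypothetical stand-in for the result of sdf_schema(my_tbl).
# Assumption: each element is a list with `name` and `type` entries.
schema <- list(
  id    = list(name = "id",    type = "IntegerType"),
  price = list(name = "price", type = "DoubleType"),
  label = list(name = "label", type = "StringType")
)

# Flatten the list into a two-column data frame, one row per column.
schema_df <- data.frame(
  name = vapply(schema, function(col) col$name, character(1), USE.NAMES = FALSE),
  type = vapply(schema, function(col) col$type, character(1), USE.NAMES = FALSE),
  stringsAsFactors = FALSE
)

# Pick out the names of columns that Spark stores as doubles.
double_cols <- schema_df$name[schema_df$type == "DoubleType"]

print(schema_df)
print(double_cols)
```

Filtering on the type string in this way can be handy for selecting columns by Spark type before applying a transformation, though the exact set of type names depends on the Spark version in use.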