Uses very generic dplyr code to aggregate data and then `ggplot2` to create the histogram. Because of this approach, the calculations automatically run inside the database if `data` has a database or sparklyr connection. The `class()` of such tables in R are: tbl_sql, tbl_dbi, tbl_spark
dbplot_histogram(data, x, bins = 30, binwidth = NULL)
data | A table (tbl) |
---|---|
x | A continuous variable |
bins | Number of bins. Defaults to 30. |
binwidth | Single value that sets the side of the bins, it overrides bins |
# A ggplot histogram with bins of size 10 mtcars %>% dbplot_histogram(mpg, binwidth = 10)