R interface to Apache Spark ™
Interact with Spark using familiar R interfaces, such as
dplyr,broom, andDBI.Gain access to Spark’s distributed Machine Learning libraries, Structure Streaming,and ML Pipelines from R.
Extend your toolbox by adding XGBoost, MLeap, H2O and Graphframes to your Spark plus R analysis.
Connect R wherever Spark runs: Hadoop, Mesos, Kubernetes, Stand Alone, and Livy.
Run distributed R code inside Spark
Get Started
Welcome new users! Start here to learn how to install and use sparklyr.
Guides
“How-to” articles to help you learn how to do things such as: connect AWS S3 buckets, handling Streaming Data, create ML Pipelines and others.
Deployment
Articles on Spark environments. Including AWS EMR, Databricks and Qubole.