I installed Spark-1.4.0. I also installed my R SparkR package, and I can use it through Spark-shell and through RStudio, however there is one difference that I cannot solve.
When launching the SparkR shell
./bin/sparkR --master local[7] --packages com.databricks:spark-csv_2.10:1.0.3
I can read the CSV file as follows
flights <- read.df(sqlContext, "data/nycflights13.csv", "com.databricks.spark.csv", header="true")
Unfortunately, when I run SparkR through RStudio (setting my SPARK_HOME correctly), I get the following error message:
15/06/16 16:18:58 ERROR RBackendHandler: load on 1 failed
Caused by: java.lang.RuntimeException: Failed to load class for data source: com.databricks.spark.csv
I know I have to load com.databricks: spark-csv_2.10: 1.0.3 in a way, but I have no idea how to do this. Can someone help me?
source
share