If you are familiar with Dockers, the instructions below would help you get started with Spark in no time.
- Download the Spark from https://spark.apache.org/downloads.html page. Remember to select a package type with option such as “Pre-built…”. Once the zipped files are downloaded, unzip the files under the location “C:\Users\<Username>”
- Build Java8 image and start the container. Follow the instructions on this page, http://vitalflux.com/dockers-how-to-get-started-with-java8-dev-environment/.
- Once the container is started, go to the folder where you would find spark files. Go to bin folder. The path could look like cd /mnt/Users/<Your_Username>/spark-1.6.0-bin-hadoop2.6/bin.
- Execute the command “./pyspark” and you will get started with following screenshot: