Apache Spark 1.5.0 recently released, is the sixth release on the 1.x line. This release represents 1400+ patches from 230+ contributors and 80+ institutions. Apache Spark is a fast and general engine for large-scale data processing. Spark has an advanced DAG execution engine that supports cyclic data flow and in-memory computing and runs on Hadoop, Mesos, standalone, or in the cloud. It can access diverse data sources including HDFS, Cassandra, HBase, and S3.