Apache Hadoop and Spark both are used for Data processing purpose , but there will few changes adopted in each side to complete the process .
Spark is a distributed processing engine and
HDFS is distributed storage system.
However both Hadoop and Spark do not perform exactly the same tasks, and they are not mutually exclusive, as they are able to work together. Although Spark is reported to work up to 100 times faster than Hadoop in certain circumstances, it does not provide its own distributed storage system.Because Apache Spark is not tied to two stage map reduce paradigm .