Each individual work divides into smaller sized sets of responsibilities known as phases that rely upon one another. Stages are categorised as computational boundaries. All computation cannot be finished in a single stage. It achieves over numerous stages. RDD or Resilient Dispersed Datasets are supplied by the Spark, that's also http://fredi891axt7.get-blogging.com/profile