Spark Enhanced Support for Multiple Operations


Spark supports several kinds of big data processing workloads within a single framework:
  • Streaming data
  • SQL queries
  • Graph data processing
  • Machine learning
  • MapReduce operations
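
To make the MapReduce item concrete, here is a minimal sketch of a word count in plain Python. It does not use Spark itself: the helper names `flat_map`, `reduce_by_key` and `word_count` are hypothetical stand-ins chosen for illustration, loosely mirroring the `flatMap`, `map` and `reduceByKey` steps that a Spark RDD job would typically chain together across a cluster.

```python
from collections import defaultdict
from functools import reduce


def flat_map(func, items):
    """Apply func to each item and flatten the results into one list."""
    return [out for item in items for out in func(item)]


def reduce_by_key(func, pairs):
    """Group (key, value) pairs by key, then fold each group's values with func."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return {key: reduce(func, values) for key, values in groups.items()}


def word_count(lines):
    """Count word occurrences using a map phase followed by a reduce phase."""
    # Map phase: split every line into words, then emit a (word, 1) pair per word.
    words = flat_map(str.split, lines)
    pairs = [(word.lower(), 1) for word in words]
    # Reduce phase: sum the 1s for each distinct word.
    return reduce_by_key(lambda a, b: a + b, pairs)


if __name__ == "__main__":
    sample = ["Spark is fast", "Spark is distributed"]
    print(word_count(sample))
```

In a real Spark job the same three steps would run in parallel on partitions of the data; this single-process version only illustrates the shape of the computation.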

Multi-platform Support

Apache Spark is flexible about both where it runs and which data stores it reads from. Spark applications can run:
  • In the cloud
  • In standalone cluster mode
Spark can also access a variety of data sources:
  • HBase
  • Tachyon (now Alluxio)
  • HDFS
  • Cassandra
  • Hive
  • Any Hadoop data source
Spark can be deployed on:
  • A distributed framework such as YARN or Mesos
  • A standalone server
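
In practice, the deployment target is usually selected through the master URL passed to `spark-submit --master`. The sketch below shows how those URLs are formed for each option. The function names `master_url` and `submit_command`, the host names, and the application file name are hypothetical; the ports 7077 and 5050 are the usual defaults for a standalone master and a Mesos master, and your cluster may use different ones.

```python
# Usual default ports; a given cluster may be configured differently.
DEFAULT_PORTS = {"standalone": 7077, "mesos": 5050}
URL_SCHEMES = {"standalone": "spark", "mesos": "mesos"}


def master_url(mode, host=None, port=None):
    """Return the Spark master URL for a deployment mode (illustrative helper)."""
    if mode == "local":
        # Run on the local machine with as many worker threads as cores.
        return "local[*]"
    if mode == "yarn":
        # The cluster location comes from the Hadoop/YARN configuration,
        # so no host or port appears in the URL.
        return "yarn"
    if mode in URL_SCHEMES:
        if host is None:
            raise ValueError(f"mode '{mode}' requires a master host")
        return f"{URL_SCHEMES[mode]}://{host}:{port or DEFAULT_PORTS[mode]}"
    raise ValueError(f"unknown deployment mode: {mode}")


def submit_command(mode, app, host=None, port=None):
    """Build the argument list for a basic spark-submit invocation."""
    return ["spark-submit", "--master", master_url(mode, host, port), app]


if __name__ == "__main__":
    # Hypothetical host and application names, for illustration only.
    print(" ".join(submit_command("standalone", "my_app.py", host="master.example.com")))
    print(" ".join(submit_command("yarn", "my_app.py")))
```

The application code stays the same across these targets; only the master URL changes.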

Spark Features

  • Open source
  • Fast processing
  • Distributed data processing
  • Highly productive
  • Scalable
