Spark Enhanced Support for Multiple Operations


Spark supports several kinds of data processing workloads that are common in the big data industry:
  • Streaming data
  • SQL queries
  • Graph data processing
  • Machine learning
  • MapReduce operations
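
To make the last item concrete, here is a minimal sketch of the map, group and reduce pattern in plain Python. It does not use Spark at all; in Spark the same shape is usually written with RDD operations such as `flatMap`, `map` and `reduceByKey`. The function name `word_count` and the sample lines are made up for illustration.

```python
from collections import defaultdict
from functools import reduce


def word_count(lines):
    """Count words with an explicit map / group / reduce pipeline (illustrative only)."""
    # Map phase: emit a (word, 1) pair for every word in every line.
    pairs = [(word.lower(), 1) for line in lines for word in line.split()]

    # Shuffle phase: group the emitted values by their key.
    grouped = defaultdict(list)
    for word, one in pairs:
        grouped[word].append(one)

    # Reduce phase: combine the values of each key into a single count.
    return {word: reduce(lambda a, b: a + b, ones) for word, ones in grouped.items()}


# Hypothetical sample input.
sample_lines = ["spark is fast", "spark is distributed"]
counts = word_count(sample_lines)
print(counts)
```

On a cluster, Spark runs the map and reduce phases in parallel across partitions of the data; the sketch above only shows the logical steps on a single machine.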

Multi-platform Support

Apache Spark is interoperable across both the platforms it runs on and the data sources it reads from. Spark applications can run:
  • in the cloud
  • in standalone cluster mode
Spark can also access a variety of data sources:
  • HBase
  • Tachyon (since renamed Alluxio)
  • HDFS
  • Cassandra
  • Hive
  • Any Hadoop data source
Spark can be deployed on:
  • A cluster manager such as YARN or Mesos
  • A standalone server
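
The deployment target is normally chosen through the master URL passed to `spark-submit --master` (or set as `spark.master`). The sketch below builds the URL strings Spark accepts for each option listed above. The helper `master_url` and the host names are hypothetical, and the ports shown are only the conventional defaults (7077 for a standalone master, 5050 for Mesos); your cluster may use different ones.

```python
# Conventional default ports; treat these as assumptions about your cluster.
DEFAULT_PORTS = {"standalone": 7077, "mesos": 5050}
URL_SCHEMES = {"standalone": "spark", "mesos": "mesos"}


def master_url(mode, host=None, port=None):
    """Return a Spark master URL for a deployment mode (hypothetical helper)."""
    if mode == "local":
        # Run on a single machine, using as many worker threads as cores.
        return "local[*]"
    if mode == "yarn":
        # The cluster location is read from the Hadoop configuration,
        # so the URL carries no host or port.
        return "yarn"
    if mode not in URL_SCHEMES:
        raise ValueError(f"unknown deployment mode: {mode!r}")
    if host is None:
        raise ValueError(f"mode {mode!r} needs a master host")
    return f"{URL_SCHEMES[mode]}://{host}:{port or DEFAULT_PORTS[mode]}"


# Placeholder host names, for illustration only.
print(master_url("local"))
print(master_url("yarn"))
print(master_url("standalone", "master.example.com"))
print(master_url("mesos", "mesos.example.com", 5051))
```

The resulting string would then be used as, for example, `spark-submit --master yarn app.py`, where `app.py` stands in for your own application.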

Spark Features

  • Open source
  • Fast processing
  • Distributed data processing
  • Highly productive
  • Scalable
