Apache Spark is an open source framework that provides highly generalizable methods to process data in parallel. On its own, Spark is not a data storage solution. Spark can be run locally, on a single machine with a single JVM (called local mode). More often Spark is used in tandem with a distributed storage system to write the data processed with Spark (such as HDFS, Cassandra, or S3) and a cluster manager to manage the distribution of the application across the cluster. Spark currently supports three kinds of cluster managers: the manager included in Spark, called the Standalone Cluster Manager, which requires Spark to be installed in each node of a cluster, Apache Mesos; and Hadoop YARN.
Various components of spark
I am Reddy having 10 years of IT experience.For the last 4 years I have been working on Bigdata.
From Bigdata perspective,I had working experience on Kafka,Spark,and Hbase,cassandra,hive technologies.
And also I had working experience with AWS and Java technologies.
I have the experience in desigining and implemeting lambda architecture solutions in bigdata
Has experience in Working with Rest API and worked in various domains like financial ,insurance,manufacuring.
I am so passinate about new technologies.
BigDataTechnologies is a online training provider and has many experienced lecturers who will proivde excellent training.
BigDataTechnologies has extensive experience in providing training for Java,AWS,iphone,Mapredue,hive,pig,hbase,cassandra,Mongodb,spark,storm and Kafka.
From skills that will help you to develop and future proof your career to immediate solutions to every day tech challenges.
Main objective is to provide high quality content to all students