Real World Spark 2 - ScalaIDE Spark Core 2 Developer
4.0 (1 rating)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
83 students enrolled
Wishlisted Wishlist

Please confirm that you want to add Real World Spark 2 - ScalaIDE Spark Core 2 Developer to your Wishlist.

Add to Wishlist

Real World Spark 2 - ScalaIDE Spark Core 2 Developer

Build a Vagrant box, walk through Spark 2 Core Code via sbt and ScalaIDE. The modern cluster computation engine.
4.0 (1 rating)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
83 students enrolled
Created by Toyin Akin
Last updated 1/2017
English
Curiosity Sale
Current price: $10 Original price: $90 Discount: 89% off
30-Day Money-Back Guarantee
Includes:
  • 3.5 hours on-demand video
  • 3 Articles
  • 2 Supplemental Resources
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
What Will I Learn?
  • Simply run a single command on your desktop, go for a coffee, and come back with a running distributed environment for cluster deployment
  • Code in Scala against Spark. Transformation, Actions and Spark Monitoring
  • Debug Spark Code within ScalaIDE
View Curriculum
Requirements
  • Basic programming or scripting experience is required.
  • You will need a desktop PC and an Internet connection. The course is created with Windows in mind.
  • The software needed for this course is freely available
  • Optional : This course is based on top of my previous course - "Real World Vagrant - Build an Apache Spark Development - Toyin Akin"
  • You will require a computer with a Virtualization chipset support - VT-x. Most computers purchased over the last five years should be good enough
  • Optional : Some exposure to Linux and/or Bash shell environment
  • 64-bit Windows operating system required (Would recommend Windows 7 or above)
  • This course is not recommened if you have no desire to work with/in distributed computing
Description

Note : This course is built on top of the "Real World Vagrant - Build an Apache Spark Development Env! - Toyin Akin" course. So if you do not have a Spark + ScalaIDE environment already installed (within a VM or directly installed), you can take the stated course above.

Scala IDE provides advanced editing and debugging support for the development of pure Scala and mixed Scala-Java applications. 

Now with a shiny Scala debugger, semantic highlight, more reliable JUnit test finder, an ecosystem of related plugins, and much more.

Scala Debugger. Stepping through closures and Scala-aware display of debugging information.

Spark Monitoring and Instrumentation

While creating RDDs, performing transformations and executing actions, you will be working heavily within the monitoring view of the Web UI.

Every SparkContext launches a web UI, by default on port 4040, that displays useful information about the application. This includes:

A list of scheduler stages and tasks
A summary of RDD sizes and memory usage
Environmental information.
Information about the running executors

Why Apache Spark ...

Apache Spark run programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk. Apache Spark has an advanced DAG execution engine that supports cyclic data flow and in-memory computing. Apache Spark offers over 80 high-level operators that make it easy to build parallel apps. And you can use it interactively from the Scala, Python and R shells. Apache Spark can combine SQL, streaming, and complex analytics.

Apache Spark powers a stack of libraries including SQL and DataFrames, MLlib for machine learning, GraphX, and Spark Streaming. You can combine these libraries seamlessly in the same application.

Who is the target audience?
  • Software engineers who want to expand their skills into the world of distributed computing
  • Developers who want to write/test their code against Scala / Spark
Students Who Viewed This Course Also Viewed
Curriculum For This Course
22 Lectures
03:43:23
+
Introduction to Scala, Spark Core via ScalaIDE
2 Lectures 09:53

A quick tour of ScalaIDE with Spark

Preview 09:53

Suggested Spark Udemy curriculum courses to follow. You do not need to
take/purchase the first three courses if you already have spark
installed.

Preview 00:00
+
Author, Equipment and Compensation
4 Lectures 25:57

My experience within the Enterprise

Preview 11:28

Spark job compensation for those in this field.

Preview 07:09

Memory Requirements
00:17

Recommended Hardware for Spark and Hadoop labs ...

Recommended Hardware for Spark and Hadoop labs ...
07:03
+
Setup the Environment
5 Lectures 35:33

Resource files for the course

Resource files for the course
00:30

Spark setup

Spark setup
04:03

Walking through the Base Vagrant Spark Box

Walking through the Base Vagrant Spark Box
16:40

Upgrade and Package the Vagrant Box to Spark 2

Upgrade and Package the Vagrant Box to Spark 2
11:28

Register the updated Vagrant Spark Box

Register the updated Vagrant Spark Box
02:52
+
Spark Core for Scala Developers (ScalaIDE)
10 Lectures 02:25:59

Boot up and Walkthrough of Spark ScalaIDE Environment

Boot up and Walkthrough of Spark ScalaIDE Environment
13:11

Configure and Startup a Spark Environment for Distributed Computing

Configure and Startup a Spark Environment for Distributed Computing
12:39

Scala Spark RDD, Transformations, Actions and Monitoring I

Scala Spark RDD, Transformations, Actions and Monitoring I
16:07

Scala Spark RDD, Transformations, Actions and Monitoring II

Scala Spark RDD, Transformations, Actions and Monitoring II
12:53

Scala Spark RDD, Transformations, Actions and Monitoring III

Scala Spark RDD, Transformations, Actions and Monitoring III
16:14

Scala Spark RDD, Transformations, Actions and Monitoring IV

Scala Spark RDD, Transformations, Actions and Monitoring IV
16:16

Scala Spark RDD, Transformations, Actions and Monitoring V

Scala Spark RDD, Transformations, Actions and Monitoring V
16:58

Scala Spark RDD, Transformations, Actions and Monitoring VI

Scala Spark RDD, Transformations, Actions and Monitoring VI
18:54

Scala Spark RDD, Transformations, Actions and Monitoring VII

Scala Spark RDD, Transformations, Actions and Monitoring VII
11:42

Scala Spark RDD, Transformations, Actions and Monitoring VIII

Scala Spark RDD, Transformations, Actions and Monitoring VIII
11:05
+
Conclusion
1 Lecture 06:02

Conclusion

Conclusion
06:02
About the Instructor
Toyin Akin
3.8 Average rating
135 Reviews
1,374 Students
15 Courses
Big Data Engineer, Capital Markets FinTech Developer

I spent 6 years at "Royal Bank of Scotland" and 5 years at the investment bank "BNP Paribas"  developing and managing Interest Rate Derivatives services as well as engineering and deploying In Memory DataBases (Oracle Coherence), NoSQL and Hadoop clusters (Cloudera) into production.

In 2016, I left to start my own training, POC-D. "Proof Of Concept - Delivered", which focuses on delivering training on IMDB (In Memory Database), NoSQL, BigData and DevOps technology. 

From Q3 2017, this will also include FinTech Training in Capital Markets using Microsoft Excel (Windows), JVM languages (Java/Scala) as well as .NET (C#, VB.NET, C++/CLI, F# and IronPythyon)

I have a YouTube Channel, publishing snippets of my videos. These are not courses. Simply ad-hoc videos discussing various distributed computing ideas.

Check out my website and/or YouTube for more info

See you inside ...