Elasticsearch 5.x Solutions – Node and Cluster Management
0.0 (0 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
15 students enrolled
Wishlisted Wishlist

Please confirm that you want to add Elasticsearch 5.x Solutions – Node and Cluster Management to your Wishlist.

Add to Wishlist

Elasticsearch 5.x Solutions – Node and Cluster Management

Get deep knowledge on the node and cluster management aspects of Elasticsearch
0.0 (0 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
15 students enrolled
Created by Packt Publishing
Last updated 7/2017
Current price: $10 Original price: $125 Discount: 92% off
5 hours left at this price!
30-Day Money-Back Guarantee
  • 1 hour on-demand video
  • 1 Supplemental Resource
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
What Will I Learn?
  • Control cluster health and state via an API
  • Use the task management and Hot thread API
  • Manage repositories
  • Manage Kibana dashboards
  • Visualize data with Kibana
  • Look at PUT, GET, and DELETE an ingest pipeline
  • Built-in and Grok processors
View Curriculum
  • Some understanding of JSON is expected. If you want to extend Elasticsearch, understanding of Java and related technologies is also required.

This highly practical course focuses primarily on the node and cluster management aspects of Elasticsearch. The video contains recipes and hands-on solutions to backing up and restoring your nodes and clusters in Elasticsearch, as well as working with user interfaces.

About The Author

Alberto Paro is an engineer, project manager, and software developer. He currently works as freelance trainer/consultant on big data technologies and NoSQL solutions. He loves to study emerging solutions and applications mainly related to big data processing, NoSQL, natural language processing, and neural networks. He began programming in BASIC on a Sinclair Spectrum when he was eight years old, and to date, has collected a lot of experience using different operating systems, applications, and programming languages.

In 2000, he graduated in computer science engineering from Politecnico di Milano with a thesis on designing multiuser and multidevice web applications. He assisted professors at the university for about a year. He then came in contact with The Net Planet Company and loved their innovative ideas; he started working on knowledge management solutions and advanced data mining products. In summer 2014, his company was acquired by a big data technologies company, where he worked until the end of 2015 mainly using Scala and Python on state-of-the-art big data software (Spark, Akka, Cassandra, and YARN). In 2013, he started freelancing as a consultant for big data, machine learning, Elasticsearch and other NoSQL products. He has created or helped to develop big data solutions for business intelligence, financial, and banking companies all over the world. A lot of his time is spent teaching how to efficiently use big data solutions (mainly Apache Spark), NoSql datastores (Elasticsearch, HBase, and Accumulo) and related technologies (Scala, Akka, and Playframework). He is often called to present at big data or Scala events. He is an evangelist on Scala and Scala.js (the transcompiler from Scala to JavaScript).

Who is the target audience?
  • If you are a developer who wants to get the most out of Elasticsearch for advanced search and analytics, this is the video for you.
Students Who Viewed This Course Also Viewed
Curriculum For This Course
21 Lectures
Managing Clusters and Nodes
6 Lectures 26:14

This video gives an overview of the entire course.      

Preview 02:49

Elasticsearch provides a convenient way to manage the cluster state, which is one of the first things to check if any problems occur. If you need more details on your cluster, you need to query its state.       

Controlling Cluster Health and State Via an API

In production clusters, it's very important to monitor nodes via this API to detect misconfiguration and problems relating to 

different plugins and modules.       

Getting Nodes Information and Statistics Via API

When some actions are called, they create a server side task that executes the job. The task management API allows you to control these actions. Sometimes, your cluster slows down due to massive CPU usage and you need to understand why.       

Using the Task Management and the Hot thread API

Sometimes, due to massive relocation, or due to nodes restarting or some other cluster issues, it's necessary to monitor or define custom shard allocation.       

Managing the Shard Allocation

Monitoring the index segments means monitoring the health of an index. Cleaning the cache helps to speed up searching, such as cache results, items and filter results.       

Monitoring Segments and Clearing the Cache
Backup and Restore
4 Lectures 12:19

An Elasticsearch snapshot allows for the creation of snapshots of individual indices (or aliases), or an entire cluster, into a remote repository.       

Preview 03:43

 We can create snapshots of indices, a full backup of an index, in the exact instant that the command is called       

Executing a Snapshot

 Once you have the snapshots of your data, it can be restored.       

Restoring a Snapshot

It's not possible to restore backups of a newer Elasticsearch version in an older version. The restore is only forward-compatible. So, we need reindexing from a remote cluster.       

Reindexing from a Remote Cluster
User Interfaces
6 Lectures 18:17

 Cerebro is a partial rewrite of the previous plugin available as a self-working application server.       

Preview 05:56

Kibana is an opensource pluggable interface, free to change to be used for Elasticsearch. It provides data visualization and data discovery and with commercial products such as X-Pack, and also supports security, graph, and cluster monitoring.       

Installing Kibana and X-Pack

 The core of Kibana are the dashboards–an aggregation of widgets that are the results of queries and aggregations.            

Managing Kibana Dashboards

X-Pack provides cluster functionalities that allows to control and monitor your nodes and cluster. This is a very useful component of X-Pack as it is the lifesaver on large installations.       

Monitoring with Kibana

X-Pack provides cluster functionalities that allow you to control and monitor your nodes and cluster. This is a very useful component of X-Pack, as it is the lifesaver on large installations.       

Using Kibana dev-console

Kibana allows you to create reusable data representations called visualizations. They are the representations of aggregations and can be used to power up the dashboard with custom graphs.       

Visualizing Data with Kibana
5 Lectures 15:42

 The job of ingest nodes is to pre-process the documents before sending them to the data nodes.       

Preview 01:55

The definition is stored in a cluster state via the put pipeline  API. After having stored your pipeline, it is common to retrieve its content, for checking its definition. This action can be done via the get pipeline API. To clean up our Elasticsearch cluster for obsolete or unwanted pipelines, we need to call the delete pipeline API with the ID of the pipeline.       

PUT, GET, and DELETE an Ingest Pipeline

The ingest part of every architecture is very sensitive, so the Elasticsearch team has created the possibility of simulating your pipelines without the need to store them in Elasticsearch.       

Simulating an Ingest Pipeline

Elasticsearch provides, by default, a large set of ingest processors. Their number and functionalities can also change from minor versions to extended versions for new scenarios. We will cover one of the most used for log analysis: the grok processor, which is well known to Logstash users.       

Built-in and Grok Processors

An ingest node can be held under very high pressure without causing problems to the rest of the Elasticsearch cluster. GeoIP allows us to map an IP address to a GeoPoint and other location data.       

Using the Ingest Attachment and GeoIP Plugin
About the Instructor
Packt Publishing
3.9 Average rating
8,175 Reviews
58,787 Students
686 Courses
Tech Knowledge in Motion

Packt has been committed to developer learning since 2004. A lot has changed in software since then - but Packt has remained responsive to these changes, continuing to look forward at the trends and tools defining the way we work and live. And how to put them to work.

With an extensive library of content - more than 4000 books and video courses -Packt's mission is to help developers stay relevant in a rapidly changing world. From new web frameworks and programming languages, to cutting edge data analytics, and DevOps, Packt takes software professionals in every field to what's important to them now.

From skills that will help you to develop and future proof your career to immediate solutions to every day tech challenges, Packt is a go-to resource to make you a better, smarter developer.

Packt Udemy courses continue this tradition, bringing you comprehensive yet concise video courses straight from the experts.