Disaster Recovery Engineer

Name: Disaster Recovery Engineer
Rating: 4.6 (81 reviews)

Plan like a pro and act ahead time

Highest Rated

Created byStone River eLearning

Last updated 6/2020

English

What you'll learn

After successfully completing this course, the students shall be able to:
Business Impact Analysis
Risk Analysis
BCP Strategies
Recovery Strategies
Implementation Phase
Testing and Exercise
Maintenance and Updating

Course content

8 sections • 121 lectures • 5h 38m total length

Introduction to BCP and Disaster Recovery0:42
Introduction1:46
Understand why disaster recovery matters—from cyber incidents to natural disasters—and learn who requests recovery, how to plan, and how to comply with laws and regulations.
Definitions4:07
The Nature of Disasters - Natural1:19
Explore natural disasters like volcanoes, hurricanes, floods, fires, wildfires, and storms, and note that insurance plans vary, with you paying only for what you're protected.
The Nature of Disasters - Man-Made2:46
Examine man-made disasters from terrorism and arson to IT failures such as hacking and ransomware, and learn how power outages, backups, ups and generators, and infrastructure disruptions threaten operations.
Business Continuity Planning (BCP)3:12
Business continuity planning (BCP) ensures an organization stays functional during disasters by prioritizing critical processes such as accounting, sales, and CRM, and conducting a business impact analysis.
Disaster Recovery Planning (DRP)1:21
BCP vs. DRP3:54
Discover the difference between bcp and drp by focusing on sustaining critical business processes during disruptions versus preparing equipment, procedures, and call trees for disaster recovery.
Other Types of Plans4:07
Phases of Continuity Planning2:13
Outline the high-level phases of building a business continuity plan, from project initiation and business impact analysis to strategy development, implementation, testing, and ongoing maintenance.
Four Phases of the BCP0:55
Define the project scope, perform a business impact assessment, write the plan, obtain management approval, and begin implementation; then test and maintain the plan with annual updates.
BCP Step 1: Project Scope and Planning3:02
Define the project scope in BCP step 1 by mapping the organization's operations and essential services, forming a cross-functional team, and planning resources and regulatory considerations for a multi-site program.
BCP Step 2: Business Impact Assessment1:58
Perform a business impact assessment by surveying priorities, quantifying process importance and budget share, identifying natural and manmade risks, evaluating disruption and cost, and prioritizing resources for a final report.
BCP Step 3: Continuity Planning2:27
Explore step 3 of the BCP: the actual continuity plan after identifying risks and critical processes, and demonstrate to management a strategy to survive disruptions.
BCP Step 4: Approval and Implementation1:48
Secure senior management approval and funding, then procure hardware and software, train staff, and implement with deployed resources, testing readiness, and ongoing adaptation.
Module 1 Review0:34
Quiz 1

Project Scope and Planning1:04
Project Initiation Phase3:51
Management Responsibility3:13
Management involvement strengthens business continuity, requiring a legally sound business case, recovery plans, and cost-benefit analysis to justify an alternative site and executive accountability.
Executive Management1:18
Functional Management0:58
Stage 1 - Organizations Operations and Support Services1:52
Stage 2 - BCP Team Selection4:16
Select a cross-functional BCP team from core services and support departments, including IT specialists, security, and legal reps, to create, implement, and test recovery plans.
Stage 2 - Senior Management and BCP0:42
Stage 3 - Resource Requirements2:01
Stage 3 - Resource Requirements2:27
Define resource requirements with provisions and processes to mitigate risks in disaster recovery. Protect people, data centers, and infrastructure through hardening, backups, alternative sites, and secure networks.
Stage 4 - Legal and Regulatory Requirements2:25
Stage 4 - Legal and Regulatory Requirements1:15
Explore legal and regulatory requirements for disaster recovery, including ISO/IEC standards, NIST guidelines for federal IT systems, and PCI DSS 3.2, with emphasis on recovery and continuity planning.
Other: Departmental Involvement0:45
Executive management initiates, approves, and funds disaster recovery project, while the VCP committee handles planning, implementation, and testing; functional business units participate in implementation and testing, collaborating on the plan.
Other: IT Departmental Issues1:21
Identify critical IT dependencies and backup needs, monitor data feeds to financial institutions during restoration, safeguard offsite media, and maintain security and up-to-date inventory, even in emergencies.
Module 2 Review0:49
Outline the scope, covering stage one operations and support, stage two BCB team with senior management approval, stage three resource assessment, and legal and regulatory landscape with an attorney.
Quiz 2

Introduction0:27
Explore the quantitative and qualitative types of business impact analysis, identify priorities, assess risk likelihood and impact, prioritize resources, and generate a final report.
Business Impact Analysis (BIA)1:01
Identify critical resources and threats, assess likelihood and impact, and translate potential losses into a dollar figure for management to guide disaster recovery decisions.
BIA Types1:43
BIA Task 1: Identify Priorities1:35
Identify priorities in a business impact analysis by listing and ranking critical processes. Senior business unit managers collect input via surveys, interviews, workshops, or videoconferences, depending on organizational complexity.
BIA Task 1: Identify Priorities0:24
Identify priorities by conducting a criticality survey that senior management fills out to highlight the most important systems or processes in their business units.
BIA Task 1: Identify Priorities1:04
Identify priorities through a quantitative BIA by listing assets with dollar values, determining the maximum tolerable downtime (MTD), and setting recovery time objectives for each business function.
BIA Task 2: Risk Identification3:24
Identify natural risks that disrupt facilities and data centers, from power outages and weather events to earthquakes and floods, and assess their impact using 100-year floodplain concepts.
BIA Task 2: Risk Identification2:31
Identify manmade risks such as civil unrest, theft, fires, power outages, and transportation disruptions, plus malware threats like ransomware targeting critical IT systems.
BIA Task 3: Likelihood Assessment and Impact Assessment2:32
BIA Task 3: Likelihood Assessment and Impact Assessment1:12
Assess likelihood and impact using quantitative metrics such as exposure factor, single loss expectancy, and annualized loss expectancy to estimate yearly asset losses.
BIA Task 3: Likelihood Assessment and Impact Assessment2:44
Assess qualitative likelihood and impact of security incidents on future business and trust. Explore cases of privacy breaches, downtime, and negative publicity shaping disaster recovery decisions.
BIA Task 4: Resource Prioritization0:50
Prioritize base business continuity resources by evaluating risks quantitatively and sorting them by impact to create a single, merged priority list from the BCB team and management reps.
BIA Task 5: BIA Reporting1:09
BIA Task 5: BIA Reporting1:14
Explain to management the loss types—direct, indirect, and delayed—and how reputational damage, lost sales, and competitive advantage affect business continuity; include legal violations and income costs in BIA reporting.
BIA Task 5: Maximum Tolerable Downtime (MTD)0:42
Identify the maximum tolerable downtime (MTD) for various activity priorities, from critical two hours to non-essential thirty days, with normal seven days, important 72 hours, and urgent 24 hours.
BIA Task 5: Interdependencies1:27
Analyze interdependencies across operations from manufacturing to IT, identify essential functions and bottlenecks, present quantitative and qualitative threat data with rationale and alternative recovery methods.
Demo - BCP from NIST - SP 800-3423:23
In the disaster recovery engineer course, this demo shows how to locate NIST SP 800-34 business continuity planning samples, and outlines the contingency planning process, templates, and testing for recovery.
Module 3 Review0:46
Summarizes the five phases of business impact analysis—identify priorities, identify risks, assess likelihood and impacts, prioritize resources, and report to management—alongside quantitative and qualitative analysis types.
Quiz 3

BCP Design and Development Phases0:24
BCP Policy Draft Cycle1:46
Document policy components and laws and standards and best practices for business continuity, perform gap analysis against business impact analysis, gain feedback and buy-in, and secure management approval before publishing.
Project Management1:07
Apply core project management principles to disaster recovery by securing funding and talent, then perform a SWOT analysis to identify strengths, weaknesses, opportunities, and threats.
BCP Step 3: Continuity Planning0:55
Develop and implement a business continuity plan through continuity planning using Microsoft Project, Primavera, or a spreadsheet, guided by the business impact analysis to minimize impact and risk.
Continuity Planning Sub-Task 1: Strategy Development1:03
Develop a strategy to bridge the gap between business impact assessment and continuity planning, prioritizing concerns from the prioritization exercise and evaluating MTV estimates to identify acceptable risks and mitigations.
Continuity Planning Sub-Task 2: Provisions and Processes2:26
BCP and DRP Objectives2:35
BCP and DRP Objectives Contd...1:10
Define BCP and DRP objectives, implement and test the plan, and document it for quick access. Conduct annual drills to train assigned personnel and improve the business continuity program.
Demo - FFIEC BCP for Financial Institutions15:32
Module 4 Review0:18
Review module 4 material for disaster recovery engineers, covering the VCP and related topics, and prepare for the next chapter.
Quiz 4

BCP Plan Approval and Implementation0:18
Explore the approval and implementation of a BCP plan, then cover training and education, documentation, maintenance, and testing.
BCP Step 4: BCP Plan Approval and Implementation1:33
Develop a valid, reasonable, feasible BCP plan with resources and a realistic timeline. Secure senior management approval and cross-unit buy-in for funding and implementation.
BCP Step 4: BCP Plan Approval and Implementation Contd...1:20
Coordinate resources, train staff, and procure goods and services to implement the approved BCP, while writing contracts and maintaining a flexible maintenance program to adapt to change.
Training and Education3:03
Invest in training and education for all personnel to align the team with the plan, prevent backlogs, and avoid costly remediation from untrained accounting.
Benefits of Documenting the BCP2:00
Document the BCP process to capture lessons, track failures and frictions, and provide an emergency reference plus a historical record for future personnel.
BCP Documentation1:53
Define BCP goals and priorities early in the documentation, reflect critical systems and business impact analysis, and communicate management's commitment to enterprise continuity to employees.
BCP Documentation Contd...2:30
BCP Documentation Contd...2:45
Examine how disaster recovery plans capture risks, acceptance decisions, and mitigation steps, with signed documents, responsible roles, notification procedures, vital records, backups, and emergency response guidelines.
BCP Documentation Contd...3:17
Wrap up the VCP documentation with annual update cycles, reflect organizational and technology changes, run formal, trained disaster exercises, and keep teams engaged with clear checklists and pre-read materials.
Module 5 Review1:05
Quiz 5

Testing and Validating the Plan0:53
Test and validate the disaster recovery plan after implementation, and schedule annual testing based on system criticality. Document test processes and types, and continuously update the plan.
Testing the DRP1:20
Develop and execute an annual DRP test plan, detailing test scenarios, contact roles, backup and restore procedures, and alignment with PCI, ISO 27000, and other frameworks.
Reasons to Test3:45
Explain why testing costs time and money, and how it informs management of recovery capabilities. Onboard leaders, verify backup site readiness, and train employees with clear procedures and emergency contacts.
Documenting Test Process3:35
Document the entire test process by scheduling, notifying participants, and outlining test steps, roles, scenarios, and required resources to ensure an efficient, well-prepared disaster recovery drill.
Test Types0:39
Explore test types from checklist to full interruption, including structure, walk-through simulation, and test pilot tests. Costs rise with depth, so choose tests based on needs and budget.
Test Types: Checklist1:03
Distribute the plan to management for review to verify critical processes and procedures, enabling a preliminary sanity check before a real test and identifying potential loss of key personnel.
Test Types: Structured Walk-Through1:12
Structural walkthrough, or tabletop exercise, brings managers, admins, operators, and business owners together to role-play a disaster scenario and validate the recovery plan.
Test Types: Simulation1:09
Conduct an assimilation simulation with real operational personnel working through simulated disasters to test immediate response, featuring actors and potential interruptions of non-critical activities.
Test Types: Parallel0:55
Assess parallel testing by running the test backup site alongside main production, ensuring hardware, software, and databases function, can take load, and operators can execute cold process procedures smoothly.
Test Types: Full-Interruption1:28
Conduct a full interruption test by disconnecting the main site to validate the disaster recovery plan, acknowledging its high cost and environmental needs.
Data Recovery Plan Maintenance1:18
Maintain the data recovery plan by updating contacts, technology changes, management changes, and personnel shifts; review processes and external connections annually using the organization's BGP template.
Emergency Response2:56
BCP and DRP Maintenance2:58
Demo - Ready.gov Business Continuity Planning Suite1:00:34
Learn how the ready.gov business continuity planning suite helps organizations create, test, and maintain a scalable disaster recovery and business continuity plan with templates and training.
Module 6 Review0:30
Review the importance of testing the DRP, document test processes, and explore different test types, while maintaining and updating the BCB and DRP plans to stay current.
Quiz 6

IT Recovery Strategies0:20
Operations: Recovery and Restoration7:50
Execute the recovery and restoration functions with trained teams to keep IT and business operations running. Securely restore data from backups and reestablish operations at the original or secondary site.
Hardware Failure3:47
Assess hardware reliability by examining MTBF as a predictor of failure, compare warranties for drive quality, and plan for MTTR, hot-swapping drives, and parity-driven data recovery in RAID setups.
Reasons to Develop a Comprehensive DR Plan5:56
Develop a comprehensive disaster recovery plan with a written document reviewed by top management, rehearsed and tested annually to minimize economic loss and reduce security exposures.
Reasons to Develop a Comprehensive DR Plan Contd...5:44
Develop a comprehensive disaster recovery plan to ensure orderly recovery, protect assets and personnel, and minimize liability and insurance costs through designated alternates and clear delegates.
Justifying DR Planning4:48
Justify disaster recovery planning by budgeting for personnel, outsourcing, network recovery, and offsite storage. Learn from the 1982 Chicago flood and 9/11 to justify distant backup sites.
Recovery Strategy Insurance Coverage1:32
Explore disaster recovery through insurance coverage for vital records and documents and crime insurance for money and securities, noting databases, servers, and software used in data processing are not covered.
Disaster/Property Insurance3:37
Explain property insurance options, including name peril and open peril policies, flood risk, and exclusions for disasters affecting businesses.
Disaster/Property Insurance Contd...3:32
Compare replacement cost and actual cost valuation policies, outlining depreciation and current replacement costs that determine payouts for damaged or destroyed property.
Recovery Strategy Business Unit and Functional Priorities2:15
Identify critical business units and functional priorities using the business impact analysis. Create a priority checklist with risk, cost, and mean time to recovery to set recovery objectives and milestones.
Recovery Strategy Crisis Management & Emergency Communications3:36
Work Group Recovery1:02
Implement disaster recovery plan by restoring work groups to their usual locations or to separate recovery facilities until the main operations facility is back online, ensuring continuity.
Backups and Offsite Storage1:25
Types of Backups3:06
Explain full backups as complete data copies that reset the archive bit; use incremental backups to copy only touched files, and differential backups to cover changes since the full backup.
Backup Tape Formats1:47
Assess characteristics and wear of backup tapes and drives, check specs, and clean drives to prevent oxidation and dust. Formats include digital data storage, DAT, DHT, DLT, and open altie.
Backup Best Practices2:59
Software Escrow Arrangements1:50
Additional DRP Factors3:29
Alternate Processing Sites1:04
Explore alternate processing sites for disaster recovery, from cold to hot, rolling, service bureau, and multiple sites, prioritizing lowest cost and locating about 20 miles from the main site.
Cold Site1:46
Maintain a cold site as an empty warehouse ready for equipment during an emergency, with no on-site hardware or IT resources, and bring in servers and tape drives.
Warm Site2:47
Compare warm site options to hot sites, noting power, cooling, and computers are available but data copies lag and patching isn’t current, roughly twelve hours to full operation.
Hot Site1:06
Mobile Sites1:51
Service Bureaus2:20
Multiple Processing Centers / Multiple Sites2:01
Organizations use multiple processing centers such as remote offices as redundant backup sites to reduce disaster impact when sites are far apart, though managing dispersed staff adds cost and complexity.
Redundant Site1:06
Mutual Aid/Assistance Agreements3:38
Mutual aid and assistance agreements enable organizations with similar hardware and software to share costs, management, information, equipment, and parts of facilities for outages or disruptions, under legally binding contracts.
Mutual Aid/Assistance Agreements Contd...1:32
Transaction Redundancy Implementations3:47
Implement transaction redundancy with database recovery and C2 logging to capture before and after images, enabling fault tolerance and rapid restoration; use off-site backups and remote journaling for integrity.
Transaction Redundancy Implementations Contd...2:15
Explore remote mirroring as a high-cost, high-speed transaction redundancy solution that replicates data to multiple locations in seconds, enabling a fully redundant database with clustered servers.
Recovery Plan Development1:22
Wrap up the recovery plan with an executive summary, department-specific plans, IT technical guides for backups and replicated databases, and team checklists, all bound in a red binder.
Recovery Plan Development Contd...3:21
Develop and validate disaster recovery plans with executive summaries, emergency response, clear communications, contact details, backups and offsite storage, restoration procedures, and training for disaster recovery and business continuity teams.
Cloud Strategies2:15
Module 7 Review0:21
Quiz 7

Requirements

Prior knowledge and experience in IT and support is required.

Description

The core aspect of any company crucial its success is information technology. All systems in a modern enterprise depend on the IT infrastructure. The need to have a detailed DR strategy is therefore KEY.

Being a Disaster Recovery Engineer (DRE) validates that you are prepared to work with businesses to create and implement disaster recovery and business continuity plans.

Those who are certified will be working, as a professional, with a business to prepare processes, policies and procedures to follow in the event of a disruption. You are important to keep a business’ critical operations running, which today heavily relies on its IT infrastructure. SIGN UP NOW!

Who this course is for:

This course is for those who have some knowledge or experience in IT and want gain expertise in Disaster Recovery of data and systems.

Disaster Recovery Engineer

What you'll learn

Explore related topics

Course content

Introduction to BCP and Disaster Recovery16 lectures • 36min

Project Scope and Planning15 lectures • 28min

Business Impact Analysis18 lectures • 48min

BCP Design and Development Phases10 lectures • 27min

BCP Plan Approval and Implementation10 lectures • 20min

Testing and Validating the Plan15 lectures • 1hr 24min

IT Recovery Strategies34 lectures • 1hr 31min

Conclusion and Next Steps3 lectures • 4min

Requirements

Description

Who this course is for: