A key distinction in the realm of disaster recovery is the one between failover and failback. Both terms describe two sides of the same coin, complementary processes that are often brought together.
However, their effects and purposes couldn't be more different. Both play critical roles in ensuring business continuity and disaster recovery, making it essential to understand what they are and how they differ.
Failover is a business continuity operation that ensures continued access to a system by fully transitioning to another instance of that system. This secondary system is designed to be resilient, ideally unaffected by the event that compromised the primary system.
Put simply, failover occurs when connectivity is switched from one system instance to another. This can happen in various ways, including:
Editor's Note:
This guest blog post was written by the staff at Pure Storage, an US-based publicly traded tech company dedicated to enterprise all-flash data storage solutions. Pure Storage keeps a very active blog, this is one of their "Purely Educational" posts that we are reprinting here with their permission.
The critical point about failover is that it involves a complete migration of logical or physical access from the primary system, server, or hosting location to a secondary one.
While other processes, such as load balancing, may distribute partial connectivity between system instances or components, they do not qualify as failover because they do not represent a full cutover.
Failback is the quintessential disaster recovery operation. It involves a full migration back to the production status quo – a recovery if you will – at the validated conclusion of a disaster.
Failback occurs when a system reverts back to the primary environment after the root cause of a disruption has been addressed. In practice, this looks like a failover, but in reverse. Once the primary system is restored, access is pointed to that system, and the standby is deactivated.
This reversion is a critical distinction. Some organizations may have complete standby systems for critical applications, which permit full operations on the standby system. In that case, the standby can rightfully be considered the primary and the repaired former primary the new standby.
Failover is critical in a business continuity event because it keeps operations running. By having a system to which your business can transition when a primary system is unavailable, you're able to continue doing business. People can work, revenue streams are preserved, and customers can be served.
Without failover, these functions could grind to a halt, leading to significant disruption. Many organizations depend on technology for critical processes, and when those processes are unavailable, analog alternatives may be insufficient or entirely obsolete. Failover ensures that even in a disaster, the business keeps moving.
Failback comes into play once the need for failover ends. As the disaster is resolved, failback allows the organization to return to normal operations. Typically, failback is necessary when the standby system cannot sustain operations as effectively as the primary system. For instance, a standby system may not be a full replica of the primary system and might be designed only for temporary use during an emergency.
For mission-critical systems, some organizations may build a standby system that is a full replica of the primary. While costly, this approach mitigates the risks of diminished functionality during disasters.
In an ideal world, every business would maintain two fully operational environments: a primary environment and an identical standby environment. This setup would allow for seamless transitions during disasters, ensuring that business operations are completely unaffected.
However, that model can effectively double an IT budget: two sets of endpoints, two sets of servers, two sets of cloud environments, two sets of data, staff to support that both in IT and business operations, etc. It's costly and inefficient for any company, to the point where no company truly maintains that support model.
Instead, most organizations opt for a failover and failback model because it balances cost and efficiency. With this approach, the standby environment is designed to sustain critical operations during a disaster, even if it's not as robust as the primary system. This makes it more economical, less work is duplicated, and the risk of data loss or impact is lower.
It's crucial to maintain a well-designed secondary environment. Cutting costs too deeply on a standby system can result in inefficiencies or financial losses if critical operations are disrupted. Striking the right balance between cost and functionality is key.
If uninterrupted business operations are essential, then a strategic failover and failback plan is not optional – it's a necessity.
文章
62343
浏览
1329
获赞
3
Meme celebrates the internet's favorite highly versatile actors
Some actors are more versatile than others. For scientific proof, please see the following meme.AfteKellyanne Conway unleashes trolls on Twitter's head of site integrity
Kellyanne Conway is not above a good old-fashioned Twitter harassment campaign. As counselor to PresMark Zuckerberg wants to register 4 million new voters but keep showing them Trump's posts
As the U.S. presidential election approaches, tech companies' responsibility to halt misinformationSurface Headphones 2 review: Dull design for quality noise cancelation
Sometimes things don't need fixing, they just need a timely reintroduction.When Microsoft released iChinese hackers counted on no one clicking 'update' in decade
Uh, maybe stop asking your computer to remind you tomorrow. The Department of Justice unsealed an inHow to dirty talk: A guide for beginners in quarantine
In the era of social-distancing, words have become one of our best replacements for physical intimacApple will let you change the default email and web browser app in iOS 14
There was a lot to take in at Apple’s annual Worldwide Developers Conference this year, so youPartying Americans just got a reality check from the coronavirus
On June 6, after over 100,000 Americans had already been killed by the new coronavirus, over a dozenCloudflare goes down, and takes the internet's security blanket with it
When Cloudflare has problems, the rest of the internet can't be that far behind. The company whose eTesla reduces price of nearly every car by $2,000 or more
Tesla cars became cheaper overnight. The electric car company decided to give its prices a proper trElon Musk reschedules Tesla 'Battery Day' for later this year
Last week, Tesla announced it had managed to increase the range of the Model S to 402 miles thanks tThe 4 most exciting announcements from WWDC 2020
The 2020 edition of Apple's Worldwide Developers Conference wasn't exactly full of bombshells, but wUK government ditches 'porn block'. What happens now?
The UK government has shelved its controversial "porn block", which would have introduced legislatioTwitter's web app now supports saved drafts and scheduled tweets
Twitter this week announced a new feature that lets folks save tweets as a draft and schedule postsHow Twitter and Trump's toxic relationship came crashing down
When it comes to passionate love affairs, the one between Donald Trump and Twitter lasted quite a wh