Comments
bruce.armstrong wrote: Somebody just said it better than I did, and with more chops to say it: Open Letter to Mark Zuckerberg, Sheryl Sandberg & Facebook Mobile
Cloud Expo on Google News

SYS-CON.TV
Cloud Expo & Virtualization 2009 East
PLATINUM SPONSORS:
IBM
Smarter Business Solutions Through Dynamic Infrastructure
IBM
Smarter Insights: How the CIO Becomes a Hero Again
Microsoft
Windows Azure
GOLD SPONSORS:
Appsense
Why VDI?
CA
Maximizing the Business Value of Virtualization in Enterprise and Cloud Computing Environments
ExactTarget
Messaging in the Cloud - Email, SMS and Voice
Freedom OSS
Stairway to the Cloud
Sun
Sun's Incubation Platform: Helping Startups Serve the Enterprise
POWER PANELS:
Cloud Computing & Enterprise IT: Cost & Operational Benefits
How and Why is a Flexible IT Infrastructure the Key To the Future?
Click For 2008 West
Event Webcasts
Importance of Having DR Procedures
BlackBerry outage highlights poor state of disaster recovery and backup procedures

A recent BlackBerry outage is the nightmare for both RIM and its users. Nature and mass popularity of Blackberry service  make this particular outage highly visible. There are many other in-house outages we never hear about. Companies prefer to keep silent about them, if they can. The only insight we get into the poor state of DR procedures is in some public cases like this one, or from limited personal experience. In my Oracle database infrastructure consulting career I have seen several serious production outages and was part of a few data recovery efforts.

Backups and DR procedures are just not as high on IT priority list as development, production support and new projects are. Backups and DR are frequently considered a chore, mechanical stuff, uninteresting work. Oracle database works fine most of the time, it is very reliable and robust product. When disaster strikes, be it human mistake or external cause (hardware failure), it is often difficult to recover from it.

Fairly large percentage of Oracle database backups in average enterprise fail every day and nobody even notices. Recovery of failed Oracle production database is not a simple task, most of database administrators can not do it properly. There are many possible variations and cases during Oracle database recovery. Smart, experienced, collected and well trained DBA will perhaps be able to recover the database, if all elements are properly aligned and available. Loss of data, incomplete recovery or even ad-hoc rebuild of production environments is very real possibility.

Why is the current state of backups and DR so poor?
Hardware and software are inherently unstable.  Switches fail, SANs/disks fail, software is buggy, systems are complex, staff is lacking skills - it all sometimes creates the perfect storm which makes production systems go down. Backups are poorly designed and executed, many companies still backup to tape. DR facilities and procedures are supposed to provide protection against production system failures and human mistakes.DR sites are half-ready, out of sync with production environments they are supposed to shadow. Staff is not specialized enough and not well trained either. Many companies still do not have dedicated training environment for DBAs where they can test various recovery scenarios, apply patches, test upgrades, learn new features etc.

How to improve ?
Start with better designed, executed and monitored backups and DR procedures. Perform backup to disk, as opposed to tape. Test backups and restores and hire skilled staff. It is better to have or hire smaller team of highly skilled DBAs then to have large team that you can not rely on. If you have no internal resources then use professional specialized service to design and manage backups and DR for you. Perform perpetual DR drills where various scenarios are tested. Set up training environments for DBAs to test for different scenarios - applying patches, upgrades, restores. Be aware that black swans - rare, negative events, have huge impact, inversely proportional to their frequency, and prepare for them.

About Ranko Mosic
Ranko Mosic is consultant - provider of remote Oracle Database Administration Services. He has more than 20 years of experience in IT industry in various consulting roles throught North America. He can be reached at ranko.mosic@gmail.com

Latest Cloud Developer Stories
HP said Wednesday that it would lay off 8% of workforce, 27,000 people, by October or 2014. It figures the move will save it $3 billion-$3.5 billion and expects to re- invest the money in cloud, security and Big Data.
With Cloud Expo 2012 New York (10th Cloud Expo) now under three weeks away, what better time to introduce you in greater detail to the distinguished individuals in our incredible Speaker Faculty for the technical and strategy sessions at the conference... We have technical and...
What do the CTOs of the CIA and the U.S. Dept. of Justice and the CIO of the National Reconnaissance Office have in common with the CEOs of Eucalyptus, GoGrid, ActiveState, Appcara, OpSource and Nortonworks, the CTOs of Rackspace, SoftLayer, SOA Software and AppZero, the Founder ...
Grid Dynamics, an eCommerce technology solutions company, and GridGain Systems, makers of an open source in-memory platform for Big Data processing, on Wednesday announced the expansion of their partnership which began in 2008. Grid Dynamics provides personalization and big data...
ServerCentral, Chicago’s leading provider of colocation, cloud, network connectivity, and managed services, announced on Tuesday that its high performance cloud will debut on June 11 at the 10th International Cloud Expo, held June 11-14 at the Javits Center in New York City. “Se...
Subscribe to the World's Most Powerful Newsletters
Subscribe to Our Rss Feeds & Get Your SYS-CON News Live!
Click to Add our RSS Feeds to the Service of Your Choice:
Google Reader or Homepage Add to My Yahoo! Subscribe with Bloglines Subscribe in NewsGator Online
myFeedster Add to My AOL Subscribe in Rojo Add 'Hugg' to Newsburst from CNET News.com Kinja Digest View Additional SYS-CON Feeds
Publish Your Article! Please send it to editorial(at)sys-con.com!

Advertise on this site! Contact advertising(at)sys-con.com! 201 802-3021

SYS-CON Featured Whitepapers
ADS BY GOOGLE

Breaking Cloud Computing News
Acceleware® Ltd. ("Acceleware" or the "Company") (TSX VENTURE:AXE), a leading developer of high perf...