nl There is also a DUTCH VERSION of this site

Google

More articles

01 Feb - 28 Feb 2010
01 Jan - 31 Jan 2010
01 Dec - 31 Dec 2009
01 Oct - 31 Oct 2009
01 Sep - 30 Sep 2009
01 Aug - 31 Aug 2009
01 Jun - 30 Jun 2009
01 Apr - 30 Apr 2009
01 Mar - 31 Mar 2009
01 Jan - 31 Jan 2009
01 Dec - 31 Dec 2008
01 Oct - 31 Oct 2008
01 Sep - 30 Sep 2008
01 Aug - 31 Aug 2008
01 Jul - 31 Jul 2008
01 Jun - 30 Jun 2008
01 May - 31 May 2008
01 Apr - 30 Apr 2008
01 Mar - 31 Mar 2008
01 Feb - 28 Feb 2008
01 Jan - 31 Jan 2008
01 Dec - 31 Dec 2007
01 Nov - 30 Nov 2007
01 Oct - 31 Oct 2007
01 Sep - 30 Sep 2007
01 Aug - 31 Aug 2007
01 Jul - 31 Jul 2007
01 Jun - 30 Jun 2007
01 May - 31 May 2007
01 Apr - 30 Apr 2007
01 Mar - 31 Mar 2007
01 Feb - 28 Feb 2007
01 Jan - 31 Jan 2007
01 Dec - 31 Dec 2006
01 Nov - 30 Nov 2006
01 Oct - 31 Oct 2006
01 Sep - 30 Sep 2006
01 Aug - 31 Aug 2006


Links

Deze site wordt gehost bij
ATN-Networks

Aanbevolen
Genootschap voor Informatie Architecten
Rene Hamberg
Bas Varkevisser
Ruth Malan
l-rs.org
Informatiekundig bekeken
Bredemeyer Consulting
Gaudi site
Hans Bot ArchITectuur Bedrijven
Security.nl

Nick Malik
Leo de Sousa



Misc

Powered by Pivot - 1.40.1: 'Dreadwind' 
XML: RSS Feed 
XML: Atom Feed 

Technorati Profile

99,999% availability

27 October 06 - 12:49
Area: default - Link to this article

I sometimes hear te strangest figures when availability is discussed.

"The system must be available 95% of the time" or "The system shall never fail" or "We will only accept 99.999% uptime (5 nines)".

Usually these figures are not based on calculations and/or people have no idea about the cost of reaching these numbers.

To make things clear: All hardware will break. The question is not if something breaks, but when.

Some calculations:

There are 24*365=8760 hours in one year. 1% of this is 87,6 hours. A system with an availability of 95% can be unavailable for 438 hours per year. This means 18 full days per year!

On the other end of the horizon is the 99.999% demand. Here a system may only be unavailable for 5 minutes per year, including any repair times! The 99.999% (five nines) is a popular number these days.

Availability can be calculated by multiplying the MTBF with the MTTR.

MTBF

For hardware usually an MTBF is stated (Mean Time Between Failures). A Seagate Cheetah hard disk for instance, has an MTBF of 1.200.000 hours. This means that on average the hard disk will fail every 136 years. A system is built with many components, each with it's own MTBF. Imagine a disk cabinet with 64 disks (this is not unusual in a SAN). In such a setup, every 2 years one of these disks will fail, even with the large MTBF of the Seagate disks.

While disks are the components that fail the most (because they contain many moving parts), other components of a system also have a MTBF. For instance servers (mainly the Fans in the power supplies), routers, switches, and even cabling.

The MTBF figure is mainly a marketing instrument. How can Seagate prove that their disks will actually on average fail every 136 years? Usually this is done using simulations and tests under stress-conditions.

MTTR

Apart from MTBF, there is MTTR: Mean Time To Repair. This is the time needed to fix or replace a broken system(part). Usually the MTTR is kept low by having a service contract with the supplier of the part. Sometimes spare parts can be kept on-site to keep the MTTR low.

Software

Except for hardware, systems contain software. Usually the MTBF and MTTR for software components can not be calculated easily. No programmer will state the MTBF of the software she wrote. Who knows the MTBF of Windows? Of Linux? SAP? Your in-house developed software?

The human aspect

Usually only 20% of the causes of failures are technology failures. In 80% of the cases, human errors are the reason. For instance, a system administrator accidentally pulls a wrong cable or enters an incorrect command. Users sometimes delete inportant (system) files.

Of course it helps to have highly qualified and trained personnel, with a healthy sense of responsibility. Errors are human, however, and there is no MTBF to be calculated here.

Conclusion

As stated above, availability figures of a system are very hard to guarantee. MTBF and MTTR are either unknown, can not be calculated, or are exaggerated.

Availability can only be reported on afterwards, when a system has run for some years. With this knowledge afterwards, new systems can be designed which will probably have a higher availability.

Of course , in the last years much knowledge is gained on how to design high-available systems, for instance by using clustering, failover, redundancy, structured programming, avoiding Single Points of Failures (SPOF's) and implementing proper system management.

IT architects (or security architects for that matter) are responsible for giving availability the attention it deserves. Because the costs of being not-available can be very high, a good match between IT and business is crucial.

IT architects: what's in a name

13 October 06 - 00:00
Area: default - Link to this article

IT architect is a relatively new profession. Because it's so new, there is little consensus about the names of architects.

Names I found on the Internet are (among others): 

  • Enterprise architect
  • Information architect
  • Software architect
  • Infrastructure architect
  • Security architect
  • Project architect
  • Systems architect
  • Application architect

Of course, no one knows exactly what these people do and what their exact expertise is.

I use the following terminology, that was proposed by the Dutch GIA (Genootschap voor Informatie Architecten) and the SCIA (Society for the Certification of Information Architects):

  • Information architect – This architect uses information as a production factor. These are the people that know what information a company uses, and how to use this information to improve businesses.
  • IT-Business architect – These people are concerned with the alignment of the IT environment which contains information, with the business.
  • IT architect – These people can architect IT infrastructures.

The GIA and the SCIA are working on a so-called BOK, a Body Of Knowledge, with the skills of these architects, so it will be clear to everyone what to expect from a certain architect.


More articles: See left pane.

About Sjaak Laan

Sjaak Laan

I am 45 years old and married with Angelina. We have 3 children of 11, 6 and 4 years old. We live in The Netherlands, in a place called Drachten

I work for Logica as Principal IT Architect. I have 20 years IT experience.

I own the following certificates:

itac_l Master Certified IT Architect



CISSP_logo CISSP (Certified Information Systems Security Professional)



TOGAF8_Certified_web TOGAF Certified Architect


I am a member of the:


I manage my business contacts using Linkedin.


I can be reached through sjaak.laan@gmail.com.

This site states my opinion only, and not nessecarily the opinion of my employer or of the clients I work for.