From 2af3c8b08313f70a044ee61d3edde833136ec415 Mon Sep 17 00:00:00 2001
From: Robin Clark <r.clark@energytechnologycontrol.com>
Date: Tue, 9 Nov 2010 19:25:15 +0000
Subject: [PATCH] FMEDA and SIL

+ general tidying
---
 fmmd_concept/fmmd_concept.tex | 130 +++++++++++++++++++++++++---------
 1 file changed, 97 insertions(+), 33 deletions(-)

diff --git a/fmmd_concept/fmmd_concept.tex b/fmmd_concept/fmmd_concept.tex
index fdd3441..696f031 100644
--- a/fmmd_concept/fmmd_concept.tex
+++ b/fmmd_concept/fmmd_concept.tex
@@ -246,7 +246,7 @@ system level outcomes.
 
 \subsection { FMEA }
 
-
+\label{pfmea}
 This is an early static analysis methodology, and concentrates
 on SYSTEM level errors which have been investigated.
 The investigation will typically point to a particular failure
@@ -270,7 +270,7 @@ a prioritised `todo list', with higher the $RPN$ values being the most urgent.
 \paragraph{note.} FMEA is sometimes used in its literal sense, that is to say
 failure Mode effects Analysis, simply looking at a systems internal failure
 modes and determing what may happen as a result.
-FMEA described in this section is sometimes called `production FMEA'.
+FMEA described in this section (\ref{pfmea}) is sometimes called `production FMEA'.
 
 \subsection{FMECA}
 
@@ -319,39 +319,105 @@ Failure Modes, Effects, and Diagnostic Analysis (FMEDA).
 This is a process that takes all the components in a system,
 and from the failure modes of those components, the investigating engineer
 must tie them to possible SYSTEM level events/failure modes.
+This technique 
+evaluates and the product’s self-diagnostic ability,
+The calculations and procedure for FMEDA are
+described in EN61508 Part 2 Appendix C \cite{en61508}[Part 2 App C].
+The following gives an outline of the procedure.
+
+\paragraph{FMEA}
+The first stage is to apply FMEA to the SYSTEM.
+Within the product all failure rates of individual 
+components contribute to the overall product failure rate.
+Failure rates of individual components in the SYSTEM
+are calculated based on component type and
+environmental conditions.
+
+\paragraph{Overall SYSTEM failure rate}
+Product failure rate is the sum of all component
+failure rates. This is the sum of safe and unsafe
+failures.
+
+\paragraph{Self Diagnostics}
+We  next evaluate the SYSTEMS’s self-diagnostic ability.
+
+Each component’s failure mode and its failure rate are listed.
+Failure modes are classified  as safe or dangerous\footnote{Again this is taking a component failure mode and determing
+how it will react with any other components in the SYSTEM and making a decision
+based on hueistics.}.
+detectable failures are labelled `$\lambda_D$' and safe failures `$\lambda_S$' by EN61508.
+
+\paragraph{Determine Detectable and Undetecable Failures}
+Each safe and dangerous failure mode is determined as detectable or un-detectable by the SYSTEMS’s 
+self checking features.
+%
+The result is a list of all components, their failure modes, the failure mode classification 
+as Safe-Detected (SD), Safe-Undetected (SU), Dangerous-Detected (DD) or Dangerous-Undetected (DU),
+and the failure rate of each classification using the failure rate 
+prediction results ($\lambda_{SD}$, $\lambda_{SU}$, $\lambda_{DD}$, $\lambda_{DU}$).
+
+Because some failure modes may  not be discovered theoretically during the 
+next step  is to investigate using an actual working SYSTEM. 
+This requires the deliberate introduction
+of failures; any new failures discovered at this stage are classified 
+$\lambda_{SD}$, $\lambda_{SU}$, $\lambda_{DD}$, $\lambda_{DU}$
+and added to the result set.
+%SD, SU, DD, DU.
+
+\paragraph{Diagnostic Coverage.}
+The diagnostic coverage is simply the ratio
+of the dangerous detected probabilities
+against the probability of all dangerous failures,
+and is normally expressed as a percentage.
+
+$$ DiagnosticCoverage = \Sigma\lambda_{DD} / \Sigma\lambda_D $$
+
+The diagnostic coverage for safe failures is given as
+
+$$ SF = \frac{\Sigma\lambda_SD}{\Sigma\lambda_S} $$
+
+
+\paragraph{Safe Failure Fraction.}
+A key concept in  FMEDA is Safe Failure Fraction (SFF).
+This is the ratio of safe  and dangerous detected failures
+against the safe and dangerous failure probabilities. 
+Again this is usually expressed as a percentage.
+
+$$ SFF = \big( \Sigma\lambda_S + \Sigma\lambda_{DD} \big) / \big( \Sigma\lambda_S + \Sigma\lambda_D \big) $$
+
+This is the ratio of 
+Step 4 Calculate SFF, SIL and PFD
+The SIL level of the product is finally determined from the Safe Failure Fraction (SFF) and the Probability of Failure on Demand (PFD). The following formulas are used.
+SFF = (lSD + lSU + lDD) / (lSD + lSU + lDD + lDU)
+PFD = (lDU)(Proof Test Interval)/2 + (lDD)(Down Time or Repair Time)
 
 % Often a given component failure mode there will be a $\beta$ value, the
 % probability that the component failure mode will cause a given SYSTEM failure.
 
 \paragraph{Risk Mitigation}
 
-The component may be mitigated by a vatriety of factors
-\begin{itemize}
-\item Automatic checking 
-\item Down rating
-\item Coverage of self checking
-\end{itemize}
+The component may be have its risk factor 
+reduced by the checking interval (or $\tau$ time between self checking procedures).
 
 Ultimately this technique calculates a risk factor for each component.
 The risk factors of all the components are summed and 
 give a value for the `safety level' for the equipment in a given environment.
 
-%%-he FMEDA technique considers
-%%-• All components of a design,
-%%-• The functionality of each component,
-%%-• The failure modes of each component,
-%%-• The impact of each component failure mode on the product functionality,
-%%-• The ability of any automatic diagnostics to detect the failure,
-%%-• The design strength (de-rating, safety factors) and
-%%-• The operational profile (environmental stress factors).
+\paragraph{Classification into Safety Integrity Levels (SIL).}
+There are four SIL levels, from 1 to 4 with 4 being the highest safety level.
+In addition to probablistic risk factors, the 
+diagnostic coverage and SFF
+have threshold bands beoming stricter for each level.
+Software techniques and constraints are 
+also become stricter for each SIL level.
 
-This uses MTFF and other statistical models to determine the probability of
-failures occurring.
+FMEDA uses MTFF and other statistical models to determine the probability of
+failures occurring, and provide an adaquate risk level.
 %
-A component failure mode, given its MTTF
-the probability of detecting the fault and its safety relevant validation time $\tau$,
-contributes a simple risk factor that is summed
-in to give a final risk result. 
+%A component failure mode, given its MTTF
+%the probability of detecting the fault and its safety relevant validation time $\tau$,
+%contributes a simple risk factor that is summed
+%in to give a final risk result. 
 %
 Thus a statistical
 model can be implemented on a spreadsheet, where each component
@@ -408,14 +474,15 @@ safety level zones \cite{en61508}. This is a vague way of determining
 safety.
 
 The Statistical Analysis methodology is the core philosophy
-of the Safety Integrity Levels (SIL) of EN61508 \cite{en61508}.
+of the Safety Integrity Levels (SIL) ebodied in EN61508 \cite{en61508}
+and its international analog standard IOC5108.
 
 
 
 \subsubsection{ FMEDA weaknesses }
 \begin{itemize}
 \item Possibility to miss the effects of failure modes at SYSTEM level.
-\item Statistical nature allows critical failures considered acceptable for given S.I.L. level.
+\item Statistical nature allows a proportion of undetected failures  for given S.I.L. level.
 \item Allows a small proportion of `undetectable' error conditions.
 \item No possibility to model base component level double failure modes.
 \end{itemize}
@@ -427,14 +494,9 @@ of the Safety Integrity Levels (SIL) of EN61508 \cite{en61508}.
 \item It should be easy to integrate mechanical, electronic and software models \cite{sccs}[pp.287].
 \item It should be re-usable, in that commonly used modules can be re-used in other designs/projects.
 \item It should have a formal basis, that is to say, it should be able to produce mathematical proofs
-for its results.
-\item It should be capable of producing reliability and danger evaluation statistics.
+for its results, such as system level error causation trees, reliability and safety statistics.
 \item It should be easy to use, Ideally using a graphical syntax (as oppossed to a formal mathematical one).
 \item From the top down, the failure mode model should follow a logical de-composition of the functionality
-for its results.
-\item It should be capable of producing reliability and danger evaluation statistics.
-\item It should be easy to use, ideally using a graphical syntax (as oppossed to a formal mathematical one).
-\item From the top down, the failure mode model should follow a logical de-composition of the functionality
 to smaller and smaller functional modules \cite{maikowski}.
 \item Multiple failure modes may be modelled from the base component level up.
 \end{itemize}
@@ -482,7 +544,8 @@ Top down de-compositon applies functional
 de-composition, because it seeks to break the system down
 into manageable and separately testable entities.
 A second justification for this is that the design process for a product requires both top down and bottom-up
-thinking.
+thinking. To analyse a system from the bottom-up is a useful
+design validatio process in itself \cite{sommerville}.
 
 
 \paragraph{Problem with functional group hierarchy}
@@ -550,7 +613,7 @@ failure mode behaviour, but can check that all failure modes in
 the hierarchy have been considered and tied to causing symptoms.
 
 
-\paragraph{Incremental Stages and {\dcs}}.
+\paragraph{Incremental Stages and \dcs}.
 We can use incremental stages to build the hierarchy.
 We can take small {\fg}s of components, where the {\fg}
 is a small set of components that perform a simple
@@ -594,7 +657,8 @@ the number of failure modes decreases (or exceptionally stays the same) at each
 %
 This decreasing of the number of failure modes is bourne out {\irl}.
 Of the thousands of component failure modes in a typical product
-there are generally only a handful of SYSTEM level failure modes. 
+there are generally only a handful of SYSTEM level failure modes
+(or top level `symptoms' of underlying failures). 
 %
 
 \subsection{Outline of the FMMD process}