Recent from talks
Nothing was collected or created yet.
Common cause and special cause (statistics)
View on WikipediaThis article needs additional citations for verification. (February 2013) |
| Type of variation | Synonyms |
|---|---|
| Common cause | Chance cause Non-assignable cause Noise Natural pattern Random effects Random error |
| Special cause | Assignable cause Signal Unnatural pattern Systematic effects Systematic error |
Common and special causes are the two distinct origins of variation in a process, as defined in the statistical thinking and methods of Walter A. Shewhart and W. Edwards Deming. Briefly, "common causes", also called natural patterns, are the usual, historical, quantifiable variation in a system, while "special causes" are unusual, not previously observed, non-quantifiable variation.
The distinction is fundamental in philosophy of statistics and philosophy of probability, with different treatment of these issues being a classic issue of probability interpretations, being recognised and discussed as early as 1703 by Gottfried Leibniz; various alternative names have been used over the years. The distinction has been particularly important in the thinking of economists Frank Knight, John Maynard Keynes and G. L. S. Shackle.
Origins and concepts
[edit]In 1703, Jacob Bernoulli wrote to Gottfried Leibniz to discuss their shared interest in applying mathematics and probability to games of chance. Bernoulli speculated whether it would be possible to gather mortality data from gravestones and thereby calculate, by their existing practice, the probability of a man currently aged 20 years outliving a man aged 60 years. Leibniz replied that he doubted this was possible:
Nature has established patterns originating in the return of events but only for the most part. New illnesses flood the human race, so that no matter how many experiments you have done on corpses, you have not thereby imposed a limit on the nature of events so that in the future they could not vary.
This captures the central idea that some variation is predictable, at least approximately in frequency. This common-cause variation is evident from the experience base. However, new, unanticipated, emergent or previously neglected phenomena (e.g. "new diseases") result in variation outside the historical experience base. Shewhart and Deming argued that such special-cause variation is fundamentally unpredictable in frequency of occurrence or in severity.
John Maynard Keynes emphasised the importance of special-cause variation when he wrote:
By "uncertain" knowledge ... I do not mean merely to distinguish what is known for certain from what is only probable. The game of roulette is not subject, in this sense, to uncertainty ... The sense in which I am using the term is that in which the prospect of a European war is uncertain, or the price of copper and the rate of interest twenty years hence, or the obsolescence of a new invention ... About these matters there is no scientific basis on which to form any calculable probability whatever. We simply do not know!
Definitions
[edit]Common-cause variations
[edit]Common-cause variation is characterised by:
- Phenomena constantly active within the system;
- Variation predictable probabilistically;
- Irregular variation within a historical experience base; and
- Lack of significance in individual high or low values.
The outcomes of a perfectly balanced roulette wheel are a good example of common-cause variation. Common-cause variation is the noise within the system.
Walter A. Shewhart originally used the term chance cause.[1] The term common cause was coined by Harry Alpert in 1947. The Western Electric Company used the term natural pattern.[2] Shewhart called a process that features only common-cause variation as being in statistical control. This term is deprecated by some modern statisticians who prefer the phrase stable and predictable.
Special-cause variation
[edit]Special-cause variation is characterised by:
- New, unanticipated, emergent or previously neglected phenomena within the system;
- Variation inherently unpredictable, even probabilistically;
- Variation outside the historical experience base; and
- Evidence of some inherent change in the system or our knowledge of it.
Special-cause variation always arrives as a surprise. It is the signal within a system.
Walter A. Shewhart originally used the term assignable cause.[3] The term special-cause was coined by W. Edwards Deming. The Western Electric Company used the term unnatural pattern.[2]
Examples
[edit]Common causes
[edit]- Inappropriate procedures
- Poor design
- Poor maintenance of machines
- Lack of clearly defined standard operating procedures
- Poor working conditions, e.g. lighting, noise, dirt, temperature, ventilation
- Substandard raw materials
- Measurement error
- Quality control error
- Vibration in industrial processes
- Ambient temperature and humidity
- Normal wear and tear
- Variability in settings
- Computer response time
Special causes
[edit]- Faulty adjustment of equipment
- Operator falls asleep
- Defective controllers
- Machine malfunction
- Fall of ground
- Computer crash
- Deficient batch of raw material
- Power surges
- High healthcare demand from elderly people
- Broken part
- Insufficient awareness
- Abnormal traffic (click fraud) on web ads
- Extremely long lab testing turnover time due to switching to a new computer system
- Operator absent[4]
Importance to industrial and quality management
[edit]A special-cause failure is a failure that can be corrected by changing a component or process, whereas a common-cause failure is equivalent to noise in the system and specific actions cannot be made to prevent the failure.
Harry Alpert observed:
A riot occurs in a certain prison. Officials and sociologists turn out a detailed report about the prison, with a full explanation of why and how it happened here, ignoring the fact that the causes were common to a majority of prisons, and that the riot could have happened anywhere.
Alpert recognises that there is a temptation to react to an extreme outcome and to see it as significant, even where its causes are common to many situations and the distinctive circumstances surrounding its occurrence, the results of mere chance. Such behaviour has many implications within management, often leading to ad hoc interventions that merely increase the level of variation and frequency of undesirable outcomes.
Deming and Shewhart both advocated the control chart as a means of managing a business process in an economically efficient manner.
Importance to statistics
[edit]Deming and Shewhart
[edit]Within the frequency probability framework, there is no process whereby a probability can be attached to the future occurrence of special cause.[citation needed] One might naively ask whether the Bayesian approach does allow such a probability to be specified. The existence of special-cause variation led Keynes and Deming to an interest in Bayesian probability, but no formal synthesis emerged from their work. Most statisticians of the Shewhart-Deming school take the view that special causes are not embedded in either experience or in current thinking (that's why they come as a surprise; their prior probability has been neglected—in effect, assigned the value zero) so that any subjective probability is doomed to be hopelessly badly calibrated in practice.
It is immediately apparent from the Leibniz quote above that there are implications for sampling. Deming observed that in any forecasting activity, the population is that of future events while the sampling frame is, inevitably, some subset of historical events. Deming held that the disjoint nature of population and sampling frame was inherently problematic once the existence of special-cause variation was admitted, rejecting the general use of probability and conventional statistics in such situations. He articulated the difficulty as the distinction between analytic and enumerative statistical studies.
Shewhart argued that, as processes subject to special-cause variation were inherently unpredictable, the usual techniques of probability could not be used to separate special-cause from common-cause variation. He developed the control chart as a statistical heuristic to distinguish the two types of variation. Both Deming and Shewhart advocated the control chart as a means of assessing a process's state of statistical control and as a foundation for forecasting.
Keynes
[edit]Keynes identified three domains of probability:[5]
- frequency probability;
- subjective or Bayesian probability; and
- events lying outside the possibility of any description in terms of probability (special causes)
and sought to base a probability theory thereon.
Common mode failure in engineering
[edit]This section possibly contains original research. (February 2013) |
Common mode failure has a more specific meaning in engineering. It refers to events which are not statistically independent. Failures in multiple parts of a system may be caused by a single fault, particularly random failures due to environmental conditions or aging. An example is when all of the pumps for a fire sprinkler system are located in one room. If the room becomes too hot for the pumps to operate, they will all fail at essentially the same time, from one cause (the heat in the room).[6] Another example is an electronic system wherein a fault in a power supply injects noise onto a supply line, causing failures in multiple subsystems.
This is particularly important in safety-critical systems using multiple redundant channels. If the probability of failure in one subsystem is p, then it would be expected that an N channel system would have a probability of failure of pN. However, in practice, the probability of failure is much higher because they are not statistically independent; for example ionizing radiation or electromagnetic interference (EMI) may affect all the channels.[7]
The principle of redundancy states that, when events of failure of a component are statistically independent, the probabilities of their joint occurrence multiply.[8] Thus, for instance, if the probability of failure of a component of a system is one in one thousand per year, the probability of the joint failure of two of them is one in one million per year, provided that the two events are statistically independent. This principle favors the strategy of the redundancy of components. One place this strategy is implemented is in RAID 1, where two hard disks store a computer's data redundantly.
But even so, a system can have many common modes of failure. For example, consider the common modes of failure of a RAID1 where two disks are purchased from an online store and installed in a computer:
- The disks are likely to be from the same manufacturer and of the same model, therefore they share the same design flaws.
- The disks are likely to have similar serial numbers, thus they may share any manufacturing flaws affecting production of the same batch.
- The disks are likely to have been shipped at the same time, thus they are likely to have suffered from the same transportation damage.
- As installed both disks are attached to the same power supply, making them vulnerable to the same power supply issues.
- As installed both disks are in the same case, making them vulnerable to the same overheating events.
- They will be both attached to the same card or motherboard, and driven by the same software, which may have the same bugs.
- Because of the very nature of RAID1, both disks will be subjected to the same workload and very closely similar access patterns, stressing them in the same way.
Also, if the events of failure of two components are maximally statistically dependent, the probability of the joint failure of both is identical to the probability of failure of them individually. In such a case, the advantages of redundancy are negated. Strategies for the avoidance of common mode failures include keeping redundant components physically isolated.
A prime example of redundancy with isolation is a nuclear power plant.[9][10] The new ABWR has three divisions of Emergency Core Cooling Systems, each with its own generators and pumps and each isolated from the others. The new European Pressurized Reactor has two containment buildings, one inside the other. However, even here it is possible for a common mode failure to occur (for example, in the Fukushima Daiichi Nuclear Power Plant, mains power was severed by the Tōhoku earthquake, then the thirteen backup diesel generators were all simultaneously disabled by the subsequent tsunami that flooded the basements of the turbine halls).
See also
[edit]Bibliography
[edit]- Deming, W. E. (1975) On probability as a basis for action, The American Statistician, 29(4), pp. 146–152
- Deming, W. E. (1982) Out of the Crisis: Quality, Productivity and Competitive Position ISBN 0-521-30553-5
- Keynes, J. M. (1936) The General Theory of Employment, Interest and Money ISBN 1-57392-139-4
- Keynes, J. M. (1921)[5]
- Knight, F. H. (1921) Risk, Uncertainty and Profit ISBN 1-58798-126-2
- Shackle, G. L. S. (1972) Epistemics and Economics: A Critique of Economic Doctrines ISBN 1-56000-558-0
- Shewhart, W. A. (1931) Economic Control of Quality of Manufactured Product ISBN 0-87389-076-0
- Shewhart, W. A. (1939) Statistical Method from the Viewpoint of Quality Control ISBN 0-486-65232-7
- Wheeler, D. J. & Chambers, D. S. (1992) Understanding Statistical Process Control ISBN 0-945320-13-2
References
[edit]- ^ Shewhart, Walter A. (1931). Economic control of quality of manufactured product. New York City: D. Van Nostrand Company, Inc. p. 7. OCLC 1045408.
- ^ a b Western Electric Company (1956). Introduction to Statistical Quality Control handbook (1 ed.). Indianapolis, Indiana: Western Electric Co. pp. 23–24. OCLC 33858387.
- ^ Shewhart, Walter A. (1931). Economic control of quality of manufactured product. New York City: D. Van Nostrand Company, Inc. p. 14. OCLC 1045408.
- ^ "Statistical Inference". Archived from the original on 7 October 2006. Retrieved 13 November 2006.
- ^ a b Keynes, J. M. (1921). A Treatise on Probability. ISBN 0-333-10733-0.
{{cite book}}: ISBN / Date incompatibility (help) - ^ Thomson, Jim (February 2012). "Common-Mode Failure Considerations in High-Integrity C&I Systems" (PDF). Safety in Engineering. Retrieved 21 November 2012.
- ^ Randell, Brian Design Fault Tolerance in The Evolution of Fault-Tolerant Computing, Avizienis, A.; Kopetz, H.; Laprie, J.-C. (eds.), pp. 251–270. Springer-Verlag, 1987. ISBN 3-211-81941-X.
- ^ "SEI Framework: Fault Tolerance Mechanisms". Redundancy Management. NIST High Integrity Software Systems Assurance. 30 March 1995. Archived from the original on 24 November 2012. Retrieved 21 November 2012.
- ^ Edwards, G. T.; Watson, I. A. (July 1979). A Study of Common-Mode Failures. UK Atomic Energy Authority: Safety and Reliability Directorate. SRD R146.
{{cite book}}: CS1 maint: publisher location (link) - ^ Bourne, A. J.; Edwards, G. T.; Hunns, D. M.; Poulter, D. R.; Watson, I. A. (January 1981). Defences against Common-Mode Failures in Redundancy Systems – A Guide for Management, Designers and Operators. UK Atomic Energy Authority: Safety and Reliability Directorate. SRD R196.
{{cite book}}: CS1 maint: publisher location (link)
Common cause and special cause (statistics)
View on GrokipediaHistorical Origins
Early Concepts in Quality Control
In the early 20th century, particularly during the interwar period, manufacturing industries faced growing pressures from mass production techniques to improve efficiency and reliability, especially in sectors like telecommunications where defects could lead to significant operational failures.[5] At Bell Telephone Laboratories, Walter Shewhart addressed these challenges by developing control charts in the 1920s, a tool designed to differentiate predictable, inherent variations in production processes from erratic, unpredictable changes that signaled potential issues.[6] This innovation emerged from practical needs at Western Electric's Hawthorne Works, where Shewhart analyzed data from telephone equipment manufacturing to identify when processes were stable versus disrupted by external factors.[7] The initial recognition of process variation as either inherent (random and unavoidable within a stable system) or attributable to identifiable external factors stemmed directly from these industrial demands, as engineers sought ways to minimize downtime and defects without overhauling entire production lines.[2] Shewhart's approach built on statistical principles to classify variations, allowing for targeted interventions rather than blanket inspections, which were costly and inefficient in an era of expanding output. Early quality control practices in manufacturing emphasized reducing waste through systematic inspection and process monitoring, as unchecked variations led to scrap, rework, and lost productivity in high-volume environments.[8] This focus drove the need to separate stable process behavior—where variations were part of normal operations—from disruptions caused by factors like equipment malfunctions or material inconsistencies, enabling manufacturers to maintain economic viability amid interwar economic fluctuations.[5] Shewhart's seminal 1931 book, Economic Control of Quality of Manufactured Product, provided the first formal articulation of these variation types, integrating statistical methods with economic analysis to guide quality decisions in industry.[9] This work laid the groundwork for broader adoption of statistical techniques in quality assurance.[10]Contributions of Shewhart and Deming
Walter A. Shewhart, an American physicist and engineer at Bell Telephone Laboratories, invented the control chart in 1924 as a tool to monitor process stability and identify non-random shifts in variation, thereby distinguishing between inherent process fluctuations and anomalous events.[11] This innovation laid the foundational framework for statistical process control by enabling practitioners to separate predictable variation from unpredictable deviations, influencing subsequent quality management practices. Deming coined the terms 'common cause' for inherent system variation and 'special cause' for assignable anomalies, building on Shewhart's earlier distinctions between chance and assignable causes.[12][7] Building on Shewhart's principles, W. Edwards Deming, an American statistician and management consultant, promoted statistical quality control in Japan following World War II, adapting these ideas to emphasize systemic improvement over isolated fixes.[13] Deming integrated Shewhart's concepts into the Plan-Do-Study-Act (PDSA) cycle, a iterative model for continuous process enhancement that involves planning changes, implementing them, checking results against expectations, and acting on findings to refine the system.[14] This adaptation transformed Shewhart's statistical tools into a broader philosophy for organizational learning and quality assurance.[15] In his 14 Points for Management, outlined in the 1980s, Deming advocated for a leadership approach that treats common-cause variation—random fluctuations inherent to any system—as the responsibility of management to reduce through systemic redesign, rather than attributing it to individual workers.[16] He argued that most performance issues stem from system-wide factors under managerial control, urging leaders to foster cooperation and eliminate fear to address these root causes effectively.[4] This perspective shifted blame from personnel to process design, promoting a holistic view of variation in organizational performance.[17] Deming's influence gained prominence through his lectures to Japanese industrial leaders starting in 1950, organized by the Union of Japanese Scientists and Engineers (JUSE), where he emphasized statistical methods for quality administration and long-term planning.[18] These sessions, including a key address at the Mt. Hakone Conference, inspired Japan's quality revolution by encouraging widespread adoption of statistical control techniques, which contributed to the postwar economic resurgence and the global competitiveness of companies like Toyota.[13] Toyota, in particular, incorporated Deming's teachings into its production system, prioritizing variation reduction and employee involvement to achieve high-quality manufacturing standards.[19] In his 1986 book Out of the Crisis, Deming critiqued the over-reliance on reactive measures for special causes—unusual, identifiable anomalies—without tackling pervasive common causes, which he estimated account for the majority of quality issues and require fundamental management reforms to resolve.[20] He warned that such superficial interventions, often termed "tampering," exacerbate variation and undermine productivity, calling instead for a transformation in management philosophy to prioritize stable systems and profound knowledge of variation.[17] This work solidified Deming's legacy in advocating sustainable quality improvements over short-term fixes.[21]Fundamental Definitions
Common-Cause Variation
Common-cause variation, also referred to as chance cause or system noise, encompasses the inherent, random fluctuations in a process stemming from countless minor, unavoidable factors that collectively produce a stable and predictable pattern of variability centered around the process mean.[22] This type of variation is intrinsic to the system itself, arising from interactions among its components rather than from isolated external disruptions.[4] Key characteristics of common-cause variation include its random distribution across outputs, uniform impact on all process units, and predictability within established statistical bounds, such that it can only be diminished through fundamental systemic improvements like enhanced training protocols or equipment redesign.[23] Walter Shewhart originally described this as "chance-cause" variation in his seminal 1931 book Economic Control of Quality of Manufactured Product, emphasizing its role as the baseline noise in stable systems, while W. Edwards Deming later popularized the term "common cause" to highlight its systemic origins.[2] Deming further estimated that common causes account for approximately 94% of quality issues, underscoring the need for management to address system-wide factors rather than individual errors.[24] Mathematically, common-cause variation is typically modeled using the process standard deviation , where, under a normal distribution assumption for stable processes, approximately 99.73% of data points fall within of the mean, defining the expected range of natural fluctuations.[25] Shewhart established these limits as a practical criterion for delineating predictable variation from anomalies.[26] A process is considered in a state of statistical control—or "in control"—when variation is solely attributable to common causes, rendering it stable, repeatable, and forecastable over time without erratic shifts.[25] In contrast to special causes, which introduce non-random outliers, common-cause variation maintains consistency, allowing for reliable process performance assessment.[22]Special-Cause Variation
Special-cause variation, also known as assignable cause variation, arises from specific, external events or factors that are not inherent to the normal operation of a process, resulting in unpredictable and unusual outliers in performance.[27] This type of variation stems from identifiable sources outside the stable system's routine fluctuations, allowing for targeted investigation and elimination to prevent recurrence.[28] Key characteristics of special-cause variation include the presence of non-random patterns, such as sustained trends, abrupt shifts, or cycles, which typically affect only a subset of process outputs rather than all units uniformly. These causes are often traceable to discrete incidents, such as equipment malfunctions, human errors, or environmental disruptions, enabling individual correction to restore process stability.[23] In contrast to common-cause variation, which represents inherent, random fluctuations within a stable process, special-cause variation signals instability and necessitates prompt intervention to identify and remove the underlying factor, thereby preventing broader impacts on quality and efficiency.[23] Mathematical detection of special-cause variation relies on control charts, where out-of-control signals manifest as data points exceeding the ±3σ control limits from the process mean or exhibiting non-random patterns, such as seven consecutive points on one side of the centerline. Additional rules commonly used for detecting special causes include eight tests for patterns such as runs, trends, and zone violations, to enhance sensitivity beyond basic limits.Practical Examples
Common-Cause Scenarios
In manufacturing processes, common-cause variation often manifests as minor, inherent fluctuations in product specifications due to systemic factors such as subtle differences in machine operation or material composition. For instance, in the bottling of soda, slight weight variations around a target of 355 ml can arise from minor inconsistencies in filling machine speed or ingredient mixing, resulting in a normal distribution of weights that remains predictable and stable over time.[29] These variations are inherent to the process design and do not indicate isolated faults. In the service industry, common-cause variation appears in routine operations influenced by natural differences in inputs or human factors. A typical example is call center response times, which may vary by 1-2 minutes due to differences in customer query complexity or agent fatigue, producing a stable bell-shaped distribution of handling times.[29] Such fluctuations reflect the everyday rhythm of the system rather than external disruptions. In healthcare, common-cause variation is evident in biological and procedural inconsistencies across similar cases. For example, patient recovery times following surgery can differ slightly due to inherent biological variability among individuals, which can be quantified using standard deviation measures without pointing to assignable errors.[29][30] These scenarios highlight common-cause variation as an intrinsic, random element of stable processes, distinguishable from special-cause outliers by its predictability and lack of identifiable triggers. Reducing such variation necessitates systemic redesign, such as overhauling equipment or training protocols, rather than targeting individual instances, as emphasized in quality management principles.[31] To illustrate, consider a hypothetical dataset of soda bottle weights from a stable filling process, all falling within expected limits and forming a normal distribution:| Bottle ID | Weight (ml) |
|---|---|
| 1 | 353.8 |
| 2 | 355.2 |
| 3 | 354.1 |
| 4 | 356.0 |
| 5 | 354.9 |
| 6 | 355.5 |
| 7 | 353.5 |
| 8 | 356.3 |
| 9 | 354.7 |
| 10 | 355.1 |
Special-Cause Scenarios
Special-cause variation manifests in scenarios where external, assignable factors disrupt process stability, producing detectable signals such as points beyond control limits or systematic patterns in monitoring data. These events are distinguishable from inherent process fluctuations because they stem from specific, identifiable root causes like equipment issues or isolated incidents, allowing for targeted interventions to restore stability. In quality control, such variations prompt immediate investigation using tools like control charts to trace and eliminate the special factor, preventing recurrence.[3] In manufacturing, a classic example is a sudden spike in defective widgets resulting from a tool malfunction, such as residue buildup in a molding machine that leads to uneven edges in silicon chip batches. This creates clustered outliers exceeding upper control limits on an Individuals control chart, signaling non-random variation traceable to the equipment failure. Corrective action, like mold replacement and cleaning, removes the special cause and returns the process to common-cause stability.[32][3] A service industry scenario occurs in logistics, where delayed shipments arise from a single vehicle breakdown, imposing a trend in delivery times that appears as eight consecutive points above the centerline on a control chart. This non-random pattern is linked to the mechanical failure, which can be resolved through vehicle repair or contingency routing, thereby eliminating the assignable cause and normalizing operations.[33][34] In healthcare, elevated infection rates in a hospital ward may result from a contaminated batch of supplies, such as improperly sterilized instruments, producing non-random points like a run of values beyond two sigma limits in infection monitoring data. This special cause is attributable to the procedural lapse in sterilization, and addressing it—through supplier audits and enhanced protocols—halts the variation and reduces patient risk.[35] To illustrate detectability, consider a hypothetical X-bar control chart monitoring defect rates in manufacturing, where Western Electric rules identify special causes: a single point beyond the upper control limit (Rule 1) flags the initial tool malfunction impact, followed by two of three points beyond 2-sigma (Rule 2) confirming the clustered outliers. These rules, developed for efficient pattern recognition, enable rapid response without overreacting to common variation.[36]| Point | Defect Rate (%) | Position Relative to Limits | Western Electric Rule Triggered |
|---|---|---|---|
| 1-5 | 2.1-2.3 | Within ±3σ | None (common cause) |
| 6 | 4.5 | Beyond UCL (+3σ) | Rule 1: One point > UCL |
| 7-9 | 4.2, 4.6, 4.1 | Beyond +2σ (two of three) | Rule 2: 2/3 points > +2σ |
| 10+ | 2.2-2.4 | Within limits (post-correction) | None |
