Hubbry Logo
Survival functionSurvival functionMain
Open search
Survival function
Community hub
Survival function
logo
7 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Survival function
Survival function
from Wikipedia

The survival function is a function that gives the probability that a patient, device, or other object of interest will survive past a certain time.[1] The survival function is also known as the survivor function[2] or reliability function.[3] The term reliability function is common in engineering while the term survival function is used in a broader range of applications, including human mortality. The survival function is the complementary cumulative distribution function of the lifetime. Sometimes complementary cumulative distribution functions are called survival functions in general.

Definition

[edit]

Let the lifetime be a continuous random variable describing the time to failure. If has cumulative distribution function and probability density function on the interval , then the survival function or reliability function is:

Examples of survival functions

[edit]

The graphs below show examples of hypothetical survival functions. The x-axis is time. The y-axis is the proportion of subjects surviving. The graphs show the probability that a subject will survive beyond time t.

Four survival functions

For example, for survival function 1, the probability of surviving longer than t = 2 months is 0.37. That is, 37% of subjects survive more than 2 months.

Survival function 1

For survival function 2, the probability of surviving longer than t = 2 months is 0.97. That is, 97% of subjects survive more than 2 months.

Survival function 2

Median survival may be determined from the survival function: The median survival is the point where the survival function intersects the value 0.5.[4] For example, for survival function 2, 50% of the subjects survive 3.72 months. Median survival is thus 3.72 months.

Survival function with indicated median survival

Median survival cannot always be determined from the graph alone. For example, in survival function 4, more than 50% of the subjects survive longer than the observation period of 10 months.

Median survival greater than 10 months

The survival function is one of several ways to describe and display survival data. Another useful way to display data is a graph showing the distribution of survival times of subjects. Olkin,[5] page 426, gives the following example of survival data. The number of hours between successive failures of an air-conditioning (AC) system were recorded. The time in hours, t, between successive failures are 1, 3, 5, 7, 11, 11, 11, 12, 14, 14, 14, 16, 16, 20, 21, 23, 42, 47, 52, 62, 71, 71, 87, 90, 95, 120, 120, 225, 246 and 261. The mean time between failures is 59.6. The figure below shows the distribution of the time between failures. The blue tick marks beneath the graph are the actual hours between successive AC failures.

Distribution of AC failure times

In this example, a curve representing the exponential distribution overlays the distribution of AC failure times; the exponential distribution approximates the distribution of AC failure times. This particular exponential curve is specified by the parameter lambda, λ:

λ = 1/(mean time between failures) = 1/59.6 = 0.0168.

The distribution of failure times is the probability density function (PDF), since time can take any positive value. In equations, the PDF is specified as fT. If time can only take discrete values (such as 1 day, 2 days, and so on), the distribution of failure times is called the probability mass function. Most survival analysis methods assume that time can take any positive value, and fT is the PDF. If the time between observed AC failures is approximated using the exponential function, then the exponential curve gives the probability density function, fT, for AC failure times.

Another useful way to display the survival data is a graph showing the cumulative failures up to each time point. These data may be displayed as either the cumulative number or the cumulative proportion of failures up to each time. The graph below shows the cumulative probability (or proportion) of failures at each time for the air conditioning system. The stairstep line in black shows the cumulative proportion of failures. For each step there is a blue tick at the bottom of the graph indicating an observed failure time. The smooth red line represents the exponential curve fitted to the observed data.

CDF for AC failures

A graph of the cumulative probability of failures up to each time point is called the cumulative distribution function (CDF). In survival analysis, the cumulative distribution function gives the probability that the survival time is less than or equal to a specific time, t.

Let T be survival time, which is any positive number. A particular time is designated by the lower case letter t. The cumulative distribution function of T is the function

where the right-hand side represents the probability that the random variable T is less than or equal to t. If time can take on any positive value, then the cumulative distribution function F(t) is the integral of the probability density function f(t).

For the air-conditioning example, the graph of the CDF below illustrates that the probability that the time to failure is less than or equal to 100 hours is 0.81, as estimated using the exponential curve fit to the data.

AC Time to failure LT 100 hours

An alternative to graphing the probability that the failure time is less than or equal to 100 hours is to graph the probability that the failure time is greater than 100 hours. The probability that the failure time is greater than 100 hours must be 1 minus the probability that the failure time is less than or equal to 100 hours, because total probability must sum to 1.

This gives:

This relationship generalizes to all failure times:

This relationship is shown on the graphs below. The graph on the left is the cumulative distribution function, which is Pr(Tt). The graph on the right is Pr(T > t) = 1 − Pr(Tt). The graph on the right is the survival function, S(t). The fact that the S(t) = 1 – CDF is the reason that another name for the survival function is the complementary cumulative distribution function.

Survival function is 1 - CDF

Parametric survival functions

[edit]

In some cases, such as the air conditioner example, the distribution of survival times may be approximated well by a function such as the exponential distribution. Several distributions are commonly used in survival analysis, including the exponential, Weibull, gamma, normal, log-normal, and log-logistic.[3][6] These distributions are defined by parameters. The normal (Gaussian) distribution, for example, is defined by the two parameters mean and standard deviation. Survival functions that are defined by parameters are said to be parametric.

In the four survival function graphs shown above, the shape of the survival function is defined by a particular probability distribution: survival function 1 is defined by an exponential distribution, 2 is defined by a Weibull distribution, 3 is defined by a log-logistic distribution, and 4 is defined by another Weibull distribution.

Exponential survival function

[edit]

For an exponential survival distribution, the probability of failure is the same in every time interval, no matter the age of the individual or device. This fact leads to the "memoryless" property of the exponential survival distribution: the age of a subject has no effect on the probability of failure in the next time interval. The exponential may be a good model for the lifetime of a system where parts are replaced as they fail.[7] It may also be useful for modeling survival of living organisms over short intervals. It is not likely to be a good model of the complete lifespan of a living organism.[8] As Efron and Hastie [9] (p. 134) note, "If human lifetimes were exponential there wouldn't be old or young people, just lucky or unlucky ones".

Weibull survival function

[edit]

A key assumption of the exponential survival function is that the hazard rate is constant. In an example given above, the proportion of men dying each year was constant at 10%, meaning that the hazard rate was constant. The assumption of constant hazard may not be appropriate. For example, among most living organisms, the risk of death is greater in old age than in middle age – that is, the hazard rate increases with time. For some diseases, such as breast cancer, the risk of recurrence is lower after 5 years – that is, the hazard rate decreases with time. The Weibull distribution extends the exponential distribution to allow constant, increasing, or decreasing hazard rates.

Other parametric survival functions

[edit]

There are several other parametric survival functions that may provide a better fit to a particular data set, including normal, lognormal, log-logistic, and gamma. The choice of parametric distribution for a particular application can be made using graphical methods or using formal tests of fit. These distributions and tests are described in textbooks on survival analysis.[1][3] Lawless[10] has extensive coverage of parametric models.

Parametric survival functions are commonly used in manufacturing applications, in part because they enable estimation of the survival function beyond the observation period. However, appropriate use of parametric functions requires that data are well modeled by the chosen distribution. If an appropriate distribution is not available, or cannot be specified before a clinical trial or experiment, then non-parametric survival functions offer a useful alternative.

Non-parametric survival functions

[edit]

A parametric model of survival may not be possible or desirable. In these situations, the most common method to model the survival function is the non-parametric Kaplan–Meier estimator. This estimator requires lifetime data. Periodic case (cohort) and death (and recovery) counts are statistically sufficient to make non-parametric maximum likelihood and least squares estimates of survival functions, without lifetime data.

Properties

[edit]
  • Every survival function is monotonically decreasing, i.e. for all .
    • It is a property of a random variable that maps a set of events, usually associated with mortality or failure of some system, onto time.
  • The time, , represents some origin, typically the beginning of a study or the start of operation of some system. is commonly unity but can be less to represent the probability that the system fails immediately upon operation.
  • Since the CDF is a right-continuous function, the survival function is also right-continuous.
  • The survival function can be related to the probability density function and the hazard function

So that

  • The expected survival time

See also

[edit]

References

[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
The survival function, denoted as S(t), is a fundamental function in and , defined as the probability that a subject, system, or process survives or remains functional beyond a specified time t, mathematically expressed as S(t) = P(T > t) = 1 - F(t), where T represents the for the time until an event (such as failure or ) occurs and F(t) is the of T. This function is inherently non-increasing, starting at S(0) = 1 (certain survival at time zero) and approaching S(∞) = 0 (inevitable event occurrence), and for continuous distributions, it is right-continuous and differentiable where the density exists. Closely related to other key components of survival models, the survival function connects to the hazard function h(t)—which measures the instantaneous rate of event occurrence given survival to t—through the relation h(t) = f(t) / S(t), where f(t) is the , and to the cumulative hazard function H(t) via S(t) = exp(-H(t)), with H(t) = ∫_0^t h(u) du. These interconnections enable the modeling of diverse failure patterns, such as constant hazards in exponential distributions (S(t) = exp(-λt)) or increasing hazards in Weibull distributions (S(t) = exp(-(λt)^p) for p > 1). In practice, the survival function is pivotal for analyzing time-to-event data, particularly when observations are subject to right-censoring (e.g., study end before event occurrence), and it underpins estimation methods like the Kaplan-Meier estimator for non-parametric survival curves. Applications span multiple fields: in and , it quantifies patient , treatment efficacy, and disease progression by modeling survival times from clinical trials or observational studies. In and reliability, it serves as the reliability function to predict component or lifetimes, optimize schedules, and assess failure risks in hardware like or machinery. Additionally, it informs econometric and research on durations such as spells or .

Basic Concepts

Definition

In survival analysis, the survival function describes the probability distribution of a non-negative random variable TT, which represents the time until the occurrence of a specified event, such as death, failure, or disease onset. The survival function, denoted S(t)S(t), is mathematically defined as S(t)=P(T>t)S(t) = P(T > t) for t0t \geq 0, where PP denotes probability. This function quantifies the probability that the event has not yet occurred by time tt. For proper probability distributions of TT, the survival function satisfies the boundary conditions S(0)=1S(0) = 1 and limtS(t)=0\lim_{t \to \infty} S(t) = 0. It is the complement of the F(t)=P(Tt)F(t) = P(T \leq t), so S(t)=1F(t)S(t) = 1 - F(t). The form of S(t)S(t) depends on whether TT is continuous or discrete: in the continuous case, S(t)S(t) is a right-continuous, non-increasing function approaching zero asymptotically; in the discrete case, it is a with jumps at the possible event times.

Relation to Other Probability Functions

The survival function S(t)=P(T>t)S(t) = P(T > t) is directly related to the cumulative distribution function (CDF) F(t)=P(Tt)F(t) = P(T \leq t) of the TT, representing the time until an event occurs, through the equation S(t)=1F(t)S(t) = 1 - F(t). This relationship holds for both discrete and continuous distributions, ensuring that the survival probability complements the probability of the event having occurred by time tt. For continuous random variables TT, the survival function connects to the (PDF) f(t)f(t), which describes the distribution of event times. Specifically, f(t)=dS(t)dtf(t) = -\frac{dS(t)}{dt}, as the density at tt equals the negative rate of change of the survival probability. This derivative relationship arises because a decrease in S(t)S(t) corresponds to the instantaneous probability of the event occurring at tt. The hazard function h(t)h(t), also known as the or , provides the instantaneous rate of occurrence of given up to time tt, defined as h(t)=f(t)S(t)h(t) = \frac{f(t)}{S(t)}. To derive this, consider the : the hazard is the limit h(t)=limΔt0P(tT<t+ΔtTt)Δth(t) = \lim_{\Delta t \to 0} \frac{P(t \leq T < t + \Delta t \mid T \geq t)}{\Delta t}, which approximates the probability of in a small interval [t,t+Δt)[t, t + \Delta t) divided by the interval length, conditional on survival to tt. Substituting the PDF and survival function yields h(t)=f(t)S(t)h(t) = \frac{f(t)}{S(t)}. Alternatively, using the logarithmic derivative, h(t)=ddtlnS(t)h(t) = -\frac{d}{dt} \ln S(t), because ddtlnS(t)=1S(t)dS(t)dt=f(t)S(t)\frac{d}{dt} \ln S(t) = \frac{1}{S(t)} \frac{dS(t)}{dt} = -\frac{f(t)}{S(t)}, confirming the equivalence and emphasizing the hazard as the rate of exponential decay in . In engineering and reliability theory, the survival function is equivalently termed the reliability function R(t)R(t), denoting the probability that a system or component functions without failure beyond time tt. This interpretation bridges survival analysis with reliability engineering, where R(t)=S(t)R(t) = S(t) models the dependability of mechanical or electronic systems under stress or usage.

Examples and Applications

Illustrative Examples

To illustrate the survival function in a continuous setting, consider a random variable TT following a distribution on the interval [0,a][0, a], where a>0a > 0. The survival function is S(t)=1ta,0ta,S(t) = 1 - \frac{t}{a}, \quad 0 \leq t \leq a, with S(t)=1S(t) = 1 for t<0t < 0 and S(t)=[0](/page/0)S(t) = [0](/page/0) for t>at > a. This form demonstrates a linear decline in the probability of surviving beyond time tt, reflecting constant over the support. For example, if a=10a = 10, then S(5)=0.5S(5) = 0.5, indicating that the probability of past halfway through the interval is exactly half. Graphically, S(t)S(t) appears as a straight line decreasing monotonically from S(0)=1S(0) = 1 to S(a)=[0](/page/0)S(a) = [0](/page/0), highlighting the function's non-increasing property from certainty of at t=0t = 0 to impossibility beyond the maximum lifetime. In discrete time, the geometric distribution offers a simple example, where TT represents the number of periods until the first event occurs in a of independent Bernoulli trials, each with success (event) probability pp where 0<p<10 < p < 1. The survival function is S(t)=(1p)t,t=0,1,2,,S(t) = (1 - p)^t, \quad t = 0, 1, 2, \dots, representing the probability of no event in the first tt periods. This exhibits exponential decay in discrete steps, with survival probability halving (or more) as tt increases depending on pp. For instance, if p=0.1p = 0.1, then S(5)=0.950.5905S(5) = 0.9^5 \approx 0.5905, showing about 59% chance of surviving the first five periods. Graphically, S(t)S(t) forms a step function, constant between integers and dropping abruptly at each tt, decreasing from S(0)=1S(0) = 1 toward 0 as tt \to \infty, which underscores the right-continuous and non-increasing behavior required of survival functions. The serves as the discrete analogue to the exponential distribution in continuous survival analysis, both characterized by the memoryless property.

Practical Applications

In medicine, survival functions are widely applied to estimate patient survival probabilities following treatments, particularly in oncology where 5-year survival rates provide critical prognostic information for clinical decision-making and patient counseling. For instance, these functions help quantify the likelihood of disease-free survival after interventions like chemotherapy or surgery, enabling comparisons across patient cohorts and informing public health strategies. Empirical survival curves, such as those derived from the Kaplan-Meier estimator, are routinely used to visualize these probabilities in clinical trials. In engineering reliability, survival functions predict the time-to-failure for components, aiding in the design and maintenance of systems to minimize downtime and costs. For example, they assess the probability that items like light bulbs or industrial machines will operate without failure beyond a specified duration, supporting warranty predictions and preventive replacement schedules. This interpretive framework allows engineers to evaluate system robustness under varying operational stresses. Actuarial science employs survival functions in constructing life tables, which underpin insurance premium calculations by estimating future mortality risks. These functions determine the probability of survival to various ages, enabling actuaries to price life insurance policies and annuities accurately while accounting for demographic trends. Such applications ensure financial products remain viable amid uncertainties in lifespan distributions. Right-censoring poses challenges in these applications by introducing incomplete observations, such as when study participants drop out before an event occurs, potentially biasing survival probability estimates if not properly addressed. This issue is common in longitudinal medical studies or reliability tests where follow-up ends prematurely, requiring careful interpretation to maintain accuracy.

Parametric Survival Functions

Exponential Survival Function

The exponential survival function arises from the , a parametric model commonly used in survival analysis to describe lifetimes or durations where the hazard rate remains constant over time. It assumes that the probability of an event occurring in the next instant does not depend on how much time has already passed, making it suitable for modeling processes without aging or wear-out effects. The survival function for the exponential distribution is given by
S(t)=eλt,S(t) = e^{-\lambda t},
where t0t \geq 0 is the time and λ>0\lambda > 0 is the constant rate parameter representing the instantaneous hazard. This formula implies that the probability of surviving beyond time tt decreases exponentially with λt\lambda t.
A defining characteristic of the exponential distribution is its memoryless property, which states that the conditional probability of surviving an additional time tt given survival up to time ss equals the unconditional probability of surviving time tt:
P(T>t+sT>s)=P(T>t)=S(t)P(T > t + s \mid T > s) = P(T > t) = S(t)
for all t,s>0t, s > 0. To prove this via conditional probability, note that
P(T>t+sT>s)=P(T>t+s)P(T>s)=eλ(t+s)eλs=eλt=S(t),P(T > t + s \mid T > s) = \frac{P(T > t + s)}{P(T > s)} = \frac{e^{-\lambda (t + s)}}{e^{-\lambda s}} = e^{-\lambda t} = S(t),
demonstrating independence from prior survival time. This property uniquely identifies the exponential among continuous distributions with positive support.
The corresponding hazard function is constant:
h(t)=λ,h(t) = \lambda,
indicating a uniform risk of failure at any point, with no increase or decrease due to aging.
For parameter estimation with uncensored data consisting of nn observed failure times t1,,tnt_1, \dots, t_n, the maximum likelihood estimator of λ\lambda is
λ^=ni=1nti,\hat{\lambda} = \frac{n}{\sum_{i=1}^n t_i},
which is the reciprocal of the sample mean lifetime and maximizes the likelihood function L(λ)=i=1nλeλtiL(\lambda) = \prod_{i=1}^n \lambda e^{-\lambda t_i}. This estimator provides an efficient point estimate under the exponential assumption. The exponential model serves as a foundational case, generalized by distributions like the Weibull for time-varying hazards.

Weibull Survival Function

The Weibull survival function is a parametric form widely used in due to its flexibility in modeling diverse failure time behaviors. It is defined as
S(t)=exp{(tα)β},S(t) = \exp\left\{ -\left(\frac{t}{\alpha}\right)^\beta \right\},
where t0t \geq 0, α>0\alpha > 0 is the representing the characteristic life, and β>0\beta > 0 is the that governs the form of the distribution. This two-parameter model arises from and is particularly suited for analyzing time-to-failure data in and medical contexts.
The corresponding hazard function for the Weibull distribution is
h(t)=βα(tα)β1,h(t) = \frac{\beta}{\alpha} \left( \frac{t}{\alpha} \right)^{\beta - 1},
which allows it to capture a range of hazard shapes depending on β\beta. When β>1\beta > 1, the hazard increases with time, reflecting wear-out processes; when 0<β<10 < \beta < 1, it decreases, indicating early failures like infant mortality; and when β=1\beta = 1, the hazard is constant, simplifying to the exponential case. This versatility makes the Weibull distribution a cornerstone for modeling non-constant hazards in survival data.
In reliability engineering, the Weibull distribution is widely used to model the phases of bathtub-shaped failure rate curves, which characterize the three phases of product life—a decreasing hazard (β<1\beta < 1) for infant mortality or initial defects, a constant hazard (β=1\beta = 1) during useful life, and an increasing hazard (β>1\beta > 1) due to wear-out—often in combination to describe the full curve. Specifically, values of β>1\beta > 1 model the wear-out phase, where material degradation leads to accelerating failures, as seen in components like capacitors or mechanical systems. For instance, in , Weibull parameters are estimated to predict long-term reliability under normal conditions. The model reduces to the exponential survival function when β=1\beta = 1, highlighting its generalization of memoryless processes.

Other Parametric Survival Functions

The log-normal survival function models lifetimes where the logarithm of the survival time follows a normal distribution, making it suitable for processes involving multiplicative effects, such as biological growth or degradation over time. Its survival function is given by S(t)=1Φ(lntμσ),S(t) = 1 - \Phi\left(\frac{\ln t - \mu}{\sigma}\right), where Φ\Phi is the cumulative distribution function of the standard normal distribution, μ\mu is the mean of the log-lifetimes, and σ>0\sigma > 0 is the standard deviation. This distribution is particularly common in biological applications, including the analysis of survival times in clinical studies like hemodialysis outcomes, where data exhibit right-skewness and heavy tails reflective of variable physiological responses. The Gompertz survival function is widely used to describe age-related mortality, capturing the exponential increase in rates observed in aging populations across species. It is expressed as S(t)=exp{cλ(eλt1)},S(t) = \exp\left\{-\frac{c}{\lambda}(e^{\lambda t} - 1)\right\}, where c>0c > 0 represents the initial and λ>0\lambda > 0 governs the rate of exponential increase in mortality. This model has been foundational in for quantifying aging processes, as it aligns with empirical observations of accelerating death rates in adult lifespans, distinguishing it from more flexible shapes like those in the . Other parametric families, such as the , extend modeling capabilities to scenarios with non-monotonic . The accommodates unimodal hazard shapes, rising to a peak before declining, which is useful for failure times in reliability or medical contexts where risks initially increase and then wane.
DistributionHazard Shape Characteristics
Log-normalUnimodal, typically increasing to a maximum then decreasing; heavy-tailed, suitable for skewed .
GompertzStrictly increasing and convex (); ideal for monotonically accelerating mortality in aging.
Log-logisticFlexible: monotone increasing/decreasing or unimodal (inverted-U); supports bathtub-like patterns in later tails.
Selection among these parametric functions often hinges on the observed behavior of the , as it reveals underlying dynamics. For instance, heavy-ed distributions like the log-normal are preferred when times show prolonged persistence in the upper quantiles, common in heterogeneous biological processes, whereas the Gompertz excels for with rapidly escalating s indicative of deterministic aging trajectories; the log-logistic is chosen for datasets exhibiting crossover or declining risks after an initial peak. Model fit can be assessed via quantiles or criteria to ensure alignment with empirical heaviness.

Non-Parametric Survival Functions

Kaplan-Meier Estimator

The Kaplan-Meier estimator, also known as the product-limit estimator, is a non-parametric method for estimating the survival function S(t)S(t) from lifetime data subject to right-censoring. It provides a step function that jumps at each observed event time, remaining constant between events, and is widely used in survival analysis to describe the probability of survival beyond time tt without assuming a specific parametric form for the underlying distribution. Introduced in 1958, this estimator is particularly valuable in medical research, reliability engineering, and other fields where follow-up data may be incomplete due to censoring. The estimator is defined as S^(t)=tit(1dini),\hat{S}(t) = \prod_{t_i \leq t} \left(1 - \frac{d_i}{n_i}\right), where the product is taken over the distinct event times tit_i, did_i is the number of events (such as deaths or failures) observed at time tit_i, and nin_i is the number of individuals at risk just prior to time tit_i. The at-risk set nin_i includes all subjects who have not yet experienced an event or been censored before tit_i. At t=0t = 0, S^(0)=1\hat{S}(0) = 1, and the estimate decreases stepwise at each event time by the factor (nidi)/ni(n_i - d_i)/n_i, which represents the conditional survival probability at that instant given survival up to that point. If there are no events by time tt, S^(t)=1\hat{S}(t) = 1; beyond the last event, the estimate is undefined or held constant, depending on the context. Right-censoring occurs when the event time for some subjects is unknown because observation ends before the event (e.g., due to study withdrawal or loss to follow-up), but the censoring time and the fact that the event has not occurred by then are known. The Kaplan-Meier method handles this by including censored subjects in the at-risk set nin_i up to their censoring time, thereby contributing to the denominator for all event times prior to their censoring, but they do not contribute to any did_i since no event is observed for them. Once censored, they are removed from subsequent at-risk sets. This approach assumes that censoring provides partial information about , allowing the to adjust the survival probabilities accordingly without biasing the estimate under the model's assumptions. The method relies on several key assumptions for its validity. Censoring must be independent of times, meaning the probability of censoring does not depend on the underlying survival time (non-informative censoring), which ensures that censored observations are representative of the full . Additionally, the model assumes continuous time, implying no tied event times; in practice, ties are handled by treating them as occurring in rapid succession or using adjustments, but the basic formulation presumes distinct times to avoid complications in the product. Violations of these assumptions, such as informative censoring, can lead to biased estimates. To quantify uncertainty in the Kaplan-Meier estimate, confidence intervals are typically constructed using the asymptotic variance provided by Greenwood's formula: Var(S^(t))S^(t)2titdini(nidi),\text{Var}(\hat{S}(t)) \approx \hat{S}(t)^2 \sum_{t_i \leq t} \frac{d_i}{n_i (n_i - d_i)}, where the sum is over event times up to tt. This variance estimator, derived from the applied to the log of the product-limit, accounts for the binomial variability at each event time and is asymptotically normal for large samples. Approximate (1α)×100%(1 - \alpha) \times 100\% confidence intervals can then be obtained as S^(t)±z1α/2Var(S^(t))\hat{S}(t) \pm z_{1 - \alpha/2} \sqrt{\text{Var}(\hat{S}(t))}
Add your contribution
Related Hubs
User Avatar
No comments yet.