Actuarial Models 1, Univariate Survival Analysis 2: Parametric Survival Models

2024-04-01


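Contents:
Uniform distribution (de Moivre, 1724)
Exponential distribution
Gompertz distribution (1825)
Makeham distribution (1860)
Weibull distribution (Fréchet, 1927; Weibull, 1951)
Gamma function
Basic survival functions of the Weibull distribution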

This lecture introduces several commonly used distributions for the remaining lifetime $T$.

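Uniform distribution (de Moivre, 1724)

Let $w$ denote the limiting age. Then $T \sim U(0,w)$, with density $f_T(t) = \frac{1}{w}I_{0 \le t \le w}$.

Features: this was the first continuous probability model of lifetime. The remaining lifetime is uniformly distributed, the hazard rate rises with age, and death is certain once the limiting age is reached. Applicability: it is not suitable over long time intervals.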

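Properties:

Survival function

$S(t)=\frac{w-t}{w},\ 0 \le t \le w$

Hazard rate function

$h(t)=\frac{1}{w-t},\ 0 \le t < w$

Mean remaining lifetime (expectation)

$E[T]=\frac{w}{2}$

Variance

$Var[T]=\frac{w^2}{12}$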
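The formulas above can be tried out numerically. The following is a minimal Python sketch, not part of the original lecture; the function names and the limiting age `w = 100` are assumptions chosen only for illustration.

```python
def uniform_survival(t, w):
    """S(t) = (w - t) / w for 0 <= t <= w."""
    if not 0 <= t <= w:
        raise ValueError("t must lie in [0, w]")
    return (w - t) / w


def uniform_hazard(t, w):
    """h(t) = f(t) / S(t) = 1 / (w - t) for 0 <= t < w."""
    if not 0 <= t < w:
        raise ValueError("t must lie in [0, w)")
    return 1.0 / (w - t)


def uniform_mean(w):
    """E[T] = w / 2."""
    return w / 2


def uniform_variance(w):
    """Var[T] = w^2 / 12."""
    return w ** 2 / 12


if __name__ == "__main__":
    w = 100.0  # hypothetical limiting age, for illustration only
    for t in (0.0, 50.0, 90.0, 99.0):
        # The hazard grows without bound as t approaches w.
        print(t, uniform_survival(t, w), uniform_hazard(t, w))
    print(uniform_mean(w), uniform_variance(w))
```

The printed hazard values increase with `t`, which illustrates why the model forces death by the limiting age.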

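Exponential distribution

Assume $T$ has density $f_T(t)=\frac{1}{\theta}e^{-\frac{1}{\theta}t},\ t>0$, where $\theta>0$.

Features: the force of mortality is constant (the hazard rate function is a constant); the probability of death over a given period does not depend on the current age; it is a special case of both the Gamma distribution and the Weibull distribution. Applicability: it is generally used for age intervals of one year or less.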

Properties

Survival function

$S(t)=e^{-\frac{1}{\theta}t},\ t \ge 0$

Hazard rate function

$h(t)=\frac{1}{\theta}$

Expectation

$E[T]=\theta$

Variance

$Var[T]=\theta^2$

Memorylessness

$P(T\ge y+t \mid T\ge y)=P(T\ge t)$
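Memorylessness follows from $S(y+t)/S(y)=S(t)$. A small Python sketch of that identity is given here as an illustration; the function names and the value `theta = 2.0` are assumptions, not taken from the lecture.

```python
import math


def exp_survival(t, theta):
    """S(t) = exp(-t / theta) for t >= 0, where theta is the mean."""
    return math.exp(-t / theta)


def conditional_survival(y, t, theta):
    """P(T >= y + t | T >= y) = S(y + t) / S(y)."""
    return exp_survival(y + t, theta) / exp_survival(y, theta)


if __name__ == "__main__":
    theta = 2.0  # hypothetical mean remaining lifetime
    t = 1.5
    for y in (0.0, 1.0, 5.0):
        # The conditional probability should not depend on the age y.
        print(y, conditional_survival(y, t, theta), exp_survival(t, theta))
```

Each row prints the same two numbers up to rounding, whatever the value of `y`.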

Gompertz distribution (1825)

The Gompertz distribution is obtained by specifying the hazard rate function directly: $h(t)=Bc^t,\ t \ge 0,\ c >1,\ B>0$

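Its practical applicability is limited, because the associated basic survival-analysis functions have very complicated forms. The survival function is somewhat simpler: $S(t)=\exp \left( \frac{B}{\ln c}(1-c^t) \right)$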
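The closed form can be checked with a short derivation, using the standard relation $S(t)=\exp\left(-\int_0^t h(s)\,ds\right)$ between the survival function and the hazard rate, applied to the Gompertz hazard $Bc^s$:

$\int_0^t Bc^s\,ds = \frac{B}{\ln c}(c^t-1)$

$S(t)=\exp\left(-\frac{B}{\ln c}(c^t-1)\right)=\exp\left(\frac{B}{\ln c}(1-c^t)\right)$

Because $c>1$, we have $\ln c>0$ and $c^t \ge 1$, so the exponent is nonpositive and $S(t)$ decreases from $S(0)=1$ toward 0.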

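Makeham distribution (1860)

The Makeham distribution is a modification of the Gompertz distribution. The Gompertz distribution models the age-dependent hazard rate with an exponential function of age, but it does not account for the mortality risks that are common to all ages. The Makeham hazard rate function therefore adds a constant term: $h(t)=A+Bc^t,\ t \ge 0,\ c >1,\ B>0,\ A>-B$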

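This form is slightly more complicated than the Gompertz form, so the associated basic survival-analysis functions are also very complicated. The survival function is $S(t)=\exp \left( \frac{B}{\ln c}(1-c^t) -At\right)$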
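As a sanity check, the closed-form survival function can be compared with a direct numerical integration of the hazard rate. The Python sketch below is only an illustration: the function names, the midpoint rule, and the parameter values are assumptions. Setting `A = 0` recovers the Gompertz case.

```python
import math


def makeham_hazard(t, A, B, c):
    """h(t) = A + B * c**t."""
    return A + B * c ** t


def makeham_survival(t, A, B, c):
    """Closed form S(t) = exp(B / ln(c) * (1 - c**t) - A * t)."""
    return math.exp(B / math.log(c) * (1 - c ** t) - A * t)


def survival_from_hazard(t, hazard, steps=10_000):
    """Approximate exp(-integral_0^t hazard(s) ds) with the midpoint rule."""
    dt = t / steps
    integral = sum(hazard((i + 0.5) * dt) for i in range(steps)) * dt
    return math.exp(-integral)


if __name__ == "__main__":
    A, B, c = 0.0007, 0.00005, 1.1  # hypothetical parameter values
    t = 40.0
    closed = makeham_survival(t, A, B, c)
    numeric = survival_from_hazard(t, lambda s: makeham_hazard(s, A, B, c))
    print(closed, numeric)  # the two values should agree closely
```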

Weibull distribution (Fréchet, 1927; Weibull, 1951)

The Weibull distribution has parameters $\theta,\gamma$, and its probability density function is $f(x)=\frac{\gamma}{\theta}x^{\gamma-1}e^{-\frac{1}{\theta}x^{\gamma}},\ x>0,\ \gamma,\theta>0$

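Gamma function

The Gamma function extends the factorial to the real numbers. It is defined by the integral $\Gamma(x)=\int_0^{\infty}t^{x-1}e^{-t}dt,\ x>0$

The Gamma function and the computational techniques associated with it are very important in probability and statistics.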

Properties

$\Gamma(x+1)=x\Gamma(x)$

$\Gamma(1/2)=\sqrt{\pi}$

$\Gamma(x) \sim \sqrt{2\pi}e^{-x}x^{x-\frac{1}{2}}$ as $x \to \infty$ (Stirling's formula)
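These properties are easy to check numerically with `math.gamma` from the Python standard library. The sketch below is illustrative only: the helper name `stirling` and the test points are assumptions, and the recurrence is used in the form $\Gamma(x+1)=x\Gamma(x)$.

```python
import math


def stirling(x):
    """Stirling approximation sqrt(2*pi) * exp(-x) * x**(x - 1/2)."""
    return math.sqrt(2 * math.pi) * math.exp(-x) * x ** (x - 0.5)


if __name__ == "__main__":
    x = 4.3  # arbitrary test point
    # Recurrence: Gamma(x + 1) = x * Gamma(x)
    print(math.gamma(x + 1), x * math.gamma(x))
    # Special value: Gamma(1/2) = sqrt(pi)
    print(math.gamma(0.5), math.sqrt(math.pi))
    # Stirling: the ratio Gamma(x) / stirling(x) approaches 1 as x grows
    for x in (5.0, 20.0, 100.0):
        print(x, math.gamma(x) / stirling(x))
```

The ratio stays slightly above 1 and moves toward 1 as `x` increases, which is what the asymptotic relation asserts.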

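Proof. Here we give a simple proof of Property 3 (Stirling's formula) using ideas from probability theory. An analytic proof of Property 3 can be found in any textbook on mathematical analysis.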

We prove it using probability theory. Suppose $X_1$, $X_2$, …, $X_n$ are independent Poisson random variables, each with mean 1, and define $S_n = \sum_{i=1}^n X_i$. Then $S_n$ is Poisson with mean $n$, so $E(S_n)=Var(S_n)=n$ and $P(S_n = n) = \frac{e^{-n}n^n}{n!}$.

By the CLT, $\frac{S_n-n}{\sqrt{n}} \to_d N(0,1)$. Let $F(x)$ be the CDF of the standard normal distribution. Since $S_n$ is integer-valued, $P(S_n=n) = P(n-1<S_n\le n) = P\left(-\frac{1}{\sqrt{n}}<\frac{S_n-n}{\sqrt{n}} \le 0\right)$, which is approximately $F(0)-F(-\frac{1}{\sqrt{n}})$. Convergence in distribution alone does not control the probability of an interval whose length shrinks with $n$; the local limit theorem makes this step rigorous and gives $\frac{P(S_n=n)}{F(0)-F(-\frac{1}{\sqrt{n}})} \to 1$.

Next, expand the normal probability: $F(0)-F(-\frac{1}{\sqrt{n}}) = \int_{-\frac{1}{\sqrt{n}}}^0 \frac{1}{\sqrt{2 \pi}} e^{-\frac{x^2}{2}} dx = \int_{-\frac{1}{\sqrt{n}}}^0 \frac{1}{\sqrt{2 \pi}}\left( 1-\frac{x^2}{2}+o(x^3)\right) dx = \frac{1}{\sqrt{2 \pi n}} - \frac{1}{6\sqrt{2 \pi n^3}} + o\left(\frac{1}{n^2}\right)$

When $n$ is large, the terms after $\frac{1}{\sqrt{2\pi n}}$ are negligible relative to it, so $\frac{e^{-n}n^n}{n!} \sim \frac{1}{\sqrt{2\pi n}}$, that is, $n! \sim \sqrt{2\pi n}\,n^n e^{-n}$. Since $\Gamma(n)=(n-1)!=\frac{n!}{n}$, this gives $\Gamma(n) \sim \sqrt{2\pi}e^{-n}n^{n-\frac{1}{2}}$, which is Property 3 at integer arguments.

Q.E.D.
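The key comparison in the proof, between the Poisson probability $P(S_n=n)=\frac{e^{-n}n^n}{n!}$ and the leading normal term $\frac{1}{\sqrt{2\pi n}}$, can be illustrated numerically. This Python sketch is an addition for illustration; the function name is an assumption, and the probability is computed through logarithms (`math.lgamma`) to avoid overflow for large `n`.

```python
import math


def poisson_pmf_at_mean(n):
    """P(S_n = n) = exp(-n) * n**n / n! for S_n ~ Poisson(n), via logarithms."""
    return math.exp(-n + n * math.log(n) - math.lgamma(n + 1))


if __name__ == "__main__":
    for n in (10, 100, 1000, 10000):
        exact = poisson_pmf_at_mean(n)
        approx = 1 / math.sqrt(2 * math.pi * n)
        # The ratio exact / approx should approach 1 as n grows.
        print(n, exact, approx, exact / approx)
```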

Basic survival functions of the Weibull distribution

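Properties

Survival function

$S(x)=e^{-\frac{1}{\theta}x^{\gamma}},\ x \ge 0$

Hazard rate function

$h(x)=\frac{\gamma}{\theta}x^{\gamma-1},\ x \ge 0$

Clearly, $\gamma$ controls the monotonicity of $h$: the hazard rate is increasing for $\gamma>1$, constant for $\gamma=1$ (the exponential case), and decreasing for $\gamma<1$.

Expectation

$EX=\theta^{1/\gamma}\Gamma(1+1/\gamma)$

$r$-th moment

$EX^r=\theta^{r/\gamma}\Gamma(1+r/\gamma)$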
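The moment formula can be compared with a direct numerical integration of the survival function, using $EX=\int_0^\infty S(x)\,dx$. The Python sketch below is an illustration under stated assumptions: the function names, the truncation point `upper`, and the parameter values are not from the lecture, and the shape parameter $\gamma$ is called `gam` to avoid confusion with `math.gamma`.

```python
import math


def weibull_survival(x, theta, gam):
    """S(x) = exp(-x**gam / theta) for x >= 0."""
    return math.exp(-x ** gam / theta)


def weibull_hazard(x, theta, gam):
    """h(x) = (gam / theta) * x**(gam - 1); use x > 0 when gam < 1."""
    return gam / theta * x ** (gam - 1)


def weibull_moment(r, theta, gam):
    """E[X^r] = theta**(r / gam) * Gamma(1 + r / gam)."""
    return theta ** (r / gam) * math.gamma(1 + r / gam)


def mean_by_integration(theta, gam, upper=50.0, steps=200_000):
    """Midpoint-rule approximation of E[X] = integral of S(x) over [0, upper]."""
    dx = upper / steps
    return sum(weibull_survival((i + 0.5) * dx, theta, gam) for i in range(steps)) * dx


if __name__ == "__main__":
    theta, gam = 2.0, 1.5  # hypothetical parameter values
    print(weibull_moment(1, theta, gam), mean_by_integration(theta, gam))
    # With gam > 1 the hazard increases in x.
    print([weibull_hazard(x, theta, gam) for x in (0.5, 1.0, 2.0)])
```

With `gam = 1` the moment formula reduces to the exponential results $E[T]=\theta$ and $E[T^2]=2\theta^2$.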
