Testing epidemic changes of infinite dimensional parameters

icon

22

pages

icon

English

icon

Documents

Écrit par

Publié par

Le téléchargement nécessite un accès à la bibliothèque YouScribe Tout savoir sur nos offres

icon

22

pages

icon

English

icon

Documents

Le téléchargement nécessite un accès à la bibliothèque YouScribe Tout savoir sur nos offres

Niveau: Supérieur, Master, Bac+5
Testing epidemic changes of infinite dimensional parameters Alfredas Racˇkauskas Vilnius University and Institute of Mathematics and Informatics Department of Mathematics, Vilnius University Naugarduko 24, Lt-2006 Vilnius Lithuania Charles Suquet Universite des Sciences et Technologies de Lille Laboratoire P. Painleve, UMR CNRS 8524 Bat. M2, U.F.R. de Mathematiques F-59655 Villeneuve d'Ascq Cedex France Abstract To detect epidemic change in the mean of a sample of size n of random elements in a Banach space, we introduce new test statistics DI based on weighted increments of partial sums. We obtain their limit distributions under the null hypothesis of no change in the mean. Under alternative hypothesis our statistics can detect very short epidemics of length log? n, ? > 1. We present applications to detect epidemic changes in distribution function or characteristic function of real valued observations as well as changes in covariance matrixes of random vectors. Some keywords: change point, epidemic alternative, functional central limit theorem, Holder norm, partial sums processes 1 Introduction A central question in the area of change point detection is testing for changes in the mean of a sample. Indeed many change point problems may be reduced to this basic setting, see e.g. Brodsky and Darkhovsky [1]. Here we present a new illustration of this general approach. Starting from the detection of epidemic changes in the mean of Banach space valued random elements, we construct new tests to detect changes in the distribution function or the characteristic function of real valued observations as well as changes in covariance matrixes of

  • holderian weight

  • detect epidemic

  • gaussian random

  • alternative

  • test statistics

  • bit stronger then

  • dyadic increment

  • sums process

  • sequence n?1


Voir icon arrow

Publié par

Nombre de lectures

24

Langue

English

Testing
epidemic
changes of infinite dimensional parameters
AlfredasskasckauˇaR Vilnius University and Institute of Mathematics and Informatics Department of Mathematics, Vilnius University Naugarduko 24, Lt-2006 Vilnius Lithuania
CharlesSuquet Universite´desSciencesetTechnologiesdeLille LaboratoireP.Painleve´,UMRCNRS8524 Bˆat.M2,U.F.R.deMath´ematiques F-59655 Villeneuve d’Ascq Cedex France
Abstract To detect epidemic change in the mean of a sample of sizenof random elements in a Banach space, we introduce new test statistics DI based on weighted increments of partial sums. We obtain their limit distributions under the null hypothesis of no change in the mean. Under alternative hypothesis our statistics can detect very short epidemics of length logγn, γ >1. We present applications to detect epidemic changes in distribution function or characteristic function of real valued observations as well as changes in covariance matrixes of random vectors. Some keywords:change point, epidemic alternative, functional central limit theorem,H¨oldernorm,partialsumsprocesses
1 Introduction
A central question in the area of change point detection is testing for changes in the mean of a sample. Indeed many change point problems may be reduced to this basic setting, see e.g. Brodsky and Darkhovsky [1]. Here we present a new illustration of this general approach. Starting from the detection of epidemic changes in the mean of Banach space valued random elements, we construct new tests to detect changes in the distribution function or the characteristic function of real valued observations as well as changes in covariance matrixes of random vectors. LetBbe a separable Banach space with a normkxkand dual spaceB0with duality denoted byf(x), fB0, xB that. SupposeX1, . . . , Xnare random
1
elements inBwith meansm1, . . . ,mn want to test the standardrespectively. We null hypothesis of a constant mean
(H0):m1=∙ ∙ ∙=mn against the so called epidemic alternative (HA):there are integers1< k< m< nsuch that m1=m2=∙ ∙ ∙=mk=mm+1=∙ ∙ ∙=mn, mk+1=∙ ∙ ∙=mmandmk6=mk+1. The study of epidemic change models (withB=R) goes back to Levin and Kline [8] who proposed test statistics based on partial sums. Using a maximum likelihood approach, Yao [14] suggested some weighted versions of these statistics whichmaybeviewedassomediscreteH¨oldernormofthepartialsumsprocess, seealsoCs¨org˝oandHorv´ath[3]andthereferencestherein. In[12],westudyalargeclassofstatisticsobtainedbydiscretizingHo¨lder norms of the partial sums process (withB=R). One important feature ofHo¨lderianweightingisthedetectionofshortepidemics.Roughlyspeak-ing,theuseofHo¨lderiantestsallowsthedetectionofepidemicswhoselength l:=mkis at least of the order of lnγnwithγ >1, while the same test statisticswithoutH¨olderianweightdetectsonlyepidemicssuchthatn1/2lgoes to infinity. Among the test statistics suggested in [12], the statistics DI(n, ρ) built on the dyadic increments of partial sums are of special interest because their limiting distribution is explicitly computable. The aim of this new con-tribution is to investigate asymptotical behavior of DI(n, ρ) in the setting ofB valued random elementsXi’s. The paper is organized as follows. Section 2 introduce the dyadic increment test statistics and present general results on their asymptotical behavior under (H0) or under (HA). Several applications are proposed in Section 3, involv-ing epidemic changes in distribution function (through Kolmogorov-Smirnov or Cram´er-vonMisesversionsofDI(n, ρ)), in characteristic function, in the co-variance of a random vector. Section 4 gathers the relevant background on Ho¨lderianfunctionalcentrallimittheoremandtheproofs.
2 The class of DI test statistics
Let us denote by Djthe set of dyadic numbers in [0,1] of levelj, i.e. D0={0,1},Dj=(2l1)2j; 1l2j1, j1. Write forrDj,j0, r:=r2j, r+:=r+ 2j .
For a functionx: [0,1]B, we shall denote
λr(x) :=x(r)21(x(r+) +x(r)),
2
rDj,
j1
(1)
andλr(x) =x(r) in the special caser= 0,1. Consider partial sums S(0) = 0, S(u) =XXk,0< u <. ku
We define also Sn(a, b) :=S(nb)S(na) =XXk,0a < b1 na<knb
and
Sn(t) =Sn(0, t) =XXk,0t1. knt
Dyadic increments statistics DI(n, ρ) depend on a weight functionρ: [0,1]R and are defined by
DI(n, ρ m 1) := 1jaloxgnρ(2j)rmaDxjkλr(Sn)k =211mjlaoxgnρ(12j)rmaDxjXXkX+Xk.(2) nr<knr nr<knr In this paper, we write “log” for the logarithm with basis 2 (log(2j) =j) and “ln” for the natural logarithm (ln(et) =t we assume that). Throughoutρ belongs to the following classR.
Definition 1.LetRbe the class of non decreasing functionsρ: [0,1]R, positive on (0,1], such thatρ(0) = 0 and satisfying
i) for some 0< α1/2, and some functionLwhich is normalized slowly varying at infinity,
ρ(h) =hαL(1/h),0< h1; ii)θ(t) :=t1/2ρ(1/t) isC1on [1,);
(3)
iii) there is aβ >1/2 and somea >0, such thatθ(t) lnβ(t) is non decreasing on [a,).
For a random elementXin a separable Banach spaceBsuch that for every fB0,Ef(X) = 0 andEf2(X)<, its covariance operatorQ=Q(X) is the linear bounded operator fromB0toBdefined byQf=Ef(X)X,fB0. A random elementXB(or covariance operatorQ) is said to bepregaussian if there exists a mean zero Gaussian random elementYBwith the same covariance operator asX all for, i.e.f, gB0,Ef(X)g(X) =Ef(Y)g(Y). Since the distribution of a centered Gaussian random element is defined by its covariance structure, we denote byYQa zero mean Gaussian random element with covariance operatorQ.
3
For any pregaussian covarianceQthere exists aB-valued Brownian motion WQwith parameterQ centered Gaussian process indexed by [0, i.e. a,1] with independent increments such thatWQ(t)WQ(s) has the same distribution as |ts|1/2YQ. A random elementXis said to satisfy the central limit theorem inB(denoted XCLT(B)) if the sequencen1/2(X1+∙ ∙ ∙+Xn) converges in distribution in B, whereX1, . . . , Xnare independent copies ofX. Necessarily then (see [7]),X is mean zero, pregaussian and satisfies the moment condition tlimtPkX1k> t1/2= 0.(4)
However the central limit theorem forXcannot be characterized in general in terms of the only integrability ofXbecause the geometry ofBis involved in the problem. To complete these preliminaries, we recall here the following classical esti-mate (see [7], (3.5) p.59) for the tail of a mean zero Gaussian random element YinB. P(kYk ≥u)4 exp8EkuY2k2, u >0.(5) In order to investigate the asymptotic behavior of the statistics DI(n, ρ) we consider a null hypothesis a little bit stronger thenH0. Namely, (H00):X1, . . . , Xn meanare i.i.d. zero pregaussian random elements inBwith covarianceQ.
Let (YQ, YQ,r, rD) be a collection of non degenerate independent mean zero Gaussian random elements with covarianceQ with. Setθas in Defini-tion 1 ii) DI(ρ, Q) := sjup2θ2(1j)tmaDjxkYQ,rk.(6) 1
Due to the definition of the classRit can be shown (see Theorem 13 below), that the random variable DI(ρ, Q) is well defined for anyρ∈ R. Theorem 2.If under(H00),X1CLT(B)and for everyA >0 limtPkX1k> Aθ(t)= 0,(7) t→∞
then
n1/2DI(n, ρ)D−→DI(ρ, Q). n→∞
(8)
Asθ(t) =t1/2ρ(1/t), Condition (7) is clearly stronger than (4) which is implicitly included in the assumption “X1CLT(B)”. Ifρ(h) =ρα(h) = hα, h[0,1], where 0< α <1/2, then Condition (7) reads PkX1k ≥t=otp(α)withp(α) =21α1.
4
In the case whereρ(h) =ρα,β(h) =h1/2lnβ(c/h), h[0,1] withβ >1/2, Condition (7) is equivalent to
Eexp{dkX1k1}<,for alld >0. For a discussion of the condition “X1CLT(B Let)”, we refer to [7]. us mention simply here that if the spaceBis either of type 2 or has Rosenthal’s property (e.g.Bis any separable Hilbert space,B=Lp(S, µ) with 2p <etc.) then Condition (7) yieldsX1CLT(B). IfBis of cotype 2 (e.g.B=Lp(S, µ),1p2), then “X1CLT(B)” follows fromX1being pregaussian. Due to the independence ofYQ,rthe limiting distribution functionFQ,ρ(u) of the dyadic increments statistic is completely specified by the distribution function ΦQ(u) :=PkYQ,1k ≤u2, u0. Namely, we have
Proposition 3.Ifρ∈ R, then the distribution ofDI(ρ, Q)is absolutely con-tinuous and its distribution functionFQ,ρis given by FQ,ρ(u) :=PDI(ρ, Q)u=YΦQθ(2j)u2j1, u0.(9) j=1
The convergence in (9) is uniform on any interval[ε,),ε >0.
For practical applications we sumarized in the next proposition some esti-mates of the tail of distribution functionFQ,ρ. Denote forJ0 J F(JQ,)ρ(u) =YΦQθ(2j)u2j1, u0 j=1
and X2jjθ22 cJ:=γin>f0nγ+ 8j=J+1θ2(2 ) expγ(4j)o. Proposition 4.For eachρ∈ R,FQ,ρsatisfies the following estimates.
i) For eachu >0 1FQ,ρ(u)4 exp8c0E|u|Y2Q||2.
ii) For eachJ1andu >0 14 exp8cJE||Yu2Q||2FQ(ρ,J)(u)FQ,ρ(u)FQ(,ρJ)(u).
5
(10)
(11)
(12)
We end this section by examining a consistency of rejecting (H00) versus the epidemic alternative (H0A) for large values of DI(n, ρ), where :={k+ 1, . . . , m} (H0A)Xk=(Xmc0k+Xk0fiifkkIIncn:={1, . . . , n} \In wheremc6= 0 may depend onnand theXk0’s satisfy (H00). =Theorem 5.Letρ∈ R. Under(H0A), writel:mkfor the length of epidemics and assume that ck nlimn1/2ρhn(khmn),wherehn:= minnln; 1nlo.(13) =
Then
n1/2DI(n, ρ)pr. n→∞
To discuss Condition (13), consider for simplicity the case wheremcdoes not depend onn. Whenρ(h) =hα, (13) allows us to detectshort epidemicssuch thatl=o(n) andlnδ→ ∞, whereδ= (12α)(22α)1. Symmetrically one can detectlong epidemicssuch thatnl=o(n) and (nl)nδ→ ∞. Whenρ(h) =h1/2lnβ(c/h) withβ >1/2, (13) is satisfied provided that hn=n1lnγn, withγ >2β. This leads to detection of short epidemics such thatl=o(n) andllnγn→ ∞as well as of long ones verifyingnl=o(n) and (nl) lnγn→ ∞.
3 Examples
3.1 Testing change of distribution function
As an example of applications of Theorem 2, here we consider change-point problem for distribution function of a random sample inRunder epidemic alternative. LetZ1, . . . , Znbe real valued random variables with distribution functionsF1, . . . , Fnrespectively. Consider the null hypothesis (H0) :F1=∙ ∙ ∙=Fn=F and the following epidemic alternative: (HA):there are integers1< k< m< nsuch that F1=F2=∙ ∙ ∙=Fk=Fm+1=∙ ∙ ∙=Fn, Fk+1=∙ ∙ ∙=FmandFk6=Fk+1. Tests constructed in Lombard [9] and Gombay [5] are based on rank statistics. Theorem 2 suggests tests based on the dyadic increments of empirical process. For simplicity we consider only the case of continuous functionF we. Then can restrict to the uniform empirical process built on the sample (U1, . . . , Un), whereUk=F(Zk).
6
Define κn(s, t) :=X1{Uit}t t, s,[0,1]. ins ForrDj,j1, define random functionsλr(κn) by λr(κn)(t) :=κn(r, t)21κn(r, t) +κn(r+, t), t[0,1] which for practical computations may be conveniently recast as 2λr(κn)(t) =X1{Uit}tX1{Uit}t, t[0,1]. nr<inr nr<inr+
3.1.1 Cramer-von Mises type DI statistics ´
TheCram´er-vonMisestypedyadicincrementsstatisticsaredenedby 2 CMDI(n, ρ) =1jmaloxgnρ2(21j)rmaDxjλr(κn)2,
(14)
where λr(κn)22Z10 :=λr(κn)(t)2dt. Its limiting distribution is completely defined by the limiting distribution of the classicalCram´er-vonMisesstatistic,namely,bythedistributionfunction L2(u) =PZ10B2(t) dtu, u0,
whereB(t), t[0, The distribution function1] is a standard Brownian bridge. L2(uis well known and several of its representations are available (see [13]).) Theorem 6.Ifρ∈ Rthen nlimPn1CMDI(n, ρ)u=L2(u), for eachu >0, where
1 L2(u) =YhL22θ2(2j)ui2j. j=1
Proof.The result is a straightforward application of Theorem 2 to the random elementsX1, . . . , Xnwith values in the spaceL2(0,1) defined by Xk(t) =1{Ukt}t, t[0,1], k= 1, . . . , n.(15) It is well known that these random elements satisfy the central limit theorem with Brownian bridge as the limiting Gaussian element. The moment condition (7) is fulfiled since theXks are bounded.
7
For practical uses, the following estimate of the tails ofL2can be helpfull. Define foru >0 J L(2J,ρ)(u) =YhL22θ2(2j)ui2j1. j=1
Proposition 7.
i) Ifu >0then 1L2(u)4 exp4c30u. where the constantc0is defined by (10). ii) For eachu >0andJ1 h14 exp4c3JuiL(2J,ρ)(u)L2(u)L(2ρ,J)(u). Proof.The result is a straightforward adaptation of Proposition 4, withYQ=B, a standard Brownian bridge, using the elementary fact thatEkBk22= 1/6. 3.1.2 Kolmogorov-Smirnov type DI statistics
The Kolmogorov-Smirnov type dyadic increments statistics are defined by KSDI(n, ρ) =1jmaloxgnρ(21j)rmaDxjλr(κn), where := supλr(κn)(t). λr(κn)0t1 LetL(u) be the limiting distribution of the classical Kolmogorov-Smirnov statistic, L(u) =P0mtax1|B(t)| ≤u, u0. This distribution function is well known and has several representations (see [13]).
Theorem 8.Ifρ∈ Rthen nlimPn1/2KSDI(n, ρ)u=L(u), for eachu >0, where
i2j1. L(u) =YhL2θ(2j)uj=1
The proof of this theorem is not a straightforward corollary of Theorem 2 and is postponed to Subsection 4.5 below.
8
Voir icon more
Alternate Text