Semiparametric estimation of a mixture of two linear regressions in which one component is known

Laurent Bordes; Ivan Kojadinovic; Pierre Vandekerkhove

Pré-Publication, Document De Travail Année : 2013

Semiparametric estimation of a mixture of two linear regressions in which one component is known

(1) , (1) , (2)

1
2

Laurent Bordes

Fonction : Auteur
PersonId : 170008
IdHAL : laurent-bordes
IdRef : 080088546

Laboratoire de Mathématiques et de leurs Applications [Pau]

Ivan Kojadinovic

Fonction : Auteur
PersonId : 171673
IdHAL : ivan-kojadinovic
ORCID : 0000-0002-2903-1543
IdRef : 069495424

Laboratoire de Mathématiques et de leurs Applications [Pau]

Pierre Vandekerkhove

Fonction : Auteur
PersonId : 832223
IdHAL : pierre-vandekerkhove
ORCID : 0000-0003-3907-7657
IdRef : 272731641

Probabilité et Statistique

Résumé

A new estimation method for the two-component mixture model introduced in \cite{Van12} is proposed. This model, which consists of a two-component mixture of linear regressions in which one component is entirely known while the proportion, the slope, the intercept and the error distribution of the other component are unknown, seems to be of interest for the analysis of large datasets produced from two-color ChIP-chip high-density microarrays. In spite of good performance for datasets of reasonable size, the method proposed in \cite{Van12} suffers from a serious drawback when the sample size becomes large, as it is based on the optimization of a contrast function whose pointwise computation requires $O(n^2)$ operations. The range of applicability of the method derived in this work is substantially larger as it is based on a method-of-moment estimator whose computation only requires $O(n)$ operations. From a theoretical perspective, the asymptotic normality of both the estimator of the Euclidean parameter vector and of the semiparametric estimator of the c.d.f.\ of the error is proved under weak conditions not involving the zero-symmetry assumption typically used this last decade. The finite-sample performance of the latter estimators is studied under various scenarios through Monte Carlo experiments. From a more practical perspective, the proposed method is applied to the tone data analyzed, among others, by \cite{HunYou12}, and to the ChIPmix data studied by \cite{MarMarBer08}. An extension of the considered model involving an unknown scale parameter for the first component is discussed in the final section.

Domaines

Statistiques [math.ST] Théorie [stat.TH]

Fichier principal

mr.pdf (763.41 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Ivan Kojadinovic : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00796198

Soumis le : vendredi 1 mars 2013-19:35:19

Dernière modification le : mardi 30 avril 2024-16:27:11

Archivage à long terme le : dimanche 2 juin 2013-04:01:51

Dates et versions

hal-00796198 , version 1 (01-03-2013)

Identifiants

HAL Id : hal-00796198 , version 1

Citer

Laurent Bordes, Ivan Kojadinovic, Pierre Vandekerkhove. Semiparametric estimation of a mixture of two linear regressions in which one component is known. 2013. ⟨hal-00796198⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-PAU UNIV-MLV LMA-PAU INSMI LAMA_UMR8050 CV_LAMA_UMR8050 LAMA_PS UPEC UNIV-EIFFEL

108 Consultations

54 Téléchargements

Semiparametric estimation of a mixture of two linear regressions in which one component is known

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager