Coefficient of intrinsic dependence: a new measure of association

dc.contributorHsing, Tailen
dc.creatorLiu, Li-yu Daisy
dc.date.accessioned2005-08-29T14:40:39Z
dc.date.accessioned2017-04-07T19:50:14Z
dc.date.available2005-08-29T14:40:39Z
dc.date.available2017-04-07T19:50:14Z
dc.date.created2005-05
dc.date.issued2005-08-29
dc.description.abstractTo detect dependence among variables is an essential task in many scientific investigations. In this study we propose a new measure of association, the coefficient of intrinsic dependence (CID), which takes value in [0,1] and faithfully reflects the full range of dependence for two random variables. The CID is free of distributional and functional assumptions. It can be easily implemented and extended to multivariate situations. Traditionally, the correlation coefficient is the preferred measure of association. However, it's effectiveness is considerably compromised when the random variables are not normally distributed. Besides, the interpretation of the correlation coefficient is difficult when the data are categorical. By contrast, the CID is free of these problems. In our simulation studies, we find that the ability of the CID in differentiating different levels of dependence remains robust across different data types (categorical or continuous) and model features (linear or curvilinear). Also, the CID is particularly effective when the dependence is strong, making it a powerful tool for variable selection. As an illustration, the CID is applied to variable selection in two aspects: classification and prediction. The analysis of actual data from a study of breast cancer gene expression is included. For the classification problem, we identify a pair of genes that best classify a patient's prognosis signature, and for the prediction problem, we identify a pair of genes that best relates to the expression of a specific gene.
dc.identifier.urihttp://hdl.handle.net/1969.1/2397
dc.language.isoen_US
dc.publisherTexas A&M University
dc.subjectMeasure of Association
dc.subjectDependence
dc.subjectCorrelation
dc.subjectVariable Selection
dc.titleCoefficient of intrinsic dependence: a new measure of association
dc.typeBook
dc.typeThesis

Files