Guo Renzhong, Zhang Kequan. Analysis and Comparison of the Techniques for Treating Correlativity Between Variables in Q-mode Cluster Analysis[J]. Geomatics and Information Science of Wuhan University, 1987, 12(3): 64-78.
Citation: Guo Renzhong, Zhang Kequan. Analysis and Comparison of the Techniques for Treating Correlativity Between Variables in Q-mode Cluster Analysis[J]. Geomatics and Information Science of Wuhan University, 1987, 12(3): 64-78.

Analysis and Comparison of the Techniques for Treating Correlativity Between Variables in Q-mode Cluster Analysis

More Information
  • Received Date: October 31, 1986
  • Published Date: March 04, 1987
  • The Euclidean distance,which is most often used in Q-mode cluster analysis,is unable to reflect the influence of correlativity between variables on the results of cluster analysis,so it cannot fully reveal the clustering situation of samples.For this reason three techniques are proposed in a lot of literature to solve this problem;they are.1.the oblique distance method,2.the principle component analysis method,and 3.the Mahalanobis distance method.But there has been no paper researching on the characteristics of and the relationship between these methods.In practice,these methods are chosen quite at random and without rules.This paper,in both theory and practice,analyses and compares the three methods,and gives the following conclusions:1.the oblique distance method functions the same as the principle component analysis method,because (Xij-Xkj)(Xil-Xkl)rjl=d'UΛ1/2Λ1/2U'd,in the above equation,the left part is the expression for computing oblique distance,and the right part is the Euclidean distance from the principle component analysis method.2.the Mahalanobis distance is equal to the Euclidean distance computed from factor scores;this is because,after data standardization,(P1-Pk)'S-1(Pi-Pk)=(Pi-Pk)'R-1(Pi-Pk)=(Pi-Pk)'UΛ-1/2Λ-1/2U'(Pi-Pk), in this equation,the first part is the expression for computing mahalanobis distance,and the last part is the exprssion for computing Euclidean distance from factor scofes.3.the Mahalanobis distance and the oblique distance deal with correlativity in two opposite ways.Generally speaking,the Mahalanobis distance gives correlated variables smaller weight values,and the oblique distance gives them larger weight values,while the Euclidean distance gives equal weights to all variables.If any two samples are located along the direction of major axis of the distribution ellipse,the Mahalanobis distance between them is smaller than the Euclidean distance,while the oblique distance is larger than the Euclidean distance,if located along the direction of minor axis,it will be an opposite conclusion.The three conclusions stated above are instructive in choosing statistics for clustering in practical work,and helpful in avoiding blindness,and studying the computational results of cluster analysis.
  • Related Articles

    [1]WANG Dingpan, DONG Xiaohuan, HUANG Lingyong, WANG Xiaohua, LI Qingjun, JI Shunping. Information Entropy Uncertainty Estimation Based Domain Adaptation for Land Cover Classification from Multi-source Remote Sensing Images[J]. Geomatics and Information Science of Wuhan University, 2024, 49(10): 1940-1952. DOI: 10.13203/j.whugis20220346
    [2]LIN Dongfang, YAO Yibin, LONG Sichun, XIE Jian. The Regularization Method for Forest Height Inversion by Integrating MultiBaseline PolInSAR Observation Information[J]. Geomatics and Information Science of Wuhan University. DOI: 10.13203/j.whugis20240275
    [3]MA Jun, CAO Cheng-du, JIANG Wei-ping, ZHOU Lü. Elimination of Colored Noise in GNSS Station Coordinate Time Series by Using Wavelet Packet Coefficient Information Entropy[J]. Geomatics and Information Science of Wuhan University, 2021, 46(9): 1309-1317. DOI: 10.13203/j.whugis20190353
    [4]LIU Wanzeng, LU Chenni, HUO Liang, WU Chenchen, ZHAO Tingting, ZHU Xiuli. Selection Method of Residential Point Features Constrained by Optimal Information Entropy[J]. Geomatics and Information Science of Wuhan University, 2021, 46(8): 1178-1185. DOI: 10.13203/j.whugis20190305
    [5]SUN Wenzhou, YIN Xiaodong, LI Shujun. A New Navigation Data Fusion Method Based on Entropy Coefficient Algorithm for Underwater Vehicles[J]. Geomatics and Information Science of Wuhan University, 2018, 43(10): 1465-1471. DOI: 10.13203/j.whugis20160550
    [6]CUI Shengcheng, ZHU Wenyue, YANG Shizhi, LI Xuebin. Regularization-Based Retrieval Method for Surface Reflective Property Parameters[J]. Geomatics and Information Science of Wuhan University, 2018, 43(8): 1264-1270. DOI: 10.13203/j.whugis20160244
    [7]SONG Huanhuan, WANG Shuzong. Application of Information Entropy Theory to Analyzing Tasks Scheduling for Submarine Combat System[J]. Geomatics and Information Science of Wuhan University, 2012, 37(12): 1477-1481.
    [8]JIANG Wanshou, PENG Fangyuan, YUE Chunyu, WANG Xiao. Multi-source Remote-sensing Image Matching Based on Ratio-Gradient and Cross-Cumulative Residual Entropy[J]. Geomatics and Information Science of Wuhan University, 2009, 34(9): 1047-1050.
    [9]ZHAO Hongrui, TANG Zhongshi, LI Xiaowen. A Regularization Parameter Choice Method on Nonlinear Ill-posed Quantitative Remote Sensing Inversion[J]. Geomatics and Information Science of Wuhan University, 2008, 33(6): 577-580.
    [10]ZHU Changqing, ZHANG Guoqin WANG Guangxia, . An Error Entropy Model for GIS Spatial Linear[J]. Geomatics and Information Science of Wuhan University, 2005, 30(5): 405-407.

Catalog

    Article views (910) PDF downloads (243) Cited by()
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return