359. Somers' D Concordance Statistic
(2016-03-20 12:58:18)
Tags: somer's d, ordinal, coarse-classifying
Category: Statistics Sharing
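Somers' D measures the association between two ordinal variables. In SAS, PROC FREQ reports this statistic when the MEASURES option is given on the TABLES statement. You can calculate it either from the formula or by simulation. The example here assumes a 2 X 3 contingency table: good and bad counts for three residential-status classes.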
data have;
  input x :$12. good bad;
  pg = good / (good + bad);  /* proportion of goods in each class */
datalines;
owner 6000 300
renter 1950 540
others 1050 160
;
run;
proc rank data=have out=have;
data have2;
proc freq data=have2;
data formula;
data simulation;
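The bodies of the steps above are not shown here. As a minimal sketch of the PROC FREQ route only, the table can be entered in long form with one row per cell and passed to PROC FREQ with a WEIGHT statement. The data set name cells and the variables cls, y and n are assumptions made for this illustration, and cls is simply the rank of each class by its good rate pg (renter lowest, owner highest).

```sas
/* Hypothetical long-format copy of the table: one row per cell.         */
/* cls = rank of the class by good rate: 1 = renter, 2 = others, 3 = owner */
data cells;
  input cls y n;   /* y = 1 for good, 0 for bad; n = cell count */
datalines;
3 1 6000
3 0 300
1 1 1950
1 0 540
2 1 1050
2 0 160
;
run;

proc freq data=cells;
  weight n;                    /* treat n as the cell frequency          */
  tables cls * y / measures;   /* MEASURES prints Somers' D C|R and R|C  */
run;
```

With the class as the row variable and the outcome as the column variable, the figure that matches the good-versus-bad comparison described in this post should be the one labeled Somers' D R|C, since it conditions on pairs that differ in outcome.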
In the simulation, the concordance statistic describes the chance that, if one picks a good at random from the goods and a bad at random from the bads, the bad's attribute, xb, will be in a lower class than the good's attribute, xg. The higher this probability, the better the ordering of the characteristic's classes reflects the good-bad split in the population. Somers' D is this probability minus the probability of the reverse ordering (xb in a higher class than xg); pairs that fall in the same class count toward neither.
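One way such a simulation could look is sketched below. It is not the original simulation step, whose body is not shown: the seed, the number of draws nsim, and the counters conc and disc are assumptions chosen for illustration. Classes are coded 1 = renter, 2 = others, 3 = owner, that is, in increasing order of good rate.

```sas
data simulation;
  call streaminit(2016);   /* arbitrary seed, only for reproducibility */
  nsim = 1e6;              /* assumed number of random good/bad pairs  */
  do i = 1 to nsim;
    /* class of a random good: 1950, 1050, 6000 out of 9000 goods */
    xg = rand('table', 1950/9000, 1050/9000, 6000/9000);
    /* class of a random bad: 540, 160, 300 out of 1000 bads */
    xb = rand('table', 540/1000, 160/1000, 300/1000);
    conc + (xb < xg);      /* bad sits in a lower class than the good  */
    disc + (xb > xg);      /* bad sits in a higher class than the good */
  end;
  d = (conc - disc) / nsim;   /* simulated Somers' D; ties add nothing */
  put d=;
  keep nsim conc disc d;
run;
```

Because the draws are random, d changes with the seed and with nsim; with a large number of pairs it should settle close to the value given by the formula.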
From the simulation, D = 0.39502673, compared with an exact value of 0.395 from the formula.
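The formula route can be checked by hand for this table. The working below is a sketch that assumes the classes are ordered by good rate (renter, then others, then owner) and uses the totals of 9000 goods and 1000 bads; the symbols P_c and P_d, for the concordant and discordant probabilities, are introduced here only for illustration.

```latex
\begin{aligned}
P_c &= \frac{540\,(1050 + 6000) + 160 \cdot 6000}{9000 \cdot 1000}
     = \frac{4{,}767{,}000}{9{,}000{,}000} \approx 0.5297, \\
P_d &= \frac{160 \cdot 1950 + 300\,(1950 + 1050)}{9000 \cdot 1000}
     = \frac{1{,}212{,}000}{9{,}000{,}000} \approx 0.1347, \\
D   &= P_c - P_d = \frac{3{,}555{,}000}{9{,}000{,}000} = 0.395 .
\end{aligned}
```

In P_c each bad is paired with the goods in strictly higher classes, and in P_d with the goods in strictly lower classes; the remaining 3,021,000 pairs fall in the same class and contribute to neither term.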
When coarse-classifying a characteristic for scorecard development, a higher value of Somers' D indicates a more definitive split between goods and bads.