Primary Biliary Cirrhosis (PBC) Data

Well-known data set from Appendix D of Fleming and Harrington (1991) taken from Terry Therneau's website. We have copied the data set and here is SAS code to read in the data and run a standard Cox regression. Note that in the SAS code we eliminate cases past 312 and recode status=1 (liver transplant) to be status=0 (alive, i.e., censored). There are only 276 complete cases when running a Cox regression with all covariates. From Terry Therneau's website, the variables in the data set are

id       = case number
futime   = number of days between registration and the earlier of death,
           transplantion, or study analysis time in July, 1986
status   = 0=alive, 1=liver transplant, 2=dead
drug     = 1= D-penicillamine, 2=placebo
age      = age in days
sex      = 0=male, 1=female
ascites  = presence of ascites: 0=no 1=yes
hepato   = presence of hepatomegaly 0=no 1=yes
spiders  = presence of spiders 0=no 1=yes
edema    = presence of edema 0=no edema and no diuretic therapy for edema;
          .5 = edema present without diuretics, or edema resolved by diuretics;
           1 = edema despite diuretic therapy
bili     = serum bilirubin in mg/dl
chol     = serum cholesterol in mg/dl
albumin  = albumin in gm/dl
copper   = urine copper in ug/day
alk_phos = alkaline phosphatase in U/liter
sgot     = SGOT in U/ml
trig     = triglicerides in mg/dl
platelet = platelets per cubic ml/1000
protime  = prothrombin time in seconds
stage    = histologic stage of disease