|
Penalized Trinomial Logit Models Distinguish Clinical Stages in Patients with Hepatitis
HU Xuemei, YANG Junwen
Acta Mathematicae Applicatae Sinica
2024, 47 (1):
154-173.
DOI: 10.20142/j.cnki.amas.202401008
Viral hepatitis C (simply referred to as hepatitis C) is a form of viral hepatitis caused by infection with the hepatitis C Virus (HCV). HCV will cause chronic inflammation, necrosis, and fibrosis of the liver, some patients may develop cirrhosis and hepatocellular carcinoma (HCC). In this paper we take advantage of the hepatitis data set to construct penalized trinomial logit models to diagnose the disease stages of patients. Firstly, we select 12 physiological indicators of patients as a predictor vector, and choose 3 disease stages of hepatitis C as the response variable. Secondly, we apply the 70% data as the training set to learn LASSO/Ridge/ENet penalized trinomial logit model, and take advantage of the coordinate descent algorithm to complete variable selection and obtain parameter estimations. Thirdly, we apply the remaining 30% data as the testing set, and combine three-class confusion matrix, the ROC (receiver operating characteristic) surface, HUM (hypervolume under the ROC manifold), PDI(polytomous discrimination index) to assess the prediction accuracy to disease stages. Finally, we introduce some machine learning methods such as artificial neural network (ANN), support vector machine (SVM) and random forest (RF)to compare with the proposed penalized trinomial logit models, and found that penalized trinomial logit models possess the best three-class prediction performance. They can not only improve the diagnostic accuracy to disease stages, but also reduce the cost of hepatitis C detection.
Reference |
Related Articles |
Metrics
|
|