Liver and Bile
JHEP Rep. 2025;7(2):101198
Deep learning helps discriminate between autoimmune hepatitis and primary biliary cholangitis
Background and aims: Biliary abnormalities in autoimmune hepatitis (AIH) and interface hepatitis in primary biliary cholangitis (PBC) occur frequently, and misinterpretation may lead to therapeutic mistakes with a negative impact on patients. This study investigates the use of a deep learning (DL)-based pipeline for the diagnosis of AIH and PBC to aid differential diagnosis.
Methods: A multicenter study across 6 European referral centers was conducted, and a library of digitized liver biopsy slides dating from 1997–2023 was built. A training set of 354 cases (266 AIH and 102 PBC) and an external validation set of 92 cases (62 AIH and 30 PBC) were available for analysis. A novel DL model, the autoimmune liver neural estimator (ALNE), was trained on whole-slide images (WSIs) with H&E staining, without human annotations. The ALNE model was evaluated against clinico-pathological diagnoses and tested for interobserver variability among general pathologists.
Results: The ALNE model demonstrated high accuracy in differentiating AIH from PBC, achieving an area under the receiver operating characteristic curve of 0.81 in external validation. Attention heatmaps showed that ALNE tends to focus more on areas with increased inflammation, associating such patterns predominantly with AIH. A multivariate explainable ML model revealed that PBC cases misclassified as AIH more often had alkaline phosphatase values between 1 × upper limit of normal (ULN) and 2 × ULN, coupled with AST values above 1 × ULN. Inconsistency among general pathologists was noticed when evaluating a random sample of the same cases (Fleiss’s kappa value 0.09).
Conclusions: The autoimmune liver neural estimator (ALNE) model is the first system generating a quantitative and accurate differential diagnosis between cases with autoimmune hepatitis or primary biliary cholangitis.