开发专为乳腺癌 HER2 检测而定制的深度学习模型,以帮助病理学家解释 HER2 低的病例。
Development of a deep-learning model tailored for HER2 detection in breast cancer to aid pathologists in interpreting HER2-low cases.
发表日期:2024 Jul 14
作者:
Pierre-Antoine Bannier, Glenn Broeckx, Loïc Herpin, Rémy Dubois, Lydwine Van Praet, Charles Maussion, Frederik Deman, Ellen Amonoo, Anca Mera, Jasmine Timbres, Cheryl Gillett, Elinor Sawyer, Patrycja Gazińska, Piotr Ziolkowski, Magali Lacroix-Triki, Roberto Salgado, Sheeba Irshad
来源:
HISTOPATHOLOGY
摘要:
超过 50% 的乳腺癌病例属于“人表皮生长因子受体 2 (HER2) 低乳腺癌 (BC)”,其特征是 HER2 免疫组织化学 (IHC) 评分为 1 或 2,且荧光原位杂交 (FISH) 测试无扩增。用于治疗 HER2 低乳腺癌的新型抗 HER2 抗体药物偶联物 (ADC) 的开发说明了准确评估 HER2 状态,特别是 HER2 低乳腺癌的重要性。在本研究中,我们评估了用于评估 HER2 的深度学习 (DL) 模型的性能,包括对病理学家与 DL 模型之间 HER2-Null 不一致的原因进行评估。我们特别注重将 DL 模型规则与 ASCO/CAP 指南保持一致,包括染色细胞的染色强度和膜染色的完整性。我们在具有 HER2-IHC 评分的多中心乳腺癌病例队列中训练了 DL 模型 (n = 299 )。该模型在两个独立的多中心验证队列(n = 369 和 n = 92)上进行了验证,所有病例均由三名资深乳腺病理学家进行审查。所有病例均经过三名资深乳腺病理学家的彻底审查,基本事实由病理学家对最终 HER2 评分的多数共识确定。在该研究的整个训练和验证阶段总共使用了 760 个乳腺癌病例。该模型与真实情况的一致性 (ICC = 0.77 [0.68-0.83]; Fisher P = 1.32e-10) 高于三位资深病理学家的平均一致性 (ICC = 0.45 [0.17-0.65]; Fisher P = 2e -3)。在两个验证队列中,DL 模型分别识别出 95% [93% - 98%] 和 97% [91% - 100%] 的 HER2 低肿瘤和 HER2 阳性肿瘤。不一致的结果以形态学特征为特征,例如扩展的纤维化、大量肿瘤浸润淋巴细胞和坏死,而一些人为因素,例如肿瘤细胞细胞质中的非特异性背景细胞质染色也会导致差异。深度学习可以支持病理学家的解释困难的 HER2 低病例。形态学变量和一些特定的人为因素可能会导致病理学家和 DL 模型之间的 HER2 分数出现差异。© 2024 John Wiley
Over 50% of breast cancer cases are "Human epidermal growth factor receptor 2 (HER2) low breast cancer (BC)", characterized by HER2 immunohistochemistry (IHC) scores of 1+ or 2+ alongside no amplification on fluorescence in situ hybridization (FISH) testing. The development of new anti-HER2 antibody-drug conjugates (ADCs) for treating HER2-low breast cancers illustrates the importance of accurately assessing HER2 status, particularly HER2-low breast cancer. In this study we evaluated the performance of a deep-learning (DL) model for the assessment of HER2, including an assessment of the causes of discordances of HER2-Null between a pathologist and the DL model. We specifically focussed on aligning the DL model rules with the ASCO/CAP guidelines, including stained cells' staining intensity and completeness of membrane staining.We trained a DL model on a multicentric cohort of breast cancer cases with HER2-IHC scores (n = 299). The model was validated on two independent multicentric validation cohorts (n = 369 and n = 92), with all cases reviewed by three senior breast pathologists. All cases underwent a thorough review by three senior breast pathologists, with the ground truth determined by a majority consensus on the final HER2 score among the pathologists. In total, 760 breast cancer cases were utilized throughout the training and validation phases of the study. The model's concordance with the ground truth (ICC = 0.77 [0.68-0.83]; Fisher P = 1.32e-10) is higher than the average agreement among the three senior pathologists (ICC = 0.45 [0.17-0.65]; Fisher P = 2e-3). In the two validation cohorts, the DL model identifies 95% [93% - 98%] and 97% [91% - 100%] of HER2-low and HER2-positive tumours, respectively. Discordant results were characterized by morphological features such as extended fibrosis, a high number of tumour-infiltrating lymphocytes, and necrosis, whilst some artefacts such as nonspecific background cytoplasmic stain in the cytoplasm of tumour cells also cause discrepancy.Deep learning can support pathologists' interpretation of difficult HER2-low cases. Morphological variables and some specific artefacts can cause discrepant HER2-scores between the pathologist and the DL model.© 2024 John Wiley & Sons Ltd.