Banner Banner

Detection and Analysis of Cerebral Aneurysms based on X-ray Rotational Angiography – The CADA 2020 Challenge

Matthias Ivantsits
Leonid Goubergrits
Jan-Martin Kuhnigk
Markus Huellebrand
Jan Brning
Tabea Kossen
Boris Pfahringer
Jens Schaller
Andreas Spuler
Titus Kuehne
Yizhuan Jia
Xuesong Li
Suprosanna Shit
Bjoern Menze
Ziyu Su
Jun Ma
Ziwei Nie
Kartik Jain
Yanfei Liu
Yi Lin
Anja Hennemuth

December 16, 2021

The Cerebral Aneurysm Detection and Analysis (CADA) challenge was organized to support the development and benchmarking of algorithms for detecting, analyzing, and risk assessment of cerebral aneurysms in X-ray rotational angiography (3DRA) images. 109 anonymized 3DRA datasets were provided for training, and 22 additional datasets were used to test the algorithmic solutions. Cerebral aneurysm detection was assessed using the F2 score based on recall and precision, and the fit of the delivered bounding box was assessed using the distance to the aneurysm. The segmentation quality was measured using the Jaccard index and a combination of different surface distance measures. Systematic errors were analyzed using volume correlation and bias. Rupture risk assessment was evaluated using the F2 score. 158 participants from 22 countries registered for the CADA challenge. The U-Net-based detection solutions presented by the community show similar accuracy compared to experts (F2 score 0.92), with a small number of missed aneurysms with diameters smaller than 3.5 mm. In addition, the delineation of these structures, based on U-Net variations, is excellent, with a Jaccard score of 0.92. The rupture risk estimation methods achieved an F2 score of 0.71. The performance of the detection and segmentation solutions is equivalent to that of human experts. The best results are obtained in rupture risk estimation by combining different image-based, morphological, and computational fluid dynamic parameters using machine learning methods. Furthermore, we evaluated the best methods pipeline, from detecting and delineating the vessel dilations to estimating the risk of rupture. The chain of these methods achieves an F2-score of 0.70, which is comparable to applying the risk prediction to the ground-truth delineation (0.71).