ABSTRACT
The success of deep learning in identifying complex patterns exceeding human intuition comes at the cost of interpretability. Non-linear entanglement of image features makes deep learning a "black box" lacking human meaningful explanations for the models' decision. We present DISCOVER, a generative model designed to discover the underlying visual properties driving image-based classification models. DISCOVER learns disentangled latent representations, where each latent feature encodes a unique classification-driving visual property. This design enables "human-in-the-loop" interpretation by generating disentangled exaggerated counterfactual explanations. We apply DISCOVER to interpret classification of in vitro fertilization embryo morphology quality. We quantitatively and systematically confirm the interpretation of known embryo properties, discover properties without previous explicit measurements, and quantitatively determine and empirically verify the classification decision of specific embryo instances. We show that DISCOVER provides human-interpretable understanding of "black box" classification models, proposes hypotheses to decipher underlying biomedical mechanisms, and provides transparency for the classification of individual predictions.
Subject(s)
Deep Learning , Fertilization in Vitro , Humans , Fertilization in Vitro/methods , Image Processing, Computer-Assisted/methods , Embryo, Mammalian , FemaleABSTRACT
High-content time-lapse embryo imaging assessed by machine learning is revolutionizing the field of in vitro fertilization (IVF). However, the vast majority of IVF embryos are not transferred to the uterus, and these masses of embryos with unknown implantation outcomes are ignored in current efforts that aim to predict implantation. Here, whether, and to what extent the information encoded within "sibling" embryos from the same IVF cohort contributes to the performance of machine learning-based implantation prediction is explored. First, it is shown that the implantation outcome is correlated with attributes derived from the cohort siblings. Second, it is demonstrated that this unlabeled data boosts implantation prediction performance. Third, the cohort properties driving embryo prediction, especially those that rescued erroneous predictions, are characterized. The results suggest that predictive models for embryo implantation can benefit from the overlooked, widely available unlabeled data of sibling embryos by reducing the inherent noise of the individual transferred embryo.