ABSTRACT
Despite genomic sequencing rapidly transforming from being a bench-side tool to a routine procedure in a hospital, there is a noticeable lack of genomic analysis software that supports both clinical and research workflows as well as crowdsourcing. Furthermore, most existing software packages are not forward-compatible in regards to supporting ever-changing diagnostic rules adopted by the genetics community. Regular updates of genomics databases pose challenges for reproducible and traceable automated genetic diagnostics tools. Lastly, most of the software tools score low on explainability amongst clinicians. We have created a fully open-source variant curation tool, AnFiSA, with the intention to invite and accept contributions from clinicians, researchers, and professional software developers. The design of AnFiSA addresses the aforementioned issues via the following architectural principles: using a multidimensional database management system (DBMS) for genomic data to address reproducibility, curated decision trees adaptable to changing clinical rules, and a crowdsourcing-friendly interface to address difficult-to-diagnose cases. We discuss how we have chosen our technology stack and describe the design and implementation of the software. Finally, we show in detail how selected workflows can be implemented using the current version of AnFiSA by a medical geneticist.
Subject(s)
Genomics , Software , Computational Biology/methods , Database Management Systems , Databases, Genetic , Genomics/methods , Reproducibility of Results , WorkflowABSTRACT
BACKGROUND: Array comparative genomic hybridisation (CGH) is a powerful method for the genetic analysis of lesional and normal tissues to identify genomic imbalances associated with malignancies. However, the use of this technique with DNA extracted from archival formalin fixed, paraffin embedded (FFPE) tissue specimens, the most widely available resource for retrospective studies, is subject to quantitative and qualitative limitations. In this report, the suitability and integrity of the DNA extracted from FFPE MCF7 breast cancer cells fixed for different periods of time for array CGH applications were examined. RESULTS: Using our established cDNA microarray protocol in conjunction with whole genome amplification methods, the genetic profiles of freshly harvested MCF7 cells and their matched FFPE counterparts were analysed. Congruent profiles between FFPE MCF7 cells and their fresh counterpart and between amplified and non-amplified FFPE MCF7 cells were observed. Our results demonstrate that formalin fixation of <20 hours has no significant adverse effect on the integrity of DNA for array CGH studies. CONCLUSIONS: Our findings attest to the fidelity of our array CGH methods to effectively examine material recovered from FFPE tissue specimens for microarray applications. This in turn has great potential to identify novel diagnostic and prognostic markers for human disease.