Bioinformatics is the field of computer science used to analyze whole-genome sequencing data. It covers the development of algorithms, pipelines and software, as well as the analysis, transfer, and storage (including database development) of genomic data.
A typical whole-genome sequencing workflow contains the following steps:
- quality control and data grooming;
- genome assembly and/or variant calling; and
- post-assembly analysis.
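As an illustration of the first step only, the following minimal Python sketch filters sequencing reads by average base quality. It assumes reads are stored in the common four-line FASTQ layout with Phred+33 quality encoding. The function names (`parse_fastq`, `mean_phred`, `quality_filter`), the threshold of 20, and the two example reads are hypothetical choices made for this sketch, not part of any particular pipeline.

```python
from io import StringIO


def parse_fastq(handle):
    """Yield (read_id, sequence, quality_string) from a four-line-per-record FASTQ stream."""
    while True:
        header = handle.readline().strip()
        if not header:  # end of file (or a blank line) stops parsing
            break
        sequence = handle.readline().strip()
        handle.readline()  # skip the '+' separator line
        quality = handle.readline().strip()
        yield header[1:], sequence, quality  # drop the leading '@'


def mean_phred(quality, offset=33):
    """Average Phred score of a quality string (Phred+33 encoding is assumed)."""
    return sum(ord(char) - offset for char in quality) / len(quality)


def quality_filter(records, min_mean_quality=20.0):
    """Keep reads whose mean quality meets a threshold (20 is an illustrative value)."""
    for read_id, sequence, quality in records:
        if quality and mean_phred(quality) >= min_mean_quality:
            yield read_id, sequence, quality


# Two made-up reads: 'I' encodes Phred 40 (high quality), '#' encodes Phred 2 (low quality).
EXAMPLE_FASTQ = "@read1\nACGTACGT\n+\nIIIIIIII\n@read2\nACGTACGT\n+\n########\n"

kept = list(quality_filter(parse_fastq(StringIO(EXAMPLE_FASTQ))))
print([read_id for read_id, _, _ in kept])  # prints ['read1']
```

In practice this step is usually handled by dedicated quality-control tools rather than hand-written code. The sketch is only meant to show what "quality control and data grooming" does to the raw reads before assembly or variant calling.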
Next-generation sequencing platforms produce a massive volume of data. The data collected pertain not only to the DNA sequencing results but also to sequencing performance, which assists with the detection of errors or repetitive sequencing. This creates data management and storage challenges. In addition, specialized software and fast computing systems are required to process data at this scale. Specialized, trained bioinformaticists are essential to the analysis of data generated by next-generation sequencing, as well as to the continued success and growth of precision medicine.
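To give a rough sense of scale, the back-of-the-envelope sketch below estimates the uncompressed size of the raw reads for one genome. Every input is an illustrative assumption rather than a figure from this module: a genome of about 3.1 billion bases, 30x coverage, 150-base reads, a 50-character read header, and one byte per character in a four-line text format. The function name `estimate_fastq_bytes` is hypothetical.

```python
def estimate_fastq_bytes(genome_size_bases, coverage, read_length, header_bytes=50):
    """Rough uncompressed size of raw reads stored as four-line text records.

    All inputs are assumptions chosen by the caller; real files vary with
    header length, read length, and compression.
    """
    total_bases = genome_size_bases * coverage
    number_of_reads = total_bases // read_length
    # Each record: header line, sequence line, '+' line, quality line (each ends in a newline).
    bytes_per_read = (header_bytes + 1) + (read_length + 1) + 2 + (read_length + 1)
    return number_of_reads * bytes_per_read


# Illustrative inputs only: ~3.1 billion bases, 30x coverage, 150-base reads.
size_bytes = estimate_fastq_bytes(3_100_000_000, 30, 150)
print(f"{size_bytes / 1e9:.1f} GB")  # prints 220.1 GB
```

Under these assumptions a single genome yields on the order of a couple of hundred gigabytes of raw text before compression, and that figure excludes intermediate files produced during assembly or variant calling. Multiplied across many samples, this is what drives the storage and computing requirements described above.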