Background & Objective: Coronavirus disease 2019 (COVID‐19) is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), resulting in a worldwide pandemic. The first case of COVID‐19 was reported from Wuhan in the Hubei Province of China in December 2019; however, the disease's origin is still mysterious. Whole-genome sequence analysis is essential for monitoring the spread of infectious diseases as well as studying the pathogenesis and evolution of viruses. In this study, analysis of 198 fully sequenced genomes from Iran and West Asia was carried out to study mutations, phylogeny, amino acid changes, clades, and lineages of these genomes as well as comparison of these sequences with those of reference Wuhan genome of NC_045512.2.
Methods: In total, 198 completely sequenced genome data from Iran and West Asia were collected from GenBank. Mutation detection was carried out using a trial version of CLC Genomics Workbench v.21.0 (QIAGEN, Germany). Online tools such as GISAID Mutations App and Pangolin were used for further analysis of the results.
Results: In this study, several unique mutation sites were identified in the Iranian genomes (n = 8); positions 1397 G>A and 29742 G>T were the most frequent changes in more than 85% of the Iranian genomes. Mutation rate, mutation per sequence, and transition versus transversion for the Iranian genomes included 4.73, 14.14, and 1.6, respectively. Generally, C>T alteration was the most common substitution in all the sequences.
Conclusion: The ORF1ab, N, and S were the genes with the most changes. The current data can help researchers predict future epidemics and establish better strategies to control viral pandemics.