Dataset of complete genome assembly and analysis of mycobacterium tuberculosis strain SIT745/EAI1-MYS
dc.contributor | Universiti Sains Malaysia | |
dc.contributor | Universiti Sains Malaysia | |
dc.contributor | Universiti Sains Malaysia | |
dc.contributor | Universiti Sains Malaysia | |
dc.contributor.author | Mohammad Abdullah | |
dc.contributor.author | Siti Suraiya | |
dc.contributor.author | Suharni Mohamad | |
dc.contributor.author | Azian Harun | |
dc.date.accessioned | 2023-05-12T00:59:59Z | |
dc.date.available | 2023-05-12T00:59:59Z | |
dc.date.issued | 2020-06-30 | |
dc.description | Complete genome assembly of M.tuberculosis SIT745/EAI1-MYS is done using contigs. BLAST was performed on contigs, corrections and gaps between the sequences are replaced with the reference genome sequence of M. tuberculosis strain H37Rv and M. bovis strain AF2122/97, and genome annotation Link to data: http://dx.doi.org/10.17632/9kgt46cpdh.1 | |
dc.description.abstract | In this dataset, we report the genome assembly and data analysis of Mycobacterium tuberculosis strain SIT745/EAI1-MYS. Previously, this strain was isolated from a Malaysian patient with extra-pulmonary tuberculosis, and identification of this strain is done by spoligotype patterns with fifteen known Shared International Type (SITs). Further analysis showed that this strain has a remarkable phylogeographical specificity for Malaysia. Based on the National Center for Biotechnology Information (NCBI) nucleotide database information, the complete genome consists of 150 contigs with various sequence lengths and was not assembled. In this assembly, the aforementioned contigs along with reference sequence from Mycobacterium tuberculosis strain H37Rv and Mycobacterium bovis strain AF2122/97 was used for gap closures, were assembled into a single circular chromosome length of approximately 4.42 Mega bases (Mb) with an average GC content of 65.6%. The single circular chromosome was shown to contain 4,009 protein-coding sequences, 3 ribosomal RNAs, 45 transfer RNAs, and 12 superclasses distributed with 277 subsystems which constitute nearly 1900 genes, respectively. The genome information will provide fundamental knowledge of this organism as well as insight for understanding genomic and proteomic profiling, phylogenetic relationship. | |
dc.identifier.doi | 10.1016/j.dib.2020.105949 | |
dc.identifier.uri | http://dx.doi.org/10.17632/9kgt46cpdh.1 | |
dc.identifier.uri | https://opendata.usm.my/handle/123456789/74656 | |
dc.publisher | Elsevier | |
dc.rights | CC0 1.0 Universal | * |
dc.rights.uri | http://creativecommons.org/publicdomain/zero/1.0/ | * |
dc.title | Dataset of complete genome assembly and analysis of mycobacterium tuberculosis strain SIT745/EAI1-MYS | |
dc.type | Dataset |