Taxonomic Information
GTDB Taxonomy d__Bacteria; p__Proteobacteria; c__Gammaproteobacteria; o__Enterobacterales; f__Enterobacteriaceae; g__Escherichia; s__Escherichia coli
Filtered NCBI Taxonomy d__Bacteria; p__Proteobacteria; c__Gammaproteobacteria; o__Enterobacterales; f__Enterobacteriaceae; g__Escherichia; s__Escherichia coli
Unfiltered NCBI Taxonomy d__Bacteria; p__Proteobacteria; c__Gammaproteobacteria; o__Enterobacterales; f__Enterobacteriaceae; g__Escherichia; s__Escherichia coli; x__Escherichia coli ARS4.2123
NCBI Strain Identifiers ARS4.2123
GTDB Type Material Designation not type material
GTDB Representative of Species False (representative is GCA_003697165.2)
Genome Characteristics
CheckM Completeness 99.77%
CheckM Contamination 0.04%
CheckM Strain Heterogeneity 0.0
5S Count 3
16S Count 2
23S Count 1
tRNA Count 20
Contig Count 244
N50 Contigs 93,409 bp
Longest Contig 322,393 bp
Scaffold Count 208
N50 Scaffolds 101,418 bp
Longest Scaffold 322,393 bp
Genome Size 4,977,300 bp
Protein Count 4,871
Coding Density 87.80%
GC Percentage 50.47%
Ambiguous Bases N/A
GTDB representative GCA_003697165.2
GTDB representative of species False
NCBI Metadata
Assembly Level Scaffold
Assembly Name ASM30409v2
Assembly Type n/a
Bioproject PRJNA224116
Biosample SAMN00806481
Country USA
Date 2012-10-12
Genbank Assembly Accession GCA_000304095.2
Genome Category Isolate
Genome Representation full
Isolate None
Isolation Source water
Latitude Longitude None
Molecule Count N/A
CDS Count 5,302
Refseq Category na
Seq Rel Date 2012/10/12
Spanned Gaps 36
Species Taxid 562
SSU Count 4
Submitter Institute for Genome Sciences
Taxid 1005563
Total Gap Length 1860
Translation Table 11
tRNA Count (total) 83
Type Material None
Unspanned Gaps 0
Version Status latest
WGS Master AMUL00000000.1