Complete Genome Sequence of Salmonella ... - Semantic Scholar

3 downloads 0 Views 164KB Size Report
Oct 23, 2014 - Present address: T. Freitas, Med Fusion, Lewisville, Texas, USA. Salmonella enterica constitutes a group of enteric pathogens with a broad ...
crossmark

Complete Genome Sequence of Salmonella enterica subsp. enterica Serovar Enteritidis Strain SEJ K. A. Bishop-Lilly,a,b K. G. Frey,a,b H. E. Daligault,c K. W. Davenport,c D. C. Bruce,c P. S. Chain,c S. R. Coyne,d O. Chertkov,c T. Freitas,c* J. Jaissle,c G. I. Koroleva,e J. T. Ladner,e T. D. Minogue,d G. F. Palacios,e C. L. Redden,a,b Y. Xu,c S. L. Johnsonc Naval Medical Research Center (NMRC)–Frederick, Fort Detrick, Maryland, USAa; Henry M. Jackson Foundation, Bethesda, Maryland, USAb; Los Alamos National Laboratory (LANL), Los Alamos, New Mexico, USAc; Diagnostic Systems Division (DSD), United States Army Medical Research Institute of Infectious Diseases (USAMRIID), Fort Detrick, Maryland, USAd; Center for Genome Sciences (CGS), USAMRIID, Fort Detrick, Maryland, USAe * Present address: T. Freitas, Med Fusion, Lewisville, Texas, USA.

Salmonella enterica constitutes a group of enteric pathogens with a broad host range, including humans, reptiles, and birds. S. enterica subsp. enterica is a common cause of inflammatory diarrhea in humans. We present the draft genome of S. enterica subsp. enterica serovar Enteritidis strain SEJ, including a 59-kbp plasmid. Received 12 September 2014 Accepted 16 September 2014 Published 23 October 2014 Citation Bishop-Lilly KA, Frey KG, Daligault HE, Davenport KW, Bruce DC, Chain PS, Coyne SR, Chertkov O, Freitas T, Jaissle J, Koroleva GI, Ladner JT, Minogue TD, Palacios GF, Redden CL, Xu Y, Johnson SL. 2014. Complete genome sequence of Salmonella enterica subsp. enterica serovar Enteritidis strain SEJ. Genome Announc. 2(5):e01084-14. doi: 10.1128/genomeA.01084-14. Copyright © 2014 Bishop-Lilly et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 Unported license. Address correspondence to S. L. Johnson, [email protected].

S

almonella enterica constitutes a group of enteric pathogens with a broad host range, including humans, reptiles, and birds. Of this group, S. enterica subsp. enterica is the predominant pathogen of birds and mammals. Disease manifestations and/or severity can vary by host status and age but a typical manifestation would be inflammatory diarrhea (reviewed in reference 1). Tracking enteric pathogens such as these for epidemiological purposes has traditionally been performed via pulsed-field gel electrophoresis (PFGE); however, for the serovar Enteriditis, phylogenetic analysis based on whole-genome sequence data was recently shown to provide increased resolution (2). Accordingly, there is utility in increasing the number of highquality references available. We present here the draft genome of S. enterica subsp. Enterica serovar Enteritidis strain SEJ, sequenced to 314⫻ total draft coverage. A QIAgen Genome Tip-500 was used to extract high-quality genomic DNA from a purified isolate. Specifically, nucleic acid was extracted from a 100-mL bacterial culture at stationary phase. This draft genome was derived via a combination of Illumina library technologies (1). A 100-bp unpaired library was constructed (312-fold genome coverage) as well as a longinsert paired-end library (7,944 ⫾ 2,066-bp insert and 2-fold genome coverage). The paired-end data were assembled in Newbler (3) and those consensus sequences computationally shredded into 2 kbp overlapping fake reads (shreds). The standard Illumina sequencing data were assembled with VELVET (2) and the consensus sequence computationally shredded into 1.5-kbp overlapping shreds. All data were additionally assembled together in Allpaths (4) and the consensus sequences computationally shredded into 5-kbp overlapping shreds. All of the shreds were integrated using parallel phrap (High Performance Software, LLC). Possible misassemblies were corrected and re-

September/October 2014 Volume 2 Issue 5 e01084-14

peat regions verified using in-house scripts and manual editing in Consed (3–6). By these methods the chromosome and plasmid sequence for this strain were assembled into a single contig each. Annotation was performed using an Ergatis-based (7) workflow with minor manual curation. The S. enterica SEJ chromosome (CP008928) is 4,678,927 bp (52.2% G⫹C content) and encodes 4,341 coding sequences, 21 rRNAs, and 82 tRNAs. The 59,372-bp plasmid (CP008927) has 52.0% G⫹C content and encodes 77 CDSs. This draft genome expands the repertoire of Salmonella whole-genome sequences available for comparative analysis. Nucleotide sequence accession numbers. The NCBI accession no. for the S. enterica SEJ chromosome is CP008928 and for the plasmid is CP008927. ACKNOWLEDGMENTS Funding for this effort was provided by the Defense Threat Reduction Agency’s Joint Science and Technology Office (DTRA J9-CB/ JSTO). This manuscript is approved by the LANL for unlimited release (LAUR-14-26025). The views expressed in this article are those of the authors and do not necessarily reflect the official policy or position of the Department of the Navy, Department of Defense, or the US. Government.

REFERENCES 1. Bennett S. 2004. Solexa Ltd. Pharmacogenomics 5:433– 438. http:// dx.doi.org/10.1517/14622416.5.4.433. 2. Zerbino DR, Birney E. 2008. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18:821– 829. http:// dx.doi.org/10.1101/gr.074492.107. 3. Ewing B, Green P. 1998. Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 8:186 –194. 4. Butler J, MacCallum I, Kleber M, Shlyakhter IA, Belmonte MK, Lander

Genome Announcements

genomea.asm.org 1

Bishop-Lilly et al.

ES, Nusbaum C, Jaffe DB. 2008. ALLPATHS: de novo assembly of wholegenome shotgun microreads. Genome Res. 18:810 – 820. http://dx.doi.org/ 10.1101/gr.7337908. 5. Ewing B, Hillier L, Wendl MC, Green P. 1998. Base-calling of automated sequencer traces using phred. I. Accuracy Assess. Genome Res. 8:175–185. http://dx.doi.org/10.1101/gr.8.3.175.

2 genomea.asm.org

6. Gordon D, Abajian C, Green P. 1998. Consed: a graphical tool for sequence finishing. Genome Res. 8:195–202. http://dx.doi.org/10.1101/ gr.8.3.195. 7. Hemmerich C, Buechlein A, Podicheti R, Revanna KV, Dong Q. 2010. An Ergatis-based prokaryotic genome annotation web server. Bioinformatics 26:1122–1124. http://dx.doi.org/10.1093/bioinformatics/btq090.

Genome Announcements

September/October 2014 Volume 2 Issue 5 e01084-14