Citation

BibTex format

@article{Derelle:2024:10.1101/gr.279449.124,
author = {Derelle, R and von, Wachsmann J and Mäklin, T and Hellewell, J and Russell, T and Lalvani, A and Chindelevitch, L and Croucher, NJ and Harris, SR and Lees, JA},
doi = {10.1101/gr.279449.124},
journal = {Genome Res},
pages = {1661--1673},
title = {Seamless, rapid, and accurate analyses of outbreak genomic data using split k-mer analysis.},
url = {http://dx.doi.org/10.1101/gr.279449.124},
volume = {34},
year = {2024}
}

RIS format (EndNote, RefMan)

TY  - JOUR
AB - Sequence variation observed in populations of pathogens can be used for important public health and evolutionary genomic analyses, especially outbreak analysis and transmission reconstruction. Identifying this variation is typically achieved by aligning sequence reads to a reference genome, but this approach is susceptible to reference biases and requires careful filtering of called genotypes. There is a need for tools that can process this growing volume of bacterial genome data, providing rapid results, but that remain simple so they can be used without highly trained bioinformaticians, expensive data analysis, and long-term storage and processing of large files. Here we describe split k-mer analysis (SKA2), a method that supports both reference-free and reference-based mapping to quickly and accurately genotype populations of bacteria using sequencing reads or genome assemblies. SKA2 is highly accurate for closely related samples, and in outbreak simulations, we show superior variant recall compared with reference-based methods, with no false positives. SKA2 can also accurately map variants to a reference and be used with recombination detection methods to rapidly reconstruct vertical evolutionary history. SKA2 is many times faster than comparable methods and can be used to add new genomes to an existing call set, allowing sequential use without the need to reanalyze entire collections. With an inherent absence of reference bias, high accuracy, and a robust implementation, SKA2 has the potential to become the tool of choice for genotyping bacteria. SKA2 is implemented in Rust and is freely available as open-source software.
AU - Derelle,R
AU - von,Wachsmann J
AU - Mäklin,T
AU - Hellewell,J
AU - Russell,T
AU - Lalvani,A
AU - Chindelevitch,L
AU - Croucher,NJ
AU - Harris,SR
AU - Lees,JA
DO - 10.1101/gr.279449.124
EP - 1673
PY - 2024///
SP - 1661
TI - Seamless, rapid, and accurate analyses of outbreak genomic data using split k-mer analysis.
T2 - Genome Res
UR - http://dx.doi.org/10.1101/gr.279449.124
UR - https://www.ncbi.nlm.nih.gov/pubmed/39406504
VL - 34
ER -

Contact us


For any enquiries related to the MRC Centre please contact:

Scientific Manager
Susannah Fisher
mrc.gida@imperial.ac.uk

External Relationships and Communications Manager
Dr Sabine van Elsland
s.van-elsland@imperial.ac.uk