Machine learning to predict the source of campylobacteriosis using whole genome data

Author:

Arning NicolasORCID,Sheppard Samuel K.ORCID,Bayliss SionORCID,Clifton David A.,Wilson Daniel J.

Abstract

Campylobacteriosis is among the world’s most common foodborne illnesses, caused predominantly by the bacterium Campylobacter jejuni. Effective interventions require determination of the infection source which is challenging as transmission occurs via multiple sources such as contaminated meat, poultry, and drinking water. Strain variation has allowed source tracking based upon allelic variation in multi-locus sequence typing (MLST) genes allowing isolates from infected individuals to be attributed to specific animal or environmental reservoirs. However, the accuracy of probabilistic attribution models has been limited by the ability to differentiate isolates based upon just 7 MLST genes. Here, we broaden the input data spectrum to include core genome MLST (cgMLST) and whole genome sequences (WGS), and implement multiple machine learning algorithms, allowing more accurate source attribution. We increase attribution accuracy from 64% using the standard iSource population genetic approach to 71% for MLST, 85% for cgMLST and 78% for kmerized WGS data using the classifier we named aiSource. To gain insight beyond the source model prediction, we use Bayesian inference to analyse the relative affinity of C. jejuni strains to infect humans and identified potential differences, in source-human transmission ability among clonally related isolates in the most common disease causing lineage (ST-21 clonal complex). Providing generalizable computationally efficient methods, based upon machine learning and population genetics, we provide a scalable approach to global disease surveillance that can continuously incorporate novel samples for source attribution and identify fine-scale variation in transmission potential.

Funder

Biotechnology and Biological Sciences Research Council

Wellcome Trust

Medical Research Council

Wellcome Trust (GB) and Royal Society

robertson foundation

National Institute for Health Research (NIHR) Oxford Biomedical Research Centre

Publisher

Public Library of Science (PLoS)

Subject

Cancer Research,Genetics (clinical),Genetics,Molecular Biology,Ecology, Evolution, Behavior and Systematics

Reference57 articles.

1. The European Union One Health 2018 Zoonoses Report;EFSA Journal,2019

2. Global Epidemiology of Campylobacter Infection;NO Kaakoush;Clinical Microbiology Reviews,2015

3. Niche segregation and genetic structure of Campylobacter jejuni populations from wild and agricultural host species;SK Sheppard;Molecular Ecology,2011

4. Host Association of Campylobacter Genotypes Transcends Geographic Variation;SK Sheppard;Applied and Environmental Microbiology,2010

5. Campylobacter Species and Guillain-Barré Syndrome;I Nachamkin;Clinical Microbiology Reviews,1998

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3