Machine learning models able to predict phage bacteria...

Machine-learning models able to predict phage-bacteria interactions Diogo Leite1, Grégory Resch2, Yok-Ai Que3, Xavier Brochet1, Carlos Peña1

1School of Business and Engineering Vaud (HEIG-VD), University of Applied Sciences Western Switzerland

(HES-SO), Swiss Institute of Bioinformatics (SIB), Switzerland 2Department of Fundamental Microbiology, University of Lausanne, Lausanne, Switzerland 3Department of Intensive Care Medicine, Bern University Hospital (Inselspital), Bern, Switzerland

Abstract Phage-therapy, a promising alternative to antibiotic-resistance, uses phages to infect and kill pathogenic bacteria. It requires finding per-

fectly matching phage-bacterium pairs, a time and money-consuming task, currently achieved empirically in laboratory. Our project aims at

improving this task by predicting, in-silico, if a given phage-bacterium pair would interact. Predictions are performed on the base of public

genomic data combined with machine-learning algorithms. With such an approach we have obtained around 90% of predictive power. In

order to improve these results, we will extend our methodology and we will validate it with newly-generated clinically-relevant data.

Overview: prediction of phage-bacteria interactions

Datasets Accuracy F-Score Sensitivity Specificity

NB50 89.78% 90.13% 89.56% 90.12%

NBN50 89.79% 90.13% 89.56% 90.12%

S1e-6 85.79% 86.24% 85.43% 86.35%

Future work:

• Explore other types of features to further increase model performance.

• Improve the predicitvity of the dataset by prunning redundant/

correlated variables that may perturb modeling

• Search for new relevant interactions allowing to predict interactions for

different strains of a given bacterial species

• Improve domain interactions scores

For our project we:

• Acquired data from public databases as NCBI [1] and

phagesdb.org [2]. From GeneMarkS [3] we predict

genes in bacterial and phages genomes.

• Constituted a positive dataset with 1064 phage-

bacteria interactions pairs.

• Developed a model able to predict phage-bacteria in-

teractions

Feature engineering: obtention of informative features

On our data we:

• Extracted two kinds of features based on:

• Protein interactions (PFAM Domain-domain interactions) - 18 datasets

• Genomic sequences (% of amino acids, chemical components and

weight) - 1 dataset

• Corrected the over-representation of the Mycobacterium smegmatis

mc2 155 bacterium, reducing it from 86% to 14%

Machine-Learning based modeling

Main results and conclusions

In our machine-learning search we used:

• 19 datasets (12’918 samples)

• Predictive model building with 10-fold cross-validation

• Four approaches: K-Nearest Neighbors (kNN), Random Forests (RF), Support Vector Ma-

chines (SVM), and Artificial Neural Networks (ANN)

In order to identify the best datasets and configurations of algorithms, we performed the

following:

1. Selected the datasets exhibiting the highest scores across the four approaches

2. Further modelling to refine the algorithms’ hyperparameters

The best results on the test, obtained by ANN with 9 neu-

rons and 50 epochs are presented below:

Data acquisition Project overview

Domain-domain scoring Sequence quantification

Process of research used

References:

[1] http://www.ncbi.nlm.nih.gov/; [2] http://phagesdb.org/; [3] http://www.ncbi.nlm.nih.gov/genomes/MICROBES/genemark.cgi [Features extraction based on article by E. Coelho] Computational prediction of the human-microbial oral interactome Found by:

Machine learning models able to predict phage bacteria...

Documents

30. Genetics and recombination in bacteria. Lecture Outline 11/16/05 Replication in bacteria Types of recombination in bacteria –Transduction by phage

Lambda Phage

Adaptive Dynamics of Temperate Phages. Introduction Phages are viruses which infect bacteria A temperate phage can either replicate lytically or lysogenically

Isolate phage Prepare phage for testing Create high titer lysate via flooding method. Isolate phage DNA and replicate using PCR. *Send off isolated phage

Molecular Genetic Analysis in Bacteria › psaxena › BIO366 › figures › chap_14.pdf · Molecular Genetic Analysis in Bacteria 14 FIND CHAP TOC Insert in phage φ105J19 CATATCAAGGAGGAATGAG

Phage stratagies

“Classical” view of bacteria genome Single chromosome May have plasmids and phage Simple gene structure Genes have recognisable phenotype Vibrio y Bacteriodes

Phage display technology - uni- · PDF filePrinciple of the phage display technology . ... This combination of phage display and cell surface display on bacteria ... Volesung-Biochemie-Phage

An age-size structured model for bacteria-phage interaction

Marcie the BacteriophageBacteriophage or phage for short is a virus that infects bacteria. To infect bacteria a phage injects its DNA into the bacteria. There are two common phage

DIVERSITY OF LACTIC ACID BACTERIA … · BACTERIA BACTERIOPHAGES FROM DAIRY ENVIRONMEMENT ... phage genomes, a hypothesis that was proposed previously for other bacteriophages. Le

T4 | s %O>bunshi4.bio.nagoya-u.ac.jp/~bunshi4/class/13bunshiiden/...T4 | s %O> | 19 11. Photograph of phage T2 plaques on a lawn of E. coli bacteria T2 phage T2 mutant phage T2 r :

9-1 Chapter 9 DNA-Protein Interactions in Bacteria Student learning outcomes: Describe examples of structure /function relationships in phage repressors

The phage lambda ( ) A phage (from 'bacteria' and Greek phagein ( , 'to eat') is a virus that infects and sometimes lyses bacteria. Like viruses

Model To Predict Aerial Dispersal of Bacteria during Environmental

New strategies for combating multidrug-resistant bacteria · Advantages Phage therapy is possible in all bacterial infections Phage coevolving with bacteria Specific- no effect on

Chapter 14 Phage Strategies. 14.1 Introduction Bacteriophages are viruses that infect bacteria. Figure 14.CO: TEM of a T4 phage. © Dr. Harold Fisher/Visuals

autoantibodies isolated by phage display human …...Phage display

Bacteria Genetics Exchange of Genetic Informationfdjpkc.fudan.edu.cn/_upload/article/files/6f/20/802d891e...• Lytic or virulent phage–Phage that multiply within the host cell,

Bifurcation analysis of a phage-bacteria interaction model