Skip to content

rnrlab/pipolin_bacteria_screening

Repository files navigation

Identification and characterization of pipolins in Bacteria

DOI

Pipolins constitute a new group of mobile genetic elements (MGEs) that encode a primer-independent B-family DNA polymerase (piPolB). Previously, pipolins had been identified as integrative MGEs in diverse bacteria and as circular plasmids in mitochondria and a few gram positive species. Detailed analysis of E. coli pipolins revealed pipolins are present in diverse phylogroups, encode a diverse repertoire of DNA metabolism genes, and show evidence of recent horizontal transfer between different E. coli strains.

Redrejo-Rodríguez, M., et al. Primer-independent DNA synthesis by a family B DNA polymerase from self-replicating Mobile genetic elements. Cell reports, 2017

Chuprikova, L., Mateo-Cáceres, V., de Toro, M. & Redrejo-Rodríguez, M. ExplorePipolin: reconstruction and annotation of piPolB-encoding bacterial mobile elements from draft genomes. Bioinformatics Advances 2022.

In this repository, we have made available the code developed during our latest work on pipolins. We carried out a pipolin screening of the Assembly (NCBI) database. The analysis of the structure of pipolins revealed that they are commonly integrative elements, usually flanked by direct repeats, sharing known mobile elements integration hotspots (e.g. tRNA genes). Remarkably, integrase dynamics correlates with alternative integration spots and enables diverse lifestyles, ranging from integrative to mobilizable plasmid pipolins. Pipolins harbor a minimal core and a large cargo module enriched for defense factors, which are actively exchanged with other mobile elements. These findings indicate pipolins act as orthogonal reservoirs of defense genes that play a key role in the exchange mechanisms for defense genes in bacterial populations.

Pre-print of this work is available in bioRxiv:

Mateo-Cáceres, V., Redrejo-Rodríguez, M. Pipolins are bimodular platforms that maintain a reservoir of defense systems exchangeable with various bacterial genetic mobile elements. bioxRiv, 2024

Each folder in this repository contains the code used in each project sub-task:

  1. Screening: Download genomes from Assembly using Datasets (NCBI) and pipolin detection with ExplorePipolin
  2. Pipolin metadata: Parse genome metadata and ExplorePipolin results to calculate and plot statistics.
  3. piPolB phylog: Infer and plot piPolB phylogeny.
  4. Pipolin CDS clustering: Cluster proteins encoded in pipolins and reannotation.
  5. Candidate recombinases: Create cluster presence-abscence matrix and find candidate pipolin recombinases.
  6. MGE extTools annot: Parse information of different annotation tools used on pipolins and other MGEs (plasmids, phages, ciMGEs). Plot annotation resuts.
  7. wGRR and RG calculation: Calculate wGRR (wighted gene repertoire relatedness) among pipolins and other annotated MGEs to detect recent events of gene exchange (aka recombining genes or RGs).

Disclaimer: Some scripts may require additional software/packages, files or databases that are not included in this repository. Since some input and output files exceed Github limits, the correct execution of several scripts uploaded here could be difficult. Do not hesitate to ask the authors for any missing file that might be of your interest, and ask for help if you encounter difficulties when using the code in this repository.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Packages

 
 
 

Contributors