🦠 In Silico Analysis of Bacterial Genomes to Identify and Characterize Antibiotic Resistance Genes (ARGs)
Antibiotic resistance is one of the most critical public health threats globally. Antibiotic Resistance Genes (ARGs) enable bacteria to survive antibiotic exposure through mechanisms such as enzymatic drug degradation, target site modification, efflux pumps, and reduced membrane permeability.
This project presents a web-based bioinformatics tool built with Python (Flask) + BLAST+ for real-time detection of ARGs from bacterial genome sequences — supporting precision antibiotic therapy and AMR surveillance.
- Collect and curate bacterial genome sequence datasets from NCBI GenBank and RefSeq
- Retrieve 135 essential bacterial genes from the DEG Database for reference
- Develop a web-based ARG detection tool using Python (Flask), HTML/CSS/JS, and BLAST+
- Enable real-time sequence matching against CARD and ResFinder databases
- Support comparative genomic analysis of ARG distribution
| Mechanism | Example | Gene(s) |
|---|---|---|
| Enzymatic degradation | β-lactamase hydrolyzes penicillin | blaTEM, blaSHV, blaOXA |
| Target site alteration | Modified DNA gyrase | gyrA, parC |
| Efflux pumps | Active drug expulsion | tetA, mexAB-oprM |
| Reduced permeability | Porin loss | oprD |
| Ribosomal modification | 16S rRNA methylation | aac(6')-Ib |
| Tool / Database | Purpose |
|---|---|
| NCBI GenBank / RefSeq | Bacterial genome sequence retrieval |
| CARD (card.mcmaster.ca) | Main ARG reference database |
| ResFinder | Supplementary ARG detection |
| DEG (Database of Essential Genes) | 135 essential gene reference set |
| BLAST+ | Sequence alignment and homology search |
| Python (Flask) | Backend web application |
| HTML / CSS / JavaScript | Frontend interface |
- Bacterial gene sequences retrieved from NCBI GenBank and RefSeq in FASTA format
- ARG families included: β-lactamases, aminoglycoside-modifying enzymes, efflux pumps, target modification genes
- 135 essential genes curated from DEG Database (experimentally verified)
- Covers DNA replication, transcription, translation, and cell wall synthesis
- Used to cross-reference ARG hits and prevent misclassification
Backend (Python / Flask):
- BLAST-based local alignment against CARD ARG sequences
- Threshold: ≥85% sequence identity + e-value ≤ 1×10⁻⁵
- Resistance class mapping: β-lactams, aminoglycosides, macrolides, fluoroquinolones, tetracyclines
Frontend (HTML/CSS/JavaScript):
- FASTA sequence input
- Gene name input
- Real-time results display
Tool Output:
- Matched ARG Name
- ARG ID (ARO ID from CARD)
- Resistance Class
- Percent Identity
- Database Source (CARD / ResFinder)
- Confidence Score
| Category | Example Genes | Function |
|---|---|---|
| DNA replication | Rv0005 (GyrA), Rv0004 (GyrB) | DNA gyrase — fluoroquinolone target |
| Transcription | Rv0667 (RpoB) | RNA polymerase β-subunit — rifampicin target |
| Catalase-peroxidase | Rv1908c (KatG) | Isoniazid activation |
| ATP synthesis | Rv3794 (AtpA) | Energy metabolism |
| Resistance Mechanism | Genes | Antibiotic Class |
|---|---|---|
| KatG mutations | katG | Isoniazid |
| RpoB mutations | rpoB | Rifampicin |
| Efflux pump | mmpS5-mmpL5 | Bedaquiline, Clofazimine |
| Aminoglycoside modifying | aac(6')-Ib | Aminoglycosides |
| β-lactamase | bla genes | β-lactam antibiotics |
# Clone the repository
git clone https://github.com/Shubham9845/arg-detection-tool.git
cd arg-detection-tool
# Install dependencies
pip install flask biopython
# Run the app
python app.pyOpen browser at http://localhost:5000