This repository contains data and scripts for the construction of strain-specific Genome-scale Metabolic Models and some comparisons.
Isolate sequencing data was processed using our isolate Nextflow pipeline, which consists of the following steps:
- QC and trimming with FASTP
- Assembly with MEGAHIT
- Classification with GTDB-TK
- Quality control with CHECKM2
- Phylogenetic tree building with FastTree based on the GTDB alignment
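
As a rough orientation, the steps above correspond to command lines like the ones assembled in this sketch. It is not the pipeline's actual code: the function name `isolate_commands`, all file and directory names, and the choice of options are assumptions made for illustration, and only commonly documented flags of each tool are used. The commands are built as lists and not executed.

```python
from pathlib import Path


def isolate_commands(sample, r1, r2, outdir, alignment):
    """Return illustrative command lines for one isolate (nothing is run).

    All paths are hypothetical placeholders. `alignment` stands for the
    multiple sequence alignment written by GTDB-Tk.
    """
    out = Path(outdir)
    trimmed_r1 = out / "trimmed" / f"{sample}_R1.fastq.gz"
    trimmed_r2 = out / "trimmed" / f"{sample}_R2.fastq.gz"
    # Assumed directory that collects one assembly FASTA per isolate.
    genomes = out / "genomes"

    return [
        # QC and trimming of paired-end reads
        ["fastp", "-i", r1, "-I", r2,
         "-o", str(trimmed_r1), "-O", str(trimmed_r2)],
        # Assembly (MEGAHIT writes final.contigs.fa into its output folder)
        ["megahit", "-1", str(trimmed_r1), "-2", str(trimmed_r2),
         "-o", str(out / "megahit" / sample)],
        # Taxonomic classification (recent GTDB-Tk releases may need
        # additional options, depending on the version)
        ["gtdbtk", "classify_wf", "--genome_dir", str(genomes),
         "--out_dir", str(out / "gtdbtk"), "--extension", "fa"],
        # Completeness and contamination estimates
        ["checkm2", "predict", "--input", str(genomes),
         "--output-directory", str(out / "checkm2"), "-x", "fa"],
        # Tree building; FastTree prints the Newick tree to standard output
        ["FastTree", str(alignment)],
    ]


if __name__ == "__main__":
    for cmd in isolate_commands("iso1", "iso1_R1.fq.gz", "iso1_R2.fq.gz",
                                "results", "results/gtdbtk/msa.fasta"):
        print(" ".join(cmd))
```

Keeping each command as a list makes it straightforward to pass to `subprocess.run` later without shell quoting issues.
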
The DM38 medium was manually converted from the media recipe by splitting compounds into their respective ions and annotating those ions using the ModelSEED database. The medium in MICOM and CARVEME formats is available in this repo here.
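
The conversion idea can be sketched in a few lines of Python. Everything in this example is illustrative: the recipe, the concentrations, and the dissociation table are made up and are not the DM38 recipe, and the ModelSEED compound IDs shown should be checked against the database before use.

```python
from collections import defaultdict

# Hypothetical dissociation table: compound -> ions released per formula unit.
DISSOCIATION = {
    "NaCl": {"Na+": 1, "Cl-": 1},
    "MgCl2": {"Mg2+": 1, "Cl-": 2},
    "KCl": {"K+": 1, "Cl-": 1},
}

# Assumed ion -> ModelSEED compound ID annotation (verify before use).
MODELSEED_IDS = {
    "Na+": "cpd00971",
    "Cl-": "cpd00099",
    "Mg2+": "cpd00254",
    "K+": "cpd00205",
}


def recipe_to_ions(recipe_mM):
    """Split each compound into its ions and sum concentrations per ion.

    `recipe_mM` maps compound names to concentrations in mM. The result
    maps ModelSEED IDs to total ion concentrations in mM.
    """
    totals = defaultdict(float)
    for compound, conc in recipe_mM.items():
        for ion, count in DISSOCIATION[compound].items():
            totals[MODELSEED_IDS[ion]] += conc * count
    return dict(totals)


if __name__ == "__main__":
    # Made-up example recipe, not DM38.
    example = {"NaCl": 10.0, "MgCl2": 2.0}
    for cpd, conc in sorted(recipe_to_ions(example).items()):
        print(cpd, conc)
```

Ions shared by several compounds (chloride here) are summed, which is why the conversion cannot be done compound by compound in isolation.
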
Models were built with our model builder Nextflow pipeline using CARVEME and gapfilling on the DM38 medium. A joined model containing all strains is also provided in the `models/data` folder.
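
For a single genome, the model building step might look roughly like the following sketch. It only assembles a CARVEME command line and does not run it. The helper name `carve_command`, the file names, and the medium identifier `DM38` as an entry in a custom media table are assumptions, and the options used by the actual pipeline may differ.

```python
def carve_command(proteins_faa, model_xml, media_tsv, medium="DM38"):
    """Build an illustrative `carve` call that gapfills on one medium.

    `media_tsv` is assumed to be a media table in CARVEME format that
    contains an entry named `medium`.
    """
    return [
        "carve", proteins_faa,
        "--gapfill", medium,      # gapfill so the model can grow on this medium
        "--mediadb", media_tsv,   # custom media table instead of the built-in one
        "-o", model_xml,          # output SBML file
    ]


if __name__ == "__main__":
    cmd = carve_command("iso1.faa", "iso1.xml", "media/dm38_carveme.tsv")
    print(" ".join(cmd))
```
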
The analysis included in the paper can be found in the notebook.