Genome Design and Analytics Tools Team
As part of the Biofoundry, we have two interconnected teams developing novel computational approaches for the design and analysis of microbial platform strains.
The Genome Design team is tasked with developing novel computational tools toward the prospective design and analysis of platform strains. We have developed three computational and data analytics tools that are of primary importance for this effort: 1) Metabolic Reconstructions and Modeling, 2) Genome Sequence (Pangenome) Analysis, and 3) Transcriptional Regulatory Network (iModulon) Analysis. We have assembled these tools into a comprehensive software suite for platform strain design, termed the StrainCAD software package. We are collaborating within the Biofoundry to deploy these tools for priority platform strain design projects at the Center to accelerate experimental workflows with computational predictions. Furthermore, we have pioneered experimental tools to utilize data science principles for synthetic biology, in the form of iModulon engraftment and engineering, to accelerate experimental strain design efforts.
The Genome Analytics team is conducting large-scale biological data analysis in E. coli and other bacteria, specializing in the extraction of actionable knowledge from multi-omics datasets. The team's overarching goal is the development of StrainMD, an integrated platform for diagnosing unexpected traits of engineered strains. At the core of StrainMD is sophisticated multi-omic visualization dashboards that power a collaborative Genome Analytics studio.
Technologies
- Computational Genome Design
- Metabolic modelling
- DNA Sequence-based machine learning
- Computational Data Analytics
- iModulon analysis
- Pangenome analysis
- Structural biology
- Experimental Genome Design
- iModulon engraftment
- iModulon engineering
Software/databases
- Kinetic model construction: MASSef
- Kinetic modelling simulation workflows: MASSpy
- Online short course on GD and GA tools
- Independent Component Analysis of Prokaryotic Gene Expression: iModulonDB
- Metabolic Model Database: BiGG
Selected Publications
- Sastry AV, Gao Y, Szubin R, Hefner Y, Xu S, Kim D, Choudhary KS, Yang L, King ZA, Palsson BO. The Escherichia coli transcriptome mostly consists of independently regulated modules. Nat Commun. 2019 Dec 4;10(1):5536. doi: 10.1038/s41467-019-13483-w. PMID: 31797920; PMCID: PMC6892915.
- Catoiu EA, Phaneuf P, Monk J, Palsson BO. Whole-genome sequences from wild-type and laboratory-evolved strains define the alleleome and establish its hallmarks. Proc Natl Acad Sci U S A. 2023 Apr 11;120(15):e2218835120. doi: 10.1073/pnas.2218835120. Epub 2023 Apr 3. PMID: 37011218; PMCID: PMC10104531.
- Akbari A, Yurkovich JT, Zielinski DC, Palsson BO. The quantitative metabolome is shaped by abiotic constraints. Nat Commun. 2021 May 26;12(1):3178. doi: 10.1038/s41467-021-23214-9. PMID: 34039963; PMCID: PMC8155068.
- Shin J, Rychel K, Palsson BO. Systems biology of competency in Vibrio natriegens is revealed by applying novel data analytics to the transcriptome. Cell Rep. 2023 Jun 27;42(6):112619. doi: 10.1016/j.celrep.2023.112619. Epub 2023 Jun 6. PMID: 37285268.
Contact
Daniel Zielinski
Head of team
dczielin@ucsd.edu