Open DNA collection for building synthetic cells
Nucleus DNA Distribution
The Nucleus DNA Distribution consists of open source genetic constructs that can be used to engineer synthetic cells. The collection is part of the open source Nucleus synthetic cell package, which also includes protocols and tools for synthetic cell engineering that we validate will work together.
Version 0.1.0 consists of a complete set of genetic constructs that can be used to make the PURE system, a promoter set to tune protein expression, a reporter set for reproducible measurement of synthetic cells, and a basic set of open MoClo components for gene assembly.
Version 0.2.0 introduces two (2) plasmids encoding an Emitter module, as well as more tools for synthetic cell building (e.g., reporters and MoClo parts).
This DNA distribution has been prepared under OpenMTA with Free Genes and the BioBricks Foundation.
Distribution Plates
Nucleus v0.1.0 Distribution PlateNucleus v0.2.0 Distribution Platev0.2.0 Distribution
The v0.2.0 distribution is broken into four categories: PURE, Measurement Reporters, Modules, and MoClo.
PURE
These genes encode the proteins making up the PURE cell-free expression system. You can use these plasmids to express and purify these proteins in order to make PURE (see PURE Protocols). We include 50 plasmids encoding 36 proteins used in PURE, plus some alternative variants of some of these proteins. The genes are under the control of the pT7-lacO
promoter for protein production in E. coli T7 production strains (e.g., BL21(DE3) or NEB T7 Express). Each gene is tagged with either an N- or C-terminal 6xHistidine tag as in Shimizu et al. (2001).
The constructs have been substantially upgraded from their original designs, incorporating improvements in DNA design and protein expression techniques:
- Consistent backbones, linkers, and other motifs.
- Dual terminator isolation to limit background expression.
- Direct expression from the original plasmid, but the ability to extract the gene as a MoClo part for use with different promoters, terminators, or tags, or assembly into a multiple-transcription-unit construct.
- Codon optimized for high expression in E. coli.
- Removal of relevant assembly restriction enzyme sites.
Several PURE genes are included as multiple variants or components. The adenylate kinase/myokinase (AK) gene is included as derived from E. coli, S. cerevisiae, chicken (G. gallus) and rabbit (O. cuniculus) sources, spanning those reported or included in other PURE DNA collections. Likewise, creatine kinase (CK) is included as derived from chicken and rabbit. The two heteromeric PURE tRNA synthetases, glyRS and pheRS, are included as standalone components for each subunit, as well as multigene transcription units in both gene orientations, reflecting differences between the original Shimizu and FreeGenes PUREiodic Table construct designs. A final, more highly engineered version of each synthetase is also included, with a histidine tag added to each complex subunit in order to improve protein purification efficiency.
The PURE genes can be expressed directly from pOpen in small quantities, but we recommend subcloning them into an expression vector (e.g., pET28a) using MoClo or HiFi assembly.
Measurement Reporters
Measurement reporters are included to produce comparable measurements of synthetic cells between labs and to provide visual controls for the protein purification steps involved in making PURE from the PURE gene set.
Purification control reporters
The purification control reporters include four chromoproteins, proteins which produce a visible color rather than (or, in some cases, as well as) fluorescence. The chromoproteins are gfasPurple, cjBlue, amajLime, and eforRed. CjBlue and eforRed are each available in multiple versions: a pT7 promoter for direct expression, a pT7-lacO promoter for determining the efficiency of lac repression, and a version with an N-terminal 6xHis tag under control of a pT7-lacO promoter to visually verify efficiency of protein purification.
Level-matched T7 promoters
The level-matched T7 promoters are a set of 10 pT7 promoter variants, selected to allow for transcriptional control across two orders of magnitude, along the same lines as the Anderson Collection promoters for E. coli. The promoters are useful for implementing basic control in a synthetic cell, reducing the expression of toxic genes, and matching the stoichiometries of different proteins within multigene DNA constructs and synthetic cell modules.
Each promoter appears as a MoClo Level 0 ‘P’ part, while also driving expression of efasGFP for immediate use and measurement within PURE or cells containing the T7 RNA polymerase.
Modules
Two (2) detector modules are included, based on lac repression and tet repression. Each module includes a plasmid encoding the repressor protein, lacI and tetR respectively, as well as a reporter construct encoding plamGFP under control of the matching operator site. An integrated lac-tet reporter is also included, allowing for the expression of plamGFP in the presence of both IPTG and tetracycline—a simple AND gated reporter. See Modules - Detector modules for details.
One (1) Emitter module is included, based on the native E. coli acyl homoserine lactone quorum sensing system. This module is made of two (2) plasmids: one (pT7-bjaI
) encodes an enzyme that can produce IV-HSL, the quorum molecule, which can freely diffuse across the synthetic cellular membrane; the other (pBjaR-plamGFP
) encodes an E. coli promoter under the regulation of BjaR, a transcriptional activator that responds to IV-HSL, driving the expression of a green fluorescent reporter. See for details.
MoClo
Several basic MoClo parts are included to facilitate the subcloning of other genes with alternative promoters, UTRs, and terminators. These parts also form a basis for developing a MoClo-compatible collection licensed under the OpenMTA.
The pOpen-MoClo-L1-1 vector is a modification of the FreeGenes pOpenv3 vector making it suitable as a MoClo level 1 destination plasmid. The vector contains an insert encoding J23103-UTR1-cjBlue, allowing for blue-white selection of correct assemblies.
Also included are two 5’ untranslated regions, UTR1 and the reference RBS from Elowitz (1999), and three terminators, tT7, tT7hyb6, and tT7hyb10. The UTR and terminator parts are designed as MoClo level 0 ‘U’ and ‘T’ modules, respectively.
UTR1 is based on the T7 g10 leader sequence and results in highly efficient translation efficiency and consequent high expression levels in systems containing the E. coli ribosome. The Elowitz RBS is the reference RBS used to define RBS efficiency in Elowitz (1999) and by the iGEM community.
The T7 terminator tT7 is the consensus terminator derived from T7 g1, commonly used in protein production and recommended for use in the PURE system. T7hyb6 and T7hyb10 are efficient synthetic terminators developed by Calvopina-Chavez, et al (2022) that have been shown to provide high termination efficiencies against T7 RNA polyermerase in vivo and in vitro as well as terminate transcription from the native E. coli RNA polymerase. The T7hyb6 terminator consists of a single termination domain while the more effective and longer T7hyb10 terminator consists of two termination domains linked by two pause sites.
How do I use this distribution?
All DNA artifacts encoding proteins (i.e., the PURE set, and all fluorescent and colormetric reporters) include a T7 promoter and E. coli ribosome binding site, making them “expression ready”. Just transform these plasmids directly into a T7 RNAP expression system (e.g., BL21(DE3) cells) and go! See Grow and Induce Expression Strains if you want to learn how to use inducible pT7 expression systems to make proteins.
All DNA artifacts are MoClo parts — you digest them with BsaI to generate standard MoClo overhang sites. You can then use Golden Gate assembly to assemble digested parts into a new DNA artifact. To learn more about MoClo, read the original paper here.
Designs
A full manifest of DNA components and sequence files is available on the Nucleus Github:
GitHub nucleus/dna-distribution/v0.1.0-001 at main · bnext-bio/nucleus
PURE Components
Features | Source | |||||
Component | Name | Description | Promoter | 6xHis Tag | MoClo | Sequence |
pT7-lacO-UTR1-AlaRS-tT7hyb6 | AlaRS | Alanine tRNA synthetase | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-ArgRS-tT7hyb6 | ArgRS | Arginine tRNA synthetase | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-AsnRS-tT7hyb6 | AsnRS | Asparagine tRNA synthetase | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-AspRS-tT7hyb6 | AspRS | Aspartic acid tRNA synthetase | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-CysRS-tT7hyb6 | CysRS | Cysteine tRNA synthetase | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-GlnRS-tT7hyb6 | GlnRS | Glutamine tRNA synthetase | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-GluRS-tT7hyb6 | GluRS | Glutamic acid tRNA synthetase | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-GlyS-tT7hyb6 | GlyS | Glycine tRNA synthetase S protein | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-GlyQ-tT7hyb6 | GlyQ | Glycine tRNA synthetase Q protein | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-GlySQ-tT7hyb6 | GlySQ | Glycine tRNA synthetase --- glyS + glyQ | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-GlyQS-tT7hyb6 | GlyQS | Glycine tRNA synthetase --- glyQ + glyS | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-GlyQS-DualHis-tT7hyb6 | GlyQS-DualHis | Glycine tRNA synthetase --- glyQ + glyS (dual 6xHis) | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-HisRS-tT7hyb6 | HisRS | Histidine tRNA synthetase | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-IleRS-tT7hyb6 | IleRS | Isoleucine tRNA synthetase | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-LeuRS-tT7hyb6 | LeuRS | Leucine tRNA synthetase | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-LysRS-tT7hyb6 | LysRS | Lysine tRNA synthetase | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-MetRS-tT7hyb6 | MetRS | Methionine tRNA synthetase | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-pheS-tT7hyb6 | pheS | Phenylalanine tRNA synthetase S protein | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-pheT-tT7hyb6 | pheT | Phenylalanine tRNA synthetase T protein | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-pheST-tT7hyb6 | pheST | Phenylalanine tRNA synthetase --- pheT + pheS | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-pheTS-tT7hyb6 | pheTS | Phenylalanine tRNA synthetase --- pheS + pheT | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-pheST-DualHis-tT7hyb6 | pheST-DualHis | Phenylalanine tRNA synthetase --- pheS + pheT (dual 6xHis) | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-ProRS-tT7hyb6 | ProRS | Proline tRNA synthetase | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-SerRS-tT7hyb6 | SerRS | Serine tRNA synthetase | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-ThrRS-tT7hyb6 | ThrRS | Threonine tRNA synthetase | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-TrpRS-tT7hyb6 | TrpRS | Tryptophan tRNA synthetase | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-TyrRS-tT7hyb6 | TyrRS | Tyrosine tRNA synthetase | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-ValRS-tT7hyb6 | ValRS | Valine tRNA synthetase | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-MTF-tT7hyb6 | MTF | Methionyl tRNA transformylase | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-IF1-tT7hyb6 | IF1 | Initiation factor 1 | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-IF2-tT7hyb6 | IF2 | Initiation factor 2 | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-IF3-tT7hyb6 | IF3 | Initiation factor 3 | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-EF-G-tT7hyb6 | EF-G | Elongation factor G | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-EF-Tu-tT7hyb6 | EF-Tu | Elongation factor Tu | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-EF-Ts-tT7hyb6 | EF-Ts | Elongation factor Ts | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-RF1-tT7hyb6 | RF1 | Release factor 1 | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-RF2-tT7hyb6 | RF2 | Release factor 2 | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-RF3-tT7hyb6 | RF3 | Release factor 3 | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-RRF-tT7hyb6 | RRF | Ribosome recycling factor | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-AK-Gg-tT7hyb6 | AK-Gg | Adenylate Kinase (Gallus gallus) | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-AK-Sc-1-tT7hyb6 | AK-Sc-1 | Adenylate Kinase (S. cerevisiae S288C) | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-AK-Oc-tT7hyb6 | AK-Oc | Adenylate Kinase (O. cuniculus) | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-AK-Ec-tT7hyb6 | AK-Ec | Adenylate Kinase (E. coli) | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-CK-Oc-tT7hyb6 | CK-Oc | Creatine Kinase (O. cuniculus) | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-CK-Gg-tT7hyb6 | CK-Gg | Creatine Kinase (G. gallus) | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-NDK-tT7hyb6 | NDK | Nuceloside-diphosphate Kinase | pT7 | N-term | SP-CDS | |
pT7-lacO-UTR1-PPiase-tT7hyb6 | PPiase | Pyrophosphotase | pT7 | C-term | SP-CDS | |
pT7-lacO-UTR1-T7RNAP-tT7hyb6 | T7RNAP | T7 RNA Polymerase | pT7 | N-term | SP-CDS |
References
MoClo Paper - https://doi.org/10.1371/journal.pone.0016765
Anderson Collection - http://parts.igem.org/Promoters/Catalog/Anderson