In vitro compartmentalization (IVC) is an emulsion-based technology that generates cell-like compartments in vitro. These compartments are designed such that each contains no more than one gene. When the gene is transcribed and/or translated, its products (RNAs and/or proteins) become ‘trapped’ with the encoding gene inside the compartment. By coupling the genotype (DNA) to the phenotype (RNA, protein), compartmentalization allows the selection and evolution of phenotypes.
The in vitro compartmentalization method was first developed by Tawfik et al. Based on the idea that Darwinian evolution relies on the linkage of genotype to phenotype, Tawfik et al. designed the aqueous compartments of water-in-oil (w/o) emulsions to mimic cellular compartments that link genotype and phenotype. Emulsions of cell-like compartments were formed by adding an in vitro transcription/translation reaction mixture to stirred mineral oil containing surfactants. The mean droplet diameter was measured to be 2.6 μm by laser diffraction. As a proof of concept, Tawfik et al. designed an experiment in which the M.HaeIII gene was transcribed and translated in the presence of a 10⁷-fold excess of genes encoding another enzyme (folA). The 3’ end of each gene was designed to contain HaeIII restriction/modification (R/M) sequences, so that HaeIII methyltransferase expressed from an M.HaeIII gene would methylate the HaeIII R/M sequences and make that gene resistant to restriction enzyme digestion. By selecting for DNA sequences that survived endonuclease digestion, Tawfik et al. found that M.HaeIII genes were enriched 1000-fold in the first round of selection.
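The arithmetic behind these figures can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and the assumption that every round delivers the same 1000-fold enrichment is made here for simplicity and is not a result reported for the experiment above.

```python
def ratio_after_rounds(initial_excess: float, enrichment: float, rounds: int) -> float:
    """Target:competitor gene ratio after a number of selection rounds.

    Assumes (for illustration only) the same enrichment in every round.
    """
    return (1.0 / initial_excess) * enrichment ** rounds


def rounds_to_majority(initial_excess: float, enrichment: float) -> int:
    """Smallest number of rounds after which the target gene outnumbers the competitor."""
    if enrichment <= 1.0:
        raise ValueError("enrichment per round must be greater than 1")
    rounds = 0
    while ratio_after_rounds(initial_excess, enrichment, rounds) <= 1.0:
        rounds += 1
    return rounds


if __name__ == "__main__":
    # 10^7-fold competitor excess and 1000-fold enrichment, as quoted in the text
    print(ratio_after_rounds(1e7, 1e3, 1))
    print(rounds_to_majority(1e7, 1e3))
```

Under that constant-enrichment assumption, one round takes the target from 1 in 10⁷ to roughly 1 in 10⁴, and three rounds would be needed before M.HaeIII genes outnumber the competitor.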
Water-in-oil (w/o) emulsions are created by mixing aqueous and oil phases with the help of surfactants. A typical IVC emulsion is formed by first preparing an oil-surfactant mixture by stirring, and then gradually adding the aqueous phase to it. For stable emulsion formation, a mixture of high-HLB (hydrophile-lipophile balance) and low-HLB surfactants is needed. Surfactant combinations used to prepare the oil-surfactant mixture include mineral oil / 0.5% Tween 80 / 4.5% Span 80 / sodium deoxycholate and a more heat-stable version, light mineral oil / 0.4% Tween 80 / 4.5% Span 80 / 0.05% Triton X-100. The aqueous phase containing the transcription and/or translation components is slowly added to the oil-surfactant mixture, and formation of the w/o emulsion is facilitated by homogenizing, stirring or using a hand-extruding device.
Emulsion quality can be assessed by light microscopy and/or dynamic light scattering. The droplets are quite heterogeneous in size, and greater homogenization speeds help to produce smaller droplets with a narrower size distribution. However, the homogenization speed has to be controlled, since speeds over 13,500 r.p.m. tend to cause a significant loss of enzyme activity at the level of transcription. The most widely used emulsion formulation gives droplets with a mean diameter of 2-3 μm and an average volume of ~5 femtoliters, or 10¹⁰ aqueous droplets per ml of emulsion. The ratio of genes to droplets is chosen such that, statistically, most droplets contain no more than a single gene.
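The droplet numbers and the single-gene condition can be checked with a short calculation. The sketch below assumes spherical droplets and assumes that genes distribute among droplets independently, so that occupancy follows a Poisson distribution. The function names, the 50 μl aqueous volume (the value implied by 10¹⁰ droplets of ~5 fl each) and the example gene-to-droplet ratio of 0.1 are illustrative choices, not values prescribed by any protocol.

```python
import math


def droplet_volume_fl(diameter_um: float) -> float:
    """Volume of a spherical droplet in femtoliters (1 cubic micrometer = 1 fl)."""
    return math.pi * diameter_um ** 3 / 6.0


def droplets_per_ml(aqueous_ul_per_ml: float, droplet_fl: float) -> float:
    """Number of droplets formed from a given aqueous volume per ml of emulsion."""
    aqueous_liters = aqueous_ul_per_ml * 1e-6
    droplet_liters = droplet_fl * 1e-15
    return aqueous_liters / droplet_liters


def occupancy(genes: float, droplets: float) -> dict:
    """Poisson fractions of droplets holding zero, one, or more than one gene."""
    lam = genes / droplets  # mean number of genes per droplet
    p_empty = math.exp(-lam)
    p_single = lam * math.exp(-lam)
    return {
        "empty": p_empty,
        "single": p_single,
        "multiple": 1.0 - p_empty - p_single,
    }


if __name__ == "__main__":
    print(droplet_volume_fl(2.0))       # volume at the low end of the 2-3 um range
    print(droplets_per_ml(50.0, 5.0))   # droplets from 50 ul of aqueous phase
    print(occupancy(1e9, 1e10))         # one gene per ten droplets
```

A 2 μm sphere holds about 4.2 fl, consistent with the ~5 fl average quoted above. At one gene per ten droplets, fewer than 0.5% of droplets would be expected to hold more than one gene, at the cost of leaving about 90% of droplets empty.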
IVC has been carried out with bacterial cell, wheat germ and rabbit reticulocyte lysate (RRL) extracts for transcription and translation. It is also possible to use a reconstituted bacterial translation system such as PURE, in which the translation components are individually purified and later combined. When expressing eukaryotic or complex proteins, it is desirable to use a eukaryotic translation system such as wheat germ extract or, as a superior alternative, RRL extract. The traditional emulsion formulation cannot be used with RRL because it abolishes translation. Instead, a novel emulsion formulation, 4% Abil EM90 / light mineral oil, was developed and shown to support the expression of luciferase and human telomerase.
Once transcription and/or translation is complete in the droplets, the emulsion is broken by successive steps that remove the mineral oil and surfactants, to allow for subsequent selection. At this stage, it is crucial to have a method to ‘track’ each gene product back to its encoding gene, since both become free-floating in a heterogeneous population of molecules. There are three major approaches for tracing each phenotype to its genotype. The first is to attach a biotin group to each DNA molecule and to add a coding sequence for streptavidin (STABLE display). All the newly formed proteins/peptides are then expressed as fusions with streptavidin and bind to their biotinylated coding sequence. An improved version attached two biotin molecules to the ends of each DNA molecule to increase the avidity between the DNA and the streptavidin-fused peptides, and used a synthetic streptavidin gene with low GC content to increase efficiency and specificity during PCR amplification. The second approach is to link DNA and protein covalently, for which two strategies have been demonstrated. The first strategy is to form M.HaeIII fusion proteins: each expressed protein/polypeptide is fused to the HaeIII DNA methyltransferase domain, which binds covalently to DNA fragments containing the sequence 5’-GGC*-3’, where C* is 5-fluoro-2’-deoxycytidine. The second strategy is to use a monomeric mutant of the Agrobacterium enzyme VirD2: a protein/peptide expressed as a fusion with VirD2 binds to its DNA coding sequence, which carries a single-stranded overhang comprising the VirD2 T-border recognition sequence. The third approach is to link phenotype and genotype via beads. The beads are coated with streptavidin to allow binding of biotinylated DNA, and they also display the cognate binding partner of an affinity tag that is expressed as a fusion with the protein/peptide.
Different selection strategies are used depending on the phenotype to be selected. They can be divided into three major categories: selection for binding, selection for catalysis and selection for regulation. The phenotype to be selected can range from RNA to peptide to protein. In selections for binding, the most commonly evolved phenotypes are peptides/proteins with selective affinity for a specific antibody or DNA molecule. An example is the selection by Sepp et al. of proteins that have affinity for zinc finger DNA. Selections for catalytic proteins/RNAs usually isolate new variants with novel or improved enzymatic properties. For example, new ribozyme variants with trans-ligase activity were selected and shown to exhibit multiple turnovers. Selections for regulation can yield inhibitors of DNA nucleases, such as protein inhibitors of the Colicin E7 DNase.
Compared with other in vitro display technologies, IVC has two major advantages. The first is the ability to control the reactions within the droplets. Hydrophobic and hydrophilic components can be delivered to each droplet in a step-wise fashion without compromising the chemical integrity of the droplet, so the reaction in each droplet can be controlled by choosing what is added and when. In addition, depending on the nature of the reaction to be carried out, the pH of each droplet can also be changed. More recently, photocaged substrates have been used, with their participation in a reaction regulated by photo-activation. The second advantage is that IVC allows the selection of catalytic molecules. As an example, Griffiths et al. were able to select for phosphotriesterase variants with higher kcat by detecting product formation with an anti-product antibody and quantifying the amount of product by flow cytometry.