20.109(S07): Start-up expression engineering
Contents
Introduction
From your work so far this term, you have a good understanding of (at least) two fundamental concepts. From our first experimental module it should be clear that the genetic program for running a cell is readable (through sequencing), writable (through molecular biological techniques and synthesis) and somewhat, though not perfectly understandable. Recall how a genetic part can be as small as 13 base pairs, BBa_B0032 for example, to help carry out the work of protein production by enabling a huge protein/RNA complex, the ribosome, to bind an RNA molecule. From our second experimental module, it should be clear that a cell's programming doesn't end at protein production but rather that proteins are dynamic (chemically and spatially). They react to changes in the environment with great speed and sensitivity. We saw that proteins could be considered "digital" information, either present or absent, but are perhaps better thought of as "tunable" analog data since, as far as the cell is concerned, proteins are relevant only when functional, and function can be governed by a regulating protein stability, localization and modifications. Thus, from the work we've done so far this term, you may have the idea that gene expression in a cell is powered by the central dogma (DNA making RNA making protein) and then modulated by varying the properties of proteins once they are made. This strategy minimizes reaction times but is, energetically speaking, quite wasteful. Why make a protein if it's not useful? In this experimental module we'll see how nature has refined the genetic programming language so a cell's proteins are not all constitutively expressed but rather are finely regulated at the level of transcription initiation. This turns out to be only one of many points for regulation but it's an important one.
In the upcoming experiment we'll also see how nature has overcome another design issue, namely space constraints. A cell's dimensions do not increase linearly with DNA content. Instead, eukaryotic cells remain compartmentalized and compact the DNA by wrapping it around assemblies of histone proteins called nucleosomes. Nucleosomes wrap around eachother to form chromatin. This solves the space issue, allowing a meter or so of DNA to be crammed into a space perhaps 10 um across, but creates a new problem. Wrapped DNA is less accessible to the transcription and replication machinery. Gene expression becomes newly and intimately related to chromatin dynamics. This new problem is overcome by other multiprotein complexes that interact with the DNA-wrapping proteins. Nucleosomes are redistributed around genes that are "active" though it remains unclear if this redistribution is a cause or a consequence of the activity. We'll study one chromatin-remodeling complex called SAGA in this experimental module.
The name "SAGA" is an acronym for "Spt-Ada-Gcn5-acetyltransferase." Before describing each of these components, it's important to note that biochemically similar complexes are found in many (all?) of the eukaryotic cells that have been studied. Even more remarkable, these SAGA complexes have similar (identical?) numbers of protein subunits and the proteins have noteable sequence homologies, suggesting conserved functions even in cells with diverse life-styles like yeast and human cells. Natural processes like development and division and disease states like cancer can be understood at the level of transcription (mis)regulation. Chromatin remodeling is required for appropriate gene expression which is, in turn, required for healthy cell behaviors. Thus, there is good reason to believe that an understanding of how SAGA works in yeast can give us insight into its role in cells more medically relevant, like human.
A combination of biochemical and genetic data initially suggested that an enzymatic activity, namely a histone-acetyl transferase, encoded by the GCN5 gene in S. cerevisiae existed as a large protein complex called that the authors named "SAGA" [Grant et al 1997]. Nineteen proteins, including GCN5, associate to form SAGA, though surprisingly not all the proteins are absolutely required for the cell to live. Delete a nonessential subunit and the cell can still grow and divide, although sometimes with impaired functions. A table of all 19 SAGA subunits, including information about their essential or non-essential nature, is included as part of today's protocol. A recent structure for the complex was elucidated through electron microscopy [Wu et al 2004]. The SAGA structure, as shown above, can be imagined to "dock" with the DNA and associated proteins, allowing us to imagine some very elegant models for how chromatin-modifications might be performed and regulated. Many terrific and exciting experiments can be designed to test these models. Your experiment will be a systematic examination of the non-essential SAGA subunits and their role in gene expression. Today you will design some primers to delete from the yeast genome a nonessential subunit of your choosing. Later in this experimental module, you will examine your mutated strain for changes in gene expression, looking for new phenotypes associated with the SAGA-subunit deletion as well as looking by DNA microarray for genes whose expression is affected by the loss of this subunit.
Protocols
Part 1: Choosing a SAGA subunit
You should begin by learning a little about the different classes of SAGA subunits. What does "Spt" stand for (and how do you pronounce that?!)? What is a TAF? Why is Ada5 also called Spt20?
The nomenclature for S. cerevisiae is precise and helpful. Wild type genes are normally given an italicized, three letter acronym based on the phenotype of a mutation in that gene. So a HIS gene is unable to make histidine if the gene is defective (of course, dead cells all have only one phenotype so this presumes loss of the gene product doesn't kill the cell...). Since there exist several genes that can give rise to similar phenotypes, associated genes are given a number, e.g. HIS3, HIS4,, etc. To describe recessive mutant alleles, lower case letters are used. So a strain that is his3 has a mutation that affects the function of the HIS3 gene. Since there can be several different mutations described for any given gene, a second number gets associated with the mutant, e.g. his3-1 or his3delta200. Proteins are distinguished from DNA by capitalizing only the first letter of the gene product: the HIS3 gene makes the His3 protein. Naturally there are exceptions to these rules, but in general you can pretty confidently follow them.
Begin aquainting yourself with the SAGA subunits by copying the following tables to your wiki userpage, then finding relevant information in the Saccharomyces Genome Database for the subunits. You can also search Pubmed to find out a little more about the role of the subunits in gene expression. Do not shortchange yourself on this part of the experiment, since you will be working with the subunit you choose today for the rest of the module.
SAGA subunits, S. cerevisiae
Ada subunits | size,chromosome,null p-type | notes |
---|---|---|
Ada1 (aka HFI1, SUP110, SRM12, GAN1) | 1.467 kb/489 aa, Chr. XVI, viable |
|
Ada2 (aka SWI8) | 1.305 kb/434aa, Chr. IV, viable |
|
Ada3(aka NGG1, SWI7) | 2.109 kb/702aa, Chr. IV, viable |
|
Gcn5 (aka ADA4, SWI9) | 1.32 kb/439aa, Chr. VII, viable |
|
Ada5 (aka SPT20) | 1.815 kb/604aa, Chr. XV, viable |
Spt subunits | size, chromosome, null p-type | notes |
---|---|---|
Spt3 | 1.014 kb/337aa, Chr. IV, viable |
|
Spt7(aka GIT2) | 3.999 kb/1332aa, Chr. II, viable |
|
Spt8 | 1.809 kb/602aa, Chr. XII, viable |
|
Spt20 (aka Ada5) | 1.815 kb/604aa, Chr. XV, viable |
TAF subunits | size, chromosome, null p-type | notes |
---|---|---|
TAF5 (aka TAF90) | 2.397 kb/798aa, Chr. II, inviable | |
TAF6 (aka TAF60) | 1.551 kb/516aa, Chr. VII, inviable | |
TAF9 (aka TAF17) | 0.474 kb/157aa, Chr. XIII, inviable | |
TAF10 (aka TAF23, TAF25) | 0.621 kb/206aa, Chr. IV, inviable | |
TAF12(aka TAF61, TAF68) | 1.620 kb/539aa, Chr. IV, inviable |
Tra1 subunit | size, chromosome, null p-type | notes |
---|---|---|
Tra1 | 11.235 kb/3744aa, Chr. VIII, inviable |
other subunits | size, chromosome, null p-type | notes |
---|---|---|
Sgf73 | 1.974 kb/657aa, Chr. VII , viable |
|
Sgf29 | 0.779 kb/259aa, Chr. III, viable |
|
Sgf11 | 0.3 kb/99aa, Chr.XVI, viable |
|
Ubp8 | 1.416 kb/471aa, Chr. XIII, viable |
|
Sus1 | gene with intron, Chr. II, viable |
Part 2: Designing deletion oligos
Everyone's starting strain is called FY2068, which has the following genotype:MAT(alpha) ura3-52 his3D200 leu2D1 lys2-128delta
Everyone will delete a SAGA subunit of their choosing by replacing it with a selectable marker, namely the URA3 gene. Successful replacement of the SAGA subunit with the URA3 gene will restore growth of the strain on media lacking uracil ("SC" for "synthetic complete" "minus ura" for the absence of uracil). The primers you design today will have sequences identical to the URA3 gene, shown below. The homology will enable the primers to anneal to the URA3 gene and amplify it during the polymerase chain reactions you will perform at the end of lab today. These reactions will be more fully explained later in the experimental module.
Everyone's primers will also include "tails" that will later allow the amplified URA3 gene to replace the SAGA subunit gene of interest. These tails must be at least 39 bases long to allow for sufficient specificity and frequency of recombination once transformed for recombination into the yeast cell. This allows 20 bases for annealing to the URA3 gene since most oligonucleotide synthesis companies change their pricing structure and recommendations for oligos longer than 59 bases. Limitations in the synthesis technology impose the 59 base limit, but it turns out to be minimally intrusive for experiments like these.
Designing the "forward" primer
Designing the "reverse primer
Part 3: PCR
- pRS406 template
DONE!