Difference between revisions of "20.109(F21):M2D1"

From Course Wiki
Jump to: navigation, search
(Created page with "<div style="padding: 10px; width: 820px; border: 5px solid #909497;"> {{Template:20.109(F21)}} ==Introduction== Though the theme of Module 2 is focused on screening for ...")
 
(Part 2: Visually evaluate chemical structures of putative binders)
 
(24 intermediate revisions by 2 users not shown)
Line 1: Line 1:
 
<div style="padding: 10px; width: 820px; border: 5px solid  #909497;">
 
<div style="padding: 10px; width: 820px; border: 5px solid  #909497;">
 
{{Template:20.109(F21)}}
 
{{Template:20.109(F21)}}
 
  
  
 
==Introduction==
 
==Introduction==
 +
A small molecule microarray (SMM) requires the covalent attachment of a library of small molecules to a glass slide. Our library is meant to broadly sample the drug-like chemical space (''i.e.'' all possible chemical structures that have drug-like physical properties) and contains about 50,000 small molecules. Some libraries are much smaller, while many pharmaceutical companies possess high-throughput screening (HTS) collections of millions of compounds. Because this chemical space is very large, it’s difficult to generalize any single chemical reaction for this attachment that can be applied to all small molecules. We take a “one-size-fits-most” approach, where the glass slide is functionalized with a broadly reactive electrophile capable of reacting with nucleophiles present in most drug-like small molecules, such as alcohols or amines. Many small molecules contain multiple nucleophiles suitable for attachment. In this case, our manufacturing will result in a mixture of attachment sites. It’s important to remember that attachment to the glass slide constrains the possible orientation of the protein-small molecule interaction; some orientations are not possible because the glass slide and linker are in the way.
  
Though the theme of Module 2 is focused on screening for small molecules that bind the PF3D7_1351100 protein, today will focus on a few key techniques used in DNA engineering. Because the sequence of proteins is determined by the sequence of the genes that encode them, learning how to manipulate DNA is an important first stepToday you will complete the cloning steps used to incorporate the gene that encodes the PF3D7_1351100 protein into an expression vectorThe expression vector contains the genetic elements needed to express and purify a protein of interest.
+
We start with a glass slide with exposed amines across the surface and attach a short PEG (polyethylene glycol) linker. To the end of this PEG linker, an isocyanate group is attached. Isocyanate, or R-N=C=O, is a resonant structure, and a partial positive charge is stabilized on the carbon atomThis carbon atom is electrophilic, and small molecules with nucleophiles will react hereWe estimate that about 70% of drug-like small molecules are amenable to this reaction, and our library is filtered to contain only these molecules.
  
Expression vectors contain several features important for cloning, plasmid replication, and protein expression -- all of which are important for purifying high-quality protein. To generate this expression plasmid, two common DNA engineering techniques were used: restriction enzyme digestion and ligation. First, the PF3D7 insert was synthesized such that the gene sequence is flanked by restriction enzymes sites. Next, this fragment and the vector were digested to create compatible ends. Last, the compatible ends of the digested insert and vector were  ligated to generate the pET-28b(+)_PF3D7_1351100 expression plasmid.
+
[[Image:Sp17 20.109 M1D4 SMM printing.png|thumb|700px|center|Image from Bradner, J. E. et al. [http://www.ncbi.nlm.nih.gov/pubmed/17406478 PMID: 17406478]]]
  
[[Image:Fa20 M2D1 cloning schematic.png|thumb|center|450px|'''Schematic of pET-28b(+)_PF3D7_1351100 cloning.''']]
+
Our compound library is dissolved in DMSO and stored in 384-well plates. To dispense the compounds onto our functionalized glass slide, a robotic arm with a set of 48 metal pins is used to transfer the compounds to the glass slide. Each metal pin has a small slit in the end, and capillary action is used to precisely withdraw and dispense consistent volumes. When the pins touch the glass slide, the compound in DMSO is dispensed into a small circle of approximately 160 micron in diameter. Each pin prints one compound in two different locations on each slide, and then the pins are washed in water and DMSO. This process is repeated for each compound, resulting in our final microarray. The microarray is divided into 48 subarrays, and each subarray corresponds to one pin and contains 256 discrete spots. Within each subarray, we print a set of fluorescent compounds in the shape of an ‘X’ so that we can precisely determine where each spot is printed. After the compounds react, we quench the surface so that no electrophiles remain. This results in our final microarray; a collection of approximately 12,000 discrete spots displaying one compound each.
  
'''Restriction enzyme digest'''
+
(Written by Rob Wilson.  For more information read Bradner, J. E., McPherson, O. M., and Koehler, A. N. (2006)  ''A method for the covalent capture and screening of diverse small molecules in a microarray format.'' Nature Prot. 1:2344-2352. [http://www.ncbi.nlm.nih.gov/pubmed/17406478 PMID: 17406478].)
  
[[Image:Mod1 1 eco ri.jpg|thumb|right|300px|'''Schematic of DNA digestion.''']]
+
==Protocols==
  
Restriction endonucleases, also called restriction enzymes, 'cut' or 'digest' DNA at specific sequences of bases. The restriction enzymes are named according to the prokaryotic organism from which they were isolated. For example, the restriction endonuclease ''EcoRI'' (pronounced “echo-are-one”) was originally isolated from ''E. coli'' giving it the “Eco” part of the name. “RI” indicates the particular version on the ''E. coli strain'' (RY13) and the fact that it was the first restriction enzyme isolated from this strain.
+
===Part 1: Review SMM experimental details===
  
The sequence of DNA that is bound and cleaved by an endonuclease is called the recognition sequence or restriction site. These sequences are usually four or six base pairs long and palindromic, that is, they read the same 5’ to 3’ on the top and bottom strand of DNA. For example, the recognition sequence for ''EcoRI'' is <font face="courier">5’ GAATTC 3’</font> (see figure at right)''EcoRI'' cleaves the phosphate backbone of DNA between the G and A of the recognition sequence, which generates overhangs or 'sticky ends' of double-stranded DNA.
+
For timing reasons, you will not perform an SMM experiment this semester. Instead you will expand on the results gathered by researchers in the Niles Laboratory to test putative small molecule binders in a secondary assayTo ensure that you are familiar with the steps used in an SMM screen, review the procedure below.
  
Unlike ''EcoRI'', some other restriction enzymes cut precisely in the middle of the palindromic DNA sequence, thus leaving no overhangs after digestion. The single-stranded overhangs resulting from DNA digestion by enzymes such as ''EcoRI'' are called sticky ends, while double-stranded ends resulting from digestion by enzymes such as ''HaeIII'' are called blunt ends. ''HaeIII'' recognizes <font face="courier">5’ GGCC 3’</font> and upon recognition cuts in the center of the sequence.
+
====Perform SMM screen====
  
'''Ligation'''
+
<font color = #0d368e>'''To ensure the steps included below are clear, please watch the video tutorial linked here: [[https://www.dropbox.com/s/0phy1rg1w23fs49/SMM%20Staining.mp4?dl=0 SMM Screen]].  The steps are detailed below so you can follow along!'''</font color>
  
[[Image:Mod1 3 dnaligatn.jpg|thumb|right|400px|'''Schematic of DNA ligation.''']]
+
#Obtain two printed SMM slides from the front laboratory bench.[[Image:Sp17 20.109 M1D4 forceps.png|thumb|250px|right|Only use forceps at the barcoded end of the slide.]]
In a ligation reaction, DNA ends are covalently attached to one another via the ligase enzyme. The efficiency of the reaction is related to type of DNA ends: compatible sticky ends will ligate more efficiently than blunt ends, and non-compatible sticky ends will not be ligated due to the lack of hydrogen bonding between the basepairs. To initiate the ligation reaction, hydrogen bonds are formed between the compatible overhangs of DNA fragmentsThe ligase enzyme then forms a covalent phosphodiester bond between the 3' hydroxyl end of the 'acceptor' nucleotide and the 5' phosphodiester end of the 'donor' nucleotide.
+
#*Only hold them at the barcoded end as you would otherwise disrupt the ligands that are printed on the surface.
 +
#Record the barcode numbers of your slides in your laboratory notebook.
 +
#Carefully place each slide, barcode facing up, into a well of a 4-well chamber dish using the slide forceps.[[Image:Sp17 20.109 M1D4 pipet.png|thumb|200px|right|Add all liquid to the barcoded end of the slide.]]
 +
#*Again, only hold the slide at the barcoded end.
 +
#Add 3 mL of your diluted protein solution to the chambers that contain your slides.
 +
#*Pipet the diluted protein solution onto the barcoded end of the slides.
 +
#*Avoid generating bubbles as you add your diluted protein solution.
 +
#Gently rock the 4-well chamber dish back and forth until the slides are completely covered with the diluted protein solution.
 +
#Cover the 4-well chamber dish with aluminum foil and incubate on the rocking table for 1 hr.
 +
#*During all incubations it is important to keep the slides covered as the fluorescein spots used for alignment during scanning are sensitive to light.[[Image:Sp17 20.109 M1D4 liquid.png|thumb|200px|right|Pour liquid from the corner of the 4-well chamber dish.]]
 +
#Carefully pour the diluted protein solution from your 4-well chamber dish into a waste container by tipping the dish such that the liquid leaves the dish from the corner.
 +
#*The surface tension should keep the slides in the 4-well chamber dish; however, it is good to be careful that they do not fall into the waste container.
 +
#Add ~3 mL of TBS-T to the chambers that contain your slides.
 +
#*Remember, pipet the liquid onto the barcoded end of the slides.
 +
#*Don't 'push' the liquid across the surface of the slide with the pipetInstead, gently rock the 4-well chamber dish back and forth to completely cover your slides.
 +
#Incubate the slides on the rocking table for 2 min.
 +
#*Don't forget to cover the 4-well chamber dish with aluminum foil.
 +
#Pour the TBS-T from the 4-well chamber dish as above into a waste container.
 +
#Complete Steps #8 - 10 a total of 3 times.
 +
#Move your slides to a fresh 4-well chamber dish to eliminate carryover from the protein and/or buffer solutions.
 +
#Prepare 7 mL of anti-His antibody solution in TBS-T.
 +
#*Dilute the antibody stock 1:1000 at the front laboratory bench.
 +
#Add 3 mL of the diluted antibody solution to the chambers that contain your slides.
 +
#*Pipet the diluted antibody solution onto the barcoded end of the slides.
 +
#*Avoid generating bubbles as you add your diluted antibody solution.
 +
#Gently rock the 4-well chamber dish back and forth until the slides are completely covered with the diluted antibody solution.
 +
#Cover the 4-well chamber dish with aluminum foil and incubate on the rocking table for 1 hr.
 +
#Carefully pour the diluted antibody solution from your 4-well chamber dish into a waste container.
 +
#Add ~3 mL of TBS (not TBS-T!) to remove excess detergent and incubate your covered 4-well chamber dish on the rocking table for 2 min, then pour liquid into a waste container.
 +
#Lastly, wash the slides with dH<sub>2</sub>O.
 +
#*Obtain 2 reservoirs and add fresh dH<sub>2</sub>O to each.
 +
#*Label the reservoirs '1' and '2'.
 +
#Wash each slide by removing it from the 4-well chamber dish with the slide forceps and dunking it 8 times into reservoir 1, then dunking it 8 times into reservoir 2.
 +
#Dry your slides by carefully inserting your slides in the 'slide spinner' at the front laboratory bench.
 +
#*Be mindful not to scrape the face of your slides on the slide holder.
 +
#*Centrifuge your slides for 30 sec.
 +
#Transfer your slides into fresh 50 mL conical tubes and wrap in aluminum foil.
 +
#Store SMM slides at 4 &deg;C until imaging.
  
The first step in this process is the addition of AMP (adenylation) to a lysine residue within the active site of DNA ligase, which releases a pyrophosphate.  Next, the AMP is transferred to the 5' phosphate of the donor nucleotide resulting in the formation of a pyrophosphate bond. Lastly, a phosphodiester bond is formed between the 5' phosphate of the donor nucleotide and the 3' hydroxyl of the 3' acceptor nucleotide.
+
====Scan SMM slides====
  
==Protocols==
+
After the printed SMM slides are incubated with the purified PF3D7_20109-F21 protein, the next step is to check which of the small molecules, if any, you screened may be able to bind PF3D7_20109-F21. To do this the slides are imaged using a Genepix microarray scanner.  The scanner measures the fluorescence signal emitted from the slide at two wavelengths: 532 nm
 +
and 635 nm.  The goal for today is to familiarize you with how the SMM slides are scanned and imaged.  These images are the raw data that will be used to identify putative small molecule binders.
  
Because DNA engineering at the benchtop can take days, if not weeks, you will clone the expression plasmid ''in silico'' today. You can use any DNA manipulation software you choose to complete the protocols, but the instructions provided are for SnapGene.  Please note that if you use a different program the Instructors may not be able to assist you.
+
[[Image:Fa21 M2D1 SMM complex & scan.png|thumb|700px|center|'''Overview of SMM slide imaging.''' A. To visualize which small molecule is putative binder of PF3D7_20109-F21, the 6xHis-tag is labeled using an anti-His antibody that is conjugated to a fluorophore. B. The fluorophore used to label protein bound to the small molecule in a particular location on the slide emits a red signal (indicated by white arrow).  Image in Panel A generated using BioRender.]]<br>
  
===Part 1: Synthesis and restriction enzyme digest of PF3D7_1351100 insert===
+
When the SMM slides are imaged, the scanner exposes each slide to excitation light specific to the fluorophores used in the experiment. As shown in the figure above, two fluorophores were used to evaluate small molecule binding to PF2D7_20109-F21.  The green spots represent locations on the SMM slide where fluorescein was printed.  Fluorescein is a fluorescent dye that emits at 532 nm and is used for alignment purposes.  Correct alignment is critical to knowing which small molecules are in which spots on the slide.  Red spots are indicative of small molecules that are bound by PF3D7_20109-F21.  The signal is due to an Alexa Fluor 647 anti-His antibody that emits at 635 nm. 
  
To use SnapGene software off campus you must log into a VPN connection prior to opening the SnapGene. Here is the link to the [https://ist.mit.edu/cisco-anyconnect VPN download] and [http://kb.mit.edu/confluence/x/6QPn installation instructions]. Also you will need to update the SnapGene license number if you have not opened the application since March. The new license information can be found [http://downloads.mit.edu/released/snapgene/group-name_registration-code.txt here].  
+
During imaging, the scanner detects the intensity of the emitted fluorescence light to generate an image that can be analyzed. The intensity and position of the 'red' signal associated with putative small molecule binders is what will be assessed in the next laboratory session. This analysis will provide a list of 'hits' that can be used in future studies to identify drug candidates.
  
As discussed in the [[20.109(F20):Module 2|M2 project overview]], PF3D7_1351100 is an essential protein of unknown function in ''Plasmodium falciparum''.  In an effort to study PF3D7_1351100 a researcher in the Niles Laboratory, Dr. Khan Osman, cloned the gene that encodes the protein into an expression plasmid.  Rather than amplifying the gene from the ''P. falciparum'' genome, gBlock synthesis technology was used. A gBlock is a double-stranded DNA segment that is synthesized commercially without the use of live cells. To generate a gBlock, a researcher simply submits the basepair sequence to be synthesized.  This method is useful for several reasons including: 1. it can be technically difficult to amplify genes from certain organisms, and 2. it can be easier to modify DNA that is synthetically generated.  For this project, the PF3D7_1351100 gBlock sequence was modified such that the codon usage was optimized for expression in ''E. coli'' cells.
+
<font color = #0d368e>'''To ensure you are familiar with the steps involved in imaging the SMM experiment, please watch the video tutorial provided by a researcher from the Koehler Laboratory linked here: [https://www.dropbox.com/s/6o07ruvdtchg4ex/Scanning%20SMM.mp4?dl=0 SMM Scanning]. The steps are detailed below so you can follow along!'''</font color>
  
[[Image:Fa20 M2D1 insert synthesis and digest .png|thumb|right|300px|]]
+
#Place the slide in the scanner barcode side down.
 +
#Set the desired wavelengths that you want to scan.
 +
#*For fluorescein: 532 nm
 +
#*For Alexa Fluor 647: 635 nm
 +
#Set the wavelength, filter, PMT, and power settings.
 +
#Run a preview scan to optimize the PMT for the 635 nm (red) emission.
 +
#Complete a full scan using the optimal PMT value.
 +
#Following the scan, you will see an image of your slide as below.
  
#Open the word document with the PF3D7_1351100 insert sequence (linked[[Media:Fa20 M2D1 PF3D7 1351100 insert sequence.docx| here]]).
+
<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
#*Open SnapGene. From the options, select 'New DNA File...'.
+
*What color is emitted by fluorescein?  What does this indicate on the scan (what is present in spots that emit this signal)?
#*Copy and paste the sequence from the .docx file above.
+
*What color is emitted by Alexa Fluor 647? What does this indicate on the scan (what is present in spots that emit this signal)?
#*Enter "PF3D7_1351100" for the File Name (in the lower, right corner), select 'linear' for the topology (in the lower, left corner), then click 'OK'.
+
#A new window will open with a map of PF3D7_1351100 showing the unique restriction enzyme sites within the sequence. 
+
#In later steps you will generate a map of the PF3D7_1351100 insert cloned into the pET-28b(+) expression vector.  To make the map more visually useful, create a feature that defines the PF3D7_1351100 insert.
+
#*Click 'Sequence' from the options at the bottom of the window.
+
#*Highlight the entire sequence in the window.
+
#*From the toolbar, select 'Features' &rarr; 'Add Feature...'
+
#*In the new window name, type PF3D7_1351100 into the 'Feature:' box.
+
#*Select gene from the dropdown in the 'Type:' box and select the right facing arrowhead (this denotes the directionality of the insert).
+
#*Then click 'OK'.
+
#The PF3D7_1351100 gBlock was modified such that a 6xHis-tag sequence was added to the N-terminal end of the protein.  6xHis-tags are added for protein purification.
+
#<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> draw a schematic diagram that shows the following:
+
#*The gene sequence (as a line) with 5' and 3' orientation denoted.
+
#*The associated protein sequence (again, as a line) with the N' and C' termini denoted.
+
#*The location of the 6xHis-tag on the gene sequence and protein sequence.
+
#Add the 6xHis-tag (5' MGSSHHHHHHSSG 3') to the PF3D7_1351100 insert sequence by setting the cursor to the location in the DNA sequence.  Then begin typing the 6xHis-tag sequence.  A new window will appear with the typed bases.  Confirm the bases are entered correctly, then click 'Insert'.
+
#*Use the steps above to define the 6xHis-tag sequence as a feature.
+
#As shown in the schematic of our cloning strategy, NcoI and BamHI recognition sequences were added to the PF3D7_1351100 gBlock to enable cloning into the pET-28b(+) vector. Specifically, NcoI was added to the 5' end and BamHI to the 3' end of the PF3D7_1351100 sequence. 
+
#*Search the [http://www.neb.com/tools-and-resources/selection-charts/alphabetized-list-of-recognition-specificities NEB enzyme list] to find the NcoI and BamHI recognition sequences. 
+
#<font color = #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
+
#*Record the recognition sequences for NcoI and BamHI.  Include the cleavage locations within each sequence.
+
#*Include the recognition sites for NcoI and BamHI to the schematic diagram created above.  Should these recognition sites be included on the gene sequence or the protein sequence?
+
#Add the NcoI and BamHI recognition sites to the PF3D7_1351100 insert sequence by setting the cursor to the location in the DNA sequence, then begin typing.  A new window will appear with the typed bases.  Confirm the bases are entered correctly, then click 'Insert'.
+
#Now that you have the PF3D7_1351100 gBlock, you need to digest with NcoI and BamHI to generate 'sticky ends' that will enable you to ligate the PF3D7_1351100 insert into the pET-28b(+) vector.
+
#*On the map of the PF3D7_1351100 insert, select the NcoI recognition site by clicking on the enzyme name.  Then hold the shift key and select the BamHI recognition site.
+
#*This should highlight the area between the enzyme recognition sites.
+
#Click the drop-down arrow next to the 'Copy' icon at the top of the window.
+
#*Select 'Copy Restriction Fragment.'
+
#Click the drop-down arrow next to the 'New' icon at the top of the window.
+
#*Select 'New DNA File...'.
+
#*Paste the restriction fragment from the previous step in the text box, then click 'OK'.
+
#A new window will open with the digested PF3D7_1351100 insert.
+
#<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
+
#*Record the length of the insert.  How does the length of the insert compare to the length of the gBlock.
+
#*Is the insert double-stranded or single-stranded?
+
#*Is the insert a blunt end product or sticky end product?
+
#Save the insert file.
+
  
===Part 2: Restriction enzyme digest of pET-28b(+) expression vector===
+
[[Image:Sp17 20.109 M1D5 Genepix scan.png|thumb|600px|center|]]
  
For the ligation step, it is important to generate compatible 'sticky ends' on the insert and vector.  Above, you digested your PF3D7_1351100 insert with NcoI and BamHI in a double-digest to prepare the insert for your cloning.  Here you will digest the pET-28b(+) vector to create compatible ends that can be ligated.
+
====Analyze SMM images====
  
[[Image:Fa20 M2D1 vector digest.png|thumb|right|250px|]]
+
[[Image:Sp20 M1D6 background, foreground.png|right|800px|thumb]]The microarrayer reads the fluorescence signals emitted from the surface of the SMM slide at two excitation wavelengths.  As noted previously, the 532 nm wavelength was used to excite fluorescein, which was printed in an 'X' pattern to assist with alignment.  The 635 nm wavelength was used to excite Alexa Fluor 647-conjugated anti-His antibody; which would be associated with PF3D7_20109-F21 bound to a small molecule on the slide.  A hit denotes a spot on the slide that emits a red fluorescence signal significantly higher than the background fluorescence level.  In terms of protein binding, a hit denotes that the PF3D7_20109-F21 protein is bound to a small molecule and is therefore localized to a specific position on the slide.  You will analyze the fluorescence signal collected by the microarray scanner using a value termed the robust z-score. 
  
#Open the word document with the pET-28b(+) vector sequence (linked[[Media:Fa20 M2D1 pET-28b(+) vector sequence.docx| here]]).
+
The robust z-score differentiates signal from noise by providing a value that represents the intensity of a signal above background.  In the case of the SMM experiment, the intensity of a fluorescent signal above the background fluorescence is calculated. To do this the fluorescence emitted across the entire slide is grouped to define the Median Absolute Deviation (MAD), which is is a measure of the variability of a univariate dataset. Though beyond the scope of this class, the equation for calculating the robust z-score assigns a value for how much more intense the fluroescent signal at a spot is over backgroundThe higher the value, the more different the signal from background.
#*Copy and paste the vector sequence into a New DNA File window and save this sequence.
+
#*Be sure to select circular from the topology options.
+
#One very useful aspect of SnapGene is that the software is able to recognize features, or sequences that match known genes and binding sites, in DNA sequencesA window titled "Detect Common Features" should appear.
+
#<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> include a summary of the details provided about features in the pET-28b(+) vector.
+
#Select 'Add Features'.
+
#A new window will open with a map of the vector showing the unique restriction enzyme sites and annotated features within the sequence.
+
#To generate the sticky ends that will enable you to ligate the PD3D7_1351100 insert into the vector, view the map of your vector sequence.
+
#*Select the NcoI recognition site by clicking on the enzyme name, then hold the shift key and select the BamHI recognition site.
+
#*Select 'Actions' --> 'Restriction and Insertion Cloning' --> 'Delete Restriction Fragment...' from the toolbar.
+
#<font color = #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
+
#*What is the length of the digested vector product? 
+
#*How many basepairs were removed (compared to the intact cloning vector)?
+
  
===Part 3: Ligation of PF3D7_1351100 insert and pET-28b(+) expression vector===
+
After putative binders are identified via the robust z-score, the data are examined by-eye using the criteria discussed in the prelab. Please review the prelab notes for how this process is completed.
  
Before you prepare a ligation, one very important step is to calculate the amounts of DNA that will be used in the reaction.  Ideally, you should use a 3:1 molar ratio of insert to vector (note: it is a molar ratio, not a volumetric ratio!). You will use the steps below to calculate the volume amount (based on the molar ratio!) of the PF3D7_1351100 insert and pET-28b(+) expression vector you would use to complete this ligation in the laboratory.
+
<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
 +
*Why is it important to complete a by-eye inspection of the SMM results?
  
[[Image:Sp20 M1D1 recovery gel.png|thumb|right|300px|'''Recovery gel for ligation calculations.''' Lane 1 = pET-28b(+) expression vector, Lane 2 = molecular weight ladder, and Lane 3 = PF3D7_1351100 insert.]]
+
===Part 2: Visually evaluate chemical structures of putative binders===
  
Use the following information to calculate the volume of insert and vector needed to prepare a ligation with a 3:1 molar ratio (insert:vector).
+
One method for assessing protein-small molecule binding is to visually inspect known small molecule binders for common features / structures. Your goal for today is to carefully examine the hits identified by the class and identify any common features / structures. As in the image below, it is possible that multiple features will be present within the same small molecule.
*Concentration of PF3D7_1351100 insert solution = 25 ng/uL
+
*Concentration of pET-28b(+) expression vector solution = 50 ng/uL
+
*Molecular weight of a basepair = 660 g/mol
+
*Sizes, in basepairs, of the insert and vector sequences (this was determined in the exercises above!)
+
  
Though there are are different strategies that can be used to complete the ligation calculations, it may be easier to break the math into the following steps:
+
[[Image:Sp17 20.109 M1D7 chemical structure features.png|thumb|750px|center|]]
#Determine the volume of vector that will be used in the ligation reaction.
+
#*Typically, it is best to use 50 - 100 ng of vector.
+
#Calculate the moles of vector.
+
#Calculate the moles of insert.
+
#*Remember, this number should be 3-fold more than the moles of vector to accomplish a 3:1 molar ratio.
+
#Calculate the volume of insert that contains the appropriate moles of insert.
+
#One additional consideration is the volume of the reaction.  The total volume of the ligation reaction should not be greater than 15 &mu;L.  In this, the total volume of the insert and vector should not be greater than 13.5 &mu;L as additional reagents are required in the reaction.
+
#*If the insert and vector volume total greater than 13.5 &mu;L, you should (1) scale down both DNA amounts, using less than 50 ng backbone and/or (2) stray from the ideal 3:1 molar ratio.
+
#*You may ask the teaching faculty for advice during class if you are unsure what choice is best.
+
#<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> calculate the volume of insert and volume of vector that should be used for a ligation reaction that contains a 3:1 molar ratio of insert:vector.  Show all math!
+
#*Feel free to take a picture of your hand-written work and embed the image in your notebook.
+
#Next you will complete this ligation ''in silico'' to generate a map, or visual representation, of the pET-28b(+)_PF3D7_1351100 plasmid.[[Image:Fa20 M2D1 expression plasmid ligation.png|thumb|right|400px|]]
+
#To ligate the PF3D7_1351100 insert into the pET-28b(+) expression vector, select 'Actions' --> 'Restriction and Insertion Cloning' --> 'Insert Fragment...'.
+
#*A new window will open.  In the bottom workspace of the window, a cloning schematic will appear showing a vector and insert icon.
+
#*Click on the 'Vector' label.  Then in the workspace at the the right of the window, select the vector file from the 'Vector:' drop-down.
+
#*Select the restriction enzymes used to digest the expression vector from the drop-down boxes next to the text boxes that contain 'cut'.
+
#Next, click on the 'Insert' label at the bottom of the window and complete the steps as done for the expression vector.
+
#*For the insert, use the PF3D7_1351100 ''undigested'' file.
+
#Click 'Clone'.
+
#A new window will open with the cloned pET-28b(+)_PF3D7_1351100 product!
+
#<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
+
#*What is the size of the plasmid?  Does this make sense given the lengths of the insert and vector?
+
#*Does your sequence still contain a NcoI recognition sequence?  A BamHI recognition sequence?
+
  
===Part 4: Confirmation digest of pET-28b(+)_PF3D7_1351100===
 
  
To confirm the pET-28b(+)_PF3D7_1351100 construct that we will use for this module, you will perform a 'diagnostic' or 'confirmation' digest. As discussed in prelab, this step is an important control -- you want to be sure that the products you use in your research are correct! This  step is used to check products you clone yourself and, perhaps more importantly, those that you may receive from another researcher. 
+
With your partner, review the hits that were identified in the SMM screen completed for PF3D7_20109-F21 (see table below)To see the chemical structures, translate the SMILES strings using one of the methods described in the text below the table. It may be easier to copy / paste the small molecule images into a powerpoint file so you can readily see all of the structuresAlso, it may be helpful to use a color-coding system (like the one in the image provided above) to highlight features / structures that are common to the small molecules that putatively bind PF3D7_20109-F21.
 
+
Ideally you will use a single enzyme that cuts once within the vector and once within your insert.  Unfortunately, this is rarely an option and you instead need to select an enzyme that cuts once within the vector and a second, compatible enzyme that cuts once within the insert.  Enzyme compatibility is determined by the buffer.  If two enzymes are active, or able to cleave DNA, in the same buffer, they are compatibleThe [http://nebcloner.neb.com/#!/redigest NEB double digest online tool] will prove very helpful in identifying compatible enzyme combinations!
+
 
+
Use the information from prelab, the 20.109 list of enzymes (linked [[media:20109Enzymes.docx |here]]), and the plasmid map you generated above to choose the enzymes you will use.
+
#To choose restriction enzymes for your confirmation digest, look at the plasmid map for your pET-28b(+)_PF3D7_1351100 construct.
+
#*Identify possible sites that will enable to you confirm the pET-28b(+)_PF3D7_1351100 sequence.
+
#*Remember the guidelines discussed in prelab!
+
#After you identify the enzymes that you will use for the confirmation digest, complete a virtual digest in using the pET-28b(+)_PF3D7_1351100 map you generated above.
+
#*On the map of pET-28b(+)_PF3D7_1351100, select the first recognition site by clicking on the enzyme nameThen hold the shift key and select the second recognition site.
+
#*Select 'Tools' --> 'Simulate Agarose Gel' from the toolbar.
+
#<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
+
#*Record the expected fragment sizes from the confirmation digest.
+
#*Are the fragments distinct or ambiguously close together?
+
#Now that you identified which enzyme(s) to use in your confirmation digest, consider which controls should be included to ensure the results are interpretable.
+
#<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> explain why the following reactions are included as controls for the confirmation digest experiment:
+
#*Undigested pET-28b(+)_PF3D7_1351100.
+
#*Single digests of pET-28b(+)_PF3D7_1351100 (each enzyme used alone in a digest with pET-28b(+)_PF3D7_1351100).
+
#Use the table below to calculate the volumes of each reagent that should be included in the confirmation digest reactions.
+
#*The 20.109 enzyme stocks are always the "S" size and concentration when you search for them on the NEB website.
+
#*To find the concentration of the enzyme(s) you choose, search the [http://www.neb.com/products/restriction-endonucleases/restriction-endonucleases NEB site].
+
  
 
<center>
 
<center>
 
{| border="1"
 
{| border="1"
|
 
! Diagnostic digest <br>(enzyme #1 AND enzyme #2)
 
! Enzyme #1 ONLY
 
! Enzyme #2 ONLY
 
! Uncut <br>(NO enzyme)
 
 
|-
 
|-
| pET-28b(+)_PF3D7_1351100
+
| '''Compound ID'''
| 5 &mu;L
+
| '''Compound #'''
| 5 &mu;L
+
| '''SMILES'''
| 5 &mu;L
+
| 5 &mu;L
+
 
|-
 
|-
| 10X NEB buffer <br>
+
| 43847864
(buffer name:____________)
+
| 3
| 2.5 &mu;L
+
| CN(CCOC)C[C@@H]1CN(C[C@@H]1CO)CC2=CC3=CC=CC=C3C=C2
| 2.5 &mu;L
+
| 2.5 &mu;L
+
| 2.5 &mu;L
+
 
|-
 
|-
| Enzyme #1 <br>
+
| 30020341
(enzyme name:____________)
+
| 2
| ____ &mu;L
+
| CN(CCO)C[C@@H]1CN(C[C@@H]1CO)CC2=CC3=CC=CC=C3C=C2
| ____ &mu;L
+
|
+
|
+
 
|-
 
|-
| Enzyme #2 <br>
+
| 10351573
(enzyme name:____________)
+
| 4
| ____ &mu;L
+
| C1CC1[C@H]2CN(C[C@@H]2N)C(=O)C3=CN=C(N=C3)NC4=CC=CC=C4
|
+
|-
| ____ &mu;L
+
| 86998996
|
+
| 1
 +
| CCNC1=NC=C(C=N1)C2=CC(=NC(=C2C#N)N)C3=CC=CN3
 
|-
 
|-
| H<sub>2</sub>O
 
! colspan="4"| to a final volume of 25 &mu;L
 
 
|}
 
|}
 
</center>
 
</center>
  
<font color = #0d368e>'''To ensure the steps required for preparing a digest are clear, the Instructor will provide a live demonstration of this process. You should provide a written description of the procedure in your laboratory notebook!'''</font color>
+
 
 +
'''These online resources may be helpful to learning more about the hits identified in the SMM:'''
 +
*Cloud version of ChemDraw [https://chemdrawdirect.perkinelmer.cloud/js/sample/index.html here].
 +
**Copy and paste the small molecule smiles into the work space to get a chemical structure
 +
*Platform to transform the smiles information into a PubChem ID [https://pubchem.ncbi.nlm.nih.gov/idexchange/idexchange.cgi here].
 +
**Copy and paste the smiles into the input ID search to determine the ID number.
 +
*PubChem database of chemical information [https://pubchem.ncbi.nlm.nih.gov here].
 +
**Includes small molecule molecular weight and other useful information.
  
 
<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
 
<font color =  #4a9152 >'''In your laboratory notebook,'''</font color> complete the following:
  
*Provide a written overview / description of the the procedure used to prepare a restriction enzyme digest (from the live demonstration).
+
*How many features did you identify that are present in two or more of the small molecules that putatively bind PF3D7_20109-F21?  Are there more or less than you expected?
*For how long will the digests incubate and at what temperature?
+
*Is there a feature present in all of the identified small molecules?  What might this suggest about the binding site(s) and / or binding ability of PF3D7_20109-F21?
 
+
*Can you assign the identified small molecules to sub-groups based on the common features that are present?
===Part 5: Electrophorese confirmation digests===
+
*What might the different features represent?  More specifically, consider whether each subgroup has a unique binding site on the target protein or if each subgroup represents different solutions for interacting with the same binding site.  
 
+
*How might you make modifications to the small molecules / features to probe binding? As a hint, consider how different functional groups could be positioned at a given site without altering qualitative binding in the SMM assay to translate that into some testable ideas (e.g. quantitative binding properties may be occurring that are functionally relevant, but not discernible by SMM assays; or such a site is not critical for binding and may allow for modifications that confer beneficial properties of the compound).
Electrophoresis is a technique that separates large molecules by size using an applied electrical field and a sieving matrix. DNA, RNA and proteins are the molecules most often studied with this technique; agarose and acrylamide gels are the two most common sieves. The molecules to be separated enter the matrix through a well at one end and are pulled through the matrix when a current is applied across it. The larger molecules get entwined in the matrix and are stalled; the smaller molecules wind through the matrix more easily and travel farther away from the well. The distance a DNA fragment travels is inversely proportional to the log of its length. Over time fragments of similar length accumulate into “bands” in the gel. Higher concentrations of agarose can be used to resolve smaller DNA fragments.
+
 
+
[[Image:Fa20 M2D1 gel electrophoresis.png|thumb|right|550px|'''Agarose gel loading and electrophoresis.''' (A) To separate DNA fragments after a digestion reaction, the sample is loaded into the sample slots, or wells, in the agarose.  (B) Then an electrophoresis chamber is used to apply an electrical current.  The result is that larger sized DNA molecules remain close to the well where the sample was loaded and smaller DNA molecules migrate through the agarose gel.  This is due to the negatively charged DNA backbone and position of the electrodes in the electrophoresis chamber.]]
+
 
+
DNA and RNA are negatively charged molecules due to their phosphate backbone, and they naturally travel toward the positive electrode at the far end of the gel. Today you will separate DNA fragments using an agarose matrix. Agarose is a polymer that comes from seaweed. To prepare these gels, agarose and 1X TAE buffer (Tris base, acetic acid, and EDTA) are microwaved until the agarose is melted and fully dissolved. The molten agar is then poured into a horizontal casting tray, and a comb is added. Once the agar has solidified, the comb is removed, leaving wells into which the DNA samples can be loaded.
+
 
+
For the digests that were prepared in the previous laboratory session, a 1% agarose gel with SYBR Safe DNA stain was used to separate the DNA fragments in the four digest reactions.  In addition, a well was loaded with a molecular weight marker (also called a DNA ladder) to determine the size of the fragments.
+
 
+
<font color = #0d368e>'''To ensure the steps included below are clear, please watch the video tutorial linked here: [[https://www.dropbox.com/s/ijo0d5pi8pg2f5c/DNA%20Gel%20electrophoresis.mp4?dl=0 DNA gel electrophoresis]].  The steps are detailed below so you can follow along!'''</font color>
+
 
+
#Add 5 &mu;L of 6x loading dye to the digests.
+
#*Loading dye contains bromophenol blue as a tracking dye, which enables you to follow the progress of the electrophoresis.
+
#*Glycerol is also included to weight the samples such that the liquid sinks into well.  
+
#Flick the eppendorf tubes to mix the contents, then quick spin them in the microfuge to bring the contents of the tubes to the bottom.  
+
#Load 25 &mu;L of each digest into the gel, as well as 10 &mu;L of 1kb DNA ladder.
+
#*Be sure to record the order in which you load your samples!
+
#*To load your samples, draw the volume listed above into the tip of your P200 or P20. Lower the tip below the surface of the buffer and directly over the well. Avoid lowering the tip too far into the well itself so as to not puncture the well. Expel your sample slowly into the well. Do not release the pipet plunger until after you have removed the tip from the gel box (or you'll draw your sample back into the tip!).
+
#Once all the samples have been loaded, attach the gel box to the power supply and electrophorese the gel at 125 V for 45 minutes.
+
#Lastly, visualize the DNA fragments in the agarose gel using the gel documentation system.
+
  
 
==Reagents list==
 
==Reagents list==
*pET-28b(+)_PF3D7_1351100 (concentration = 25 ng/&mu;L) (a gift from the Niles Laboratory)
+
*small-molecule microarray slides (a gift from Koehler Laboratory)
*10X buffer; the buffer will depend on the enzymes you use for your confirmation digest (from NEB)
+
*Tris-HCl buffered saline (TBS): 50 mM Tris-Hcl (pH = 7.5), 150 mM NaCl (from BioRad)
*restriction enzyme(s); the concentration of each enzyme is listed on the product information page (from NEB)
+
*TBS containing 0.1% Tween20 (TBS-T) (from BioRad)
*1% agarose in 1X TAE (agarose from VWR)
+
*Alexa Fluor 647 anti-His antibody (from Qiagen)
**with 10% (v/v) &mu;L SYBR Safe DNA stain (from Invitrogen)
+
*1X TAE gel electrophoresis buffer: 40 mm Tris, 20 mM acetic acid, 1 mM EDTA (from BioRad)
+
*6X gel loading dye, blue (from NEB)
+
*1 kb DNA ladder (from NEB)
+
  
 
==Navigation links==
 
==Navigation links==
Next day: [[]]<br>
+
Next day: [[20.109(F21):M2D2 |Complete in silico cloning of protein expression plasmid]] <br>
 
Previous day: [[20.109(F21):M1D7 | Analyze data using statistical methods]]<br>
 
Previous day: [[20.109(F21):M1D7 | Analyze data using statistical methods]]<br>

Latest revision as of 20:00, 15 November 2021

20.109(F21): Laboratory Fundamentals of Biological Engineering
Drawing provided by Marissa A., 20.109 student in Sp21 term.  Schematic generated using BioRender.

Fall 2021 schedule        FYI        Assignments        Homework        Class data        Communication        Accessibility

       Module 1: Genomic instability                          Module 2: Drug discovery       


Introduction

A small molecule microarray (SMM) requires the covalent attachment of a library of small molecules to a glass slide. Our library is meant to broadly sample the drug-like chemical space (i.e. all possible chemical structures that have drug-like physical properties) and contains about 50,000 small molecules. Some libraries are much smaller, while many pharmaceutical companies possess high-throughput screening (HTS) collections of millions of compounds. Because this chemical space is very large, it’s difficult to generalize any single chemical reaction for this attachment that can be applied to all small molecules. We take a “one-size-fits-most” approach, where the glass slide is functionalized with a broadly reactive electrophile capable of reacting with nucleophiles present in most drug-like small molecules, such as alcohols or amines. Many small molecules contain multiple nucleophiles suitable for attachment. In this case, our manufacturing will result in a mixture of attachment sites. It’s important to remember that attachment to the glass slide constrains the possible orientation of the protein-small molecule interaction; some orientations are not possible because the glass slide and linker are in the way.

We start with a glass slide with exposed amines across the surface and attach a short PEG (polyethylene glycol) linker. To the end of this PEG linker, an isocyanate group is attached. Isocyanate, or R-N=C=O, is a resonant structure, and a partial positive charge is stabilized on the carbon atom. This carbon atom is electrophilic, and small molecules with nucleophiles will react here. We estimate that about 70% of drug-like small molecules are amenable to this reaction, and our library is filtered to contain only these molecules.

Image from Bradner, J. E. et al. PMID: 17406478

Our compound library is dissolved in DMSO and stored in 384-well plates. To dispense the compounds onto our functionalized glass slide, a robotic arm with a set of 48 metal pins is used to transfer the compounds to the glass slide. Each metal pin has a small slit in the end, and capillary action is used to precisely withdraw and dispense consistent volumes. When the pins touch the glass slide, the compound in DMSO is dispensed into a small circle of approximately 160 micron in diameter. Each pin prints one compound in two different locations on each slide, and then the pins are washed in water and DMSO. This process is repeated for each compound, resulting in our final microarray. The microarray is divided into 48 subarrays, and each subarray corresponds to one pin and contains 256 discrete spots. Within each subarray, we print a set of fluorescent compounds in the shape of an ‘X’ so that we can precisely determine where each spot is printed. After the compounds react, we quench the surface so that no electrophiles remain. This results in our final microarray; a collection of approximately 12,000 discrete spots displaying one compound each.

(Written by Rob Wilson. For more information read Bradner, J. E., McPherson, O. M., and Koehler, A. N. (2006) A method for the covalent capture and screening of diverse small molecules in a microarray format. Nature Prot. 1:2344-2352. PMID: 17406478.)

Protocols

Part 1: Review SMM experimental details

For timing reasons, you will not perform an SMM experiment this semester. Instead you will expand on the results gathered by researchers in the Niles Laboratory to test putative small molecule binders in a secondary assay. To ensure that you are familiar with the steps used in an SMM screen, review the procedure below.

Perform SMM screen

To ensure the steps included below are clear, please watch the video tutorial linked here: [SMM Screen]. The steps are detailed below so you can follow along!

  1. Obtain two printed SMM slides from the front laboratory bench.
    Only use forceps at the barcoded end of the slide.
    • Only hold them at the barcoded end as you would otherwise disrupt the ligands that are printed on the surface.
  2. Record the barcode numbers of your slides in your laboratory notebook.
  3. Carefully place each slide, barcode facing up, into a well of a 4-well chamber dish using the slide forceps.
    Add all liquid to the barcoded end of the slide.
    • Again, only hold the slide at the barcoded end.
  4. Add 3 mL of your diluted protein solution to the chambers that contain your slides.
    • Pipet the diluted protein solution onto the barcoded end of the slides.
    • Avoid generating bubbles as you add your diluted protein solution.
  5. Gently rock the 4-well chamber dish back and forth until the slides are completely covered with the diluted protein solution.
  6. Cover the 4-well chamber dish with aluminum foil and incubate on the rocking table for 1 hr.
    • During all incubations it is important to keep the slides covered as the fluorescein spots used for alignment during scanning are sensitive to light.
      Pour liquid from the corner of the 4-well chamber dish.
  7. Carefully pour the diluted protein solution from your 4-well chamber dish into a waste container by tipping the dish such that the liquid leaves the dish from the corner.
    • The surface tension should keep the slides in the 4-well chamber dish; however, it is good to be careful that they do not fall into the waste container.
  8. Add ~3 mL of TBS-T to the chambers that contain your slides.
    • Remember, pipet the liquid onto the barcoded end of the slides.
    • Don't 'push' the liquid across the surface of the slide with the pipet. Instead, gently rock the 4-well chamber dish back and forth to completely cover your slides.
  9. Incubate the slides on the rocking table for 2 min.
    • Don't forget to cover the 4-well chamber dish with aluminum foil.
  10. Pour the TBS-T from the 4-well chamber dish as above into a waste container.
  11. Complete Steps #8 - 10 a total of 3 times.
  12. Move your slides to a fresh 4-well chamber dish to eliminate carryover from the protein and/or buffer solutions.
  13. Prepare 7 mL of anti-His antibody solution in TBS-T.
    • Dilute the antibody stock 1:1000 at the front laboratory bench.
  14. Add 3 mL of the diluted antibody solution to the chambers that contain your slides.
    • Pipet the diluted antibody solution onto the barcoded end of the slides.
    • Avoid generating bubbles as you add your diluted antibody solution.
  15. Gently rock the 4-well chamber dish back and forth until the slides are completely covered with the diluted antibody solution.
  16. Cover the 4-well chamber dish with aluminum foil and incubate on the rocking table for 1 hr.
  17. Carefully pour the diluted antibody solution from your 4-well chamber dish into a waste container.
  18. Add ~3 mL of TBS (not TBS-T!) to remove excess detergent and incubate your covered 4-well chamber dish on the rocking table for 2 min, then pour liquid into a waste container.
  19. Lastly, wash the slides with dH2O.
    • Obtain 2 reservoirs and add fresh dH2O to each.
    • Label the reservoirs '1' and '2'.
  20. Wash each slide by removing it from the 4-well chamber dish with the slide forceps and dunking it 8 times into reservoir 1, then dunking it 8 times into reservoir 2.
  21. Dry your slides by carefully inserting your slides in the 'slide spinner' at the front laboratory bench.
    • Be mindful not to scrape the face of your slides on the slide holder.
    • Centrifuge your slides for 30 sec.
  22. Transfer your slides into fresh 50 mL conical tubes and wrap in aluminum foil.
  23. Store SMM slides at 4 °C until imaging.

Scan SMM slides

After the printed SMM slides are incubated with the purified PF3D7_20109-F21 protein, the next step is to check which of the small molecules, if any, you screened may be able to bind PF3D7_20109-F21. To do this the slides are imaged using a Genepix microarray scanner. The scanner measures the fluorescence signal emitted from the slide at two wavelengths: 532 nm and 635 nm. The goal for today is to familiarize you with how the SMM slides are scanned and imaged. These images are the raw data that will be used to identify putative small molecule binders.

Overview of SMM slide imaging. A. To visualize which small molecule is putative binder of PF3D7_20109-F21, the 6xHis-tag is labeled using an anti-His antibody that is conjugated to a fluorophore. B. The fluorophore used to label protein bound to the small molecule in a particular location on the slide emits a red signal (indicated by white arrow). Image in Panel A generated using BioRender.

When the SMM slides are imaged, the scanner exposes each slide to excitation light specific to the fluorophores used in the experiment. As shown in the figure above, two fluorophores were used to evaluate small molecule binding to PF2D7_20109-F21. The green spots represent locations on the SMM slide where fluorescein was printed. Fluorescein is a fluorescent dye that emits at 532 nm and is used for alignment purposes. Correct alignment is critical to knowing which small molecules are in which spots on the slide. Red spots are indicative of small molecules that are bound by PF3D7_20109-F21. The signal is due to an Alexa Fluor 647 anti-His antibody that emits at 635 nm.

During imaging, the scanner detects the intensity of the emitted fluorescence light to generate an image that can be analyzed. The intensity and position of the 'red' signal associated with putative small molecule binders is what will be assessed in the next laboratory session. This analysis will provide a list of 'hits' that can be used in future studies to identify drug candidates.

To ensure you are familiar with the steps involved in imaging the SMM experiment, please watch the video tutorial provided by a researcher from the Koehler Laboratory linked here: SMM Scanning. The steps are detailed below so you can follow along!

  1. Place the slide in the scanner barcode side down.
  2. Set the desired wavelengths that you want to scan.
    • For fluorescein: 532 nm
    • For Alexa Fluor 647: 635 nm
  3. Set the wavelength, filter, PMT, and power settings.
  4. Run a preview scan to optimize the PMT for the 635 nm (red) emission.
  5. Complete a full scan using the optimal PMT value.
  6. Following the scan, you will see an image of your slide as below.

In your laboratory notebook, complete the following:

  • What color is emitted by fluorescein? What does this indicate on the scan (what is present in spots that emit this signal)?
  • What color is emitted by Alexa Fluor 647? What does this indicate on the scan (what is present in spots that emit this signal)?
Sp17 20.109 M1D5 Genepix scan.png

Analyze SMM images

Sp20 M1D6 background, foreground.png
The microarrayer reads the fluorescence signals emitted from the surface of the SMM slide at two excitation wavelengths. As noted previously, the 532 nm wavelength was used to excite fluorescein, which was printed in an 'X' pattern to assist with alignment. The 635 nm wavelength was used to excite Alexa Fluor 647-conjugated anti-His antibody; which would be associated with PF3D7_20109-F21 bound to a small molecule on the slide. A hit denotes a spot on the slide that emits a red fluorescence signal significantly higher than the background fluorescence level. In terms of protein binding, a hit denotes that the PF3D7_20109-F21 protein is bound to a small molecule and is therefore localized to a specific position on the slide. You will analyze the fluorescence signal collected by the microarray scanner using a value termed the robust z-score.

The robust z-score differentiates signal from noise by providing a value that represents the intensity of a signal above background. In the case of the SMM experiment, the intensity of a fluorescent signal above the background fluorescence is calculated. To do this the fluorescence emitted across the entire slide is grouped to define the Median Absolute Deviation (MAD), which is is a measure of the variability of a univariate dataset. Though beyond the scope of this class, the equation for calculating the robust z-score assigns a value for how much more intense the fluroescent signal at a spot is over background. The higher the value, the more different the signal from background.

After putative binders are identified via the robust z-score, the data are examined by-eye using the criteria discussed in the prelab. Please review the prelab notes for how this process is completed.

In your laboratory notebook, complete the following:

  • Why is it important to complete a by-eye inspection of the SMM results?

Part 2: Visually evaluate chemical structures of putative binders

One method for assessing protein-small molecule binding is to visually inspect known small molecule binders for common features / structures. Your goal for today is to carefully examine the hits identified by the class and identify any common features / structures. As in the image below, it is possible that multiple features will be present within the same small molecule.

Sp17 20.109 M1D7 chemical structure features.png


With your partner, review the hits that were identified in the SMM screen completed for PF3D7_20109-F21 (see table below). To see the chemical structures, translate the SMILES strings using one of the methods described in the text below the table. It may be easier to copy / paste the small molecule images into a powerpoint file so you can readily see all of the structures. Also, it may be helpful to use a color-coding system (like the one in the image provided above) to highlight features / structures that are common to the small molecules that putatively bind PF3D7_20109-F21.

Compound ID Compound # SMILES
43847864 3 CN(CCOC)C[C@@H]1CN(C[C@@H]1CO)CC2=CC3=CC=CC=C3C=C2
30020341 2 CN(CCO)C[C@@H]1CN(C[C@@H]1CO)CC2=CC3=CC=CC=C3C=C2
10351573 4 C1CC1[C@H]2CN(C[C@@H]2N)C(=O)C3=CN=C(N=C3)NC4=CC=CC=C4
86998996 1 CCNC1=NC=C(C=N1)C2=CC(=NC(=C2C#N)N)C3=CC=CN3


These online resources may be helpful to learning more about the hits identified in the SMM:

  • Cloud version of ChemDraw here.
    • Copy and paste the small molecule smiles into the work space to get a chemical structure
  • Platform to transform the smiles information into a PubChem ID here.
    • Copy and paste the smiles into the input ID search to determine the ID number.
  • PubChem database of chemical information here.
    • Includes small molecule molecular weight and other useful information.

In your laboratory notebook, complete the following:

  • How many features did you identify that are present in two or more of the small molecules that putatively bind PF3D7_20109-F21? Are there more or less than you expected?
  • Is there a feature present in all of the identified small molecules? What might this suggest about the binding site(s) and / or binding ability of PF3D7_20109-F21?
  • Can you assign the identified small molecules to sub-groups based on the common features that are present?
  • What might the different features represent? More specifically, consider whether each subgroup has a unique binding site on the target protein or if each subgroup represents different solutions for interacting with the same binding site.
  • How might you make modifications to the small molecules / features to probe binding? As a hint, consider how different functional groups could be positioned at a given site without altering qualitative binding in the SMM assay to translate that into some testable ideas (e.g. quantitative binding properties may be occurring that are functionally relevant, but not discernible by SMM assays; or such a site is not critical for binding and may allow for modifications that confer beneficial properties of the compound).

Reagents list

  • small-molecule microarray slides (a gift from Koehler Laboratory)
  • Tris-HCl buffered saline (TBS): 50 mM Tris-Hcl (pH = 7.5), 150 mM NaCl (from BioRad)
  • TBS containing 0.1% Tween20 (TBS-T) (from BioRad)
  • Alexa Fluor 647 anti-His antibody (from Qiagen)

Navigation links

Next day: Complete in silico cloning of protein expression plasmid

Previous day: Analyze data using statistical methods