<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>VeRTEx</title>
	<atom:link href="http://vertexmath.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://vertexmath.wordpress.com</link>
	<description>A research group in mathematics at the University of Oklahoma</description>
	<lastBuildDate>Sun, 21 Aug 2011 05:18:34 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='vertexmath.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>VeRTEx</title>
		<link>http://vertexmath.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://vertexmath.wordpress.com/osd.xml" title="VeRTEx" />
	<atom:link rel='hub' href='http://vertexmath.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Sections 8.9 &#8211; 8.11</title>
		<link>http://vertexmath.wordpress.com/2011/08/21/sections-8-9-8-11/</link>
		<comments>http://vertexmath.wordpress.com/2011/08/21/sections-8-9-8-11/#comments</comments>
		<pubDate>Sun, 21 Aug 2011 05:18:30 +0000</pubDate>
		<dc:creator>alyssaleone</dc:creator>
				<category><![CDATA[Math]]></category>

		<guid isPermaLink="false">http://vertexmath.wordpress.com/?p=721</guid>
		<description><![CDATA[Section 8.9 delves further into the complications of fragment assembly, and how those complications are currently addressed.   While there are complications from errors in reads as well as not knowing if a read comes from a target strand or its compliment, the major problem is that the current length of the reads (500 &#8211; [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=721&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Section 8.9 delves further into the complications of fragment assembly, and how those complications are currently addressed.   While there are complications from errors in reads as well as not knowing if a read comes from a target strand or its compliment, the major problem is that the current length of the reads (500 &#8211; 700 base pairs) is not long enough to deal with repeats in DNA.  They state that increasing the length of reads would solve this problem, but the sequencing technology has not yet advanced that far.</p>
<p>They illustrate this problem quite well using the example of the frog puzzle (though I&#8217;m probably biased since I worked that puzzle many, many times growing up) &#8211; since all of the frogs are repeated, you can&#8217;t know which one goes where.  Likewise, if you have several 500 bp reads that all happen to be within a 1000 bp repeat, you have no way of knowing which read goes with which copy.</p>
<p>The original method, BAC-by-BAC sequencing, simplified the computation by reducing the number of repeats, but complicated the sequencing project.  The current method, mate-pair reads sequencing, attempts to ensure that each read has some kind of unique identifying information associated with it.  To do this, they use inserts of a given length (longer than the read and most repeats), and sequence both ends of the insert.  This makes it almost certain that at least one of the ends will contain a unique, non-repeated portion of the DNA.  Armed with these fragments, they describe the method for obtaining the DNA sequence.</p>
<p>Section 8.10 on protein sequencing and identification leans more toward being a biological overview of the subject and its challenges.  It alludes to some computational problems, but does not delve into any.  Several applications of protein sequencing and identification are given, as well as an intro to mass spectrometry.</p>
<p>Section 8.11 introduces the peptide sequencing problem.  The idea behind sequencing peptides seems to be to break the peptide into two parts (fragment it) and record the masses and ion types: in the fragmenting process peptides can lose molecules such as ammonia or water, and these partial peptides with pieces missing are called ion types.  If <img src='http://s0.wp.com/latex.php?latex=%5CDelta&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;Delta' title='&#92;Delta' class='latex' /> is a set of numbers representing the masses of possible chemical groups removed during fragmentation, then <img src='http://s0.wp.com/latex.php?latex=%5CDelta&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;Delta' title='&#92;Delta' class='latex' /> is called the set of ion types.</p>
<p>To sequence the peptides, two spectra are generated: given a known peptide, the theoretical spectrum is the set of masses obtained by subtracting all ion types from the masses of all partial peptides.  Conversely, not knowing the peptide, the experimental spectrum is a set of numbers obtained by mass spectrometry.  The goal of the peptide sequencing problem, then, is to find a peptide whose theoretical spectrum is the best match to some experimental spectrum.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/vertexmath.wordpress.com/721/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/vertexmath.wordpress.com/721/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/vertexmath.wordpress.com/721/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/vertexmath.wordpress.com/721/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/vertexmath.wordpress.com/721/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/vertexmath.wordpress.com/721/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/vertexmath.wordpress.com/721/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/vertexmath.wordpress.com/721/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/vertexmath.wordpress.com/721/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/vertexmath.wordpress.com/721/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/vertexmath.wordpress.com/721/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/vertexmath.wordpress.com/721/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/vertexmath.wordpress.com/721/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/vertexmath.wordpress.com/721/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=721&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://vertexmath.wordpress.com/2011/08/21/sections-8-9-8-11/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/be5c35fe91c3473e8e38aec35da25ed8?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">alyssaleone</media:title>
		</media:content>
	</item>
		<item>
		<title>Sections 8.6-8.8</title>
		<link>http://vertexmath.wordpress.com/2011/08/16/sections-8-6-8-8/</link>
		<comments>http://vertexmath.wordpress.com/2011/08/16/sections-8-6-8-8/#comments</comments>
		<pubDate>Tue, 16 Aug 2011 05:46:54 +0000</pubDate>
		<dc:creator>calcnerd76</dc:creator>
				<category><![CDATA[Math]]></category>

		<guid isPermaLink="false">http://vertexmath.wordpress.com/?p=717</guid>
		<description><![CDATA[In section 8.6, the book introduces the concept of sequencing by hybridization. It describes the spectrum, or l-mer composition, of a DNA sequence. For example, when s= TATGGTGC and l=3, the spectrum is {TAT, ATG, TGG, GGT, GTG, TGC}. The sequencing by hybridization problem is the task of constructing the DNA sequence that produces the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=717&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In section 8.6, the book introduces the concept of sequencing by hybridization. It describes the spectrum, or l-mer composition, of a DNA sequence. For example, when s= TATGGTGC and l=3, the spectrum is {TAT, ATG, TGG, GGT, GTG, TGC}. The sequencing by hybridization problem is the task of constructing the DNA sequence that produces the given spectrum. Of course, the spectrum would generally not be given in the correct order.</p>
<p>The book mentions that a linear time algorithm exists to solve this problem. It then compares SBH to using DNA arrays as mentioned in the previous section. A main difference is that in SBH, the target DNA sequence is unknown while with DNA arrays, the target is either known or &#8220;almost&#8221; known. Of course, there are problems with the real life application of the algorithm, since it may be difficult to distinguish between perfect matches and highly stable mismatches.</p>
<p>In section 8.7, the book discusses SBH as a Hamiltonian path problem. The idea is to construct a directed graph by introducing a vertex for every l-mer in the spectrum and then connecting two vertices if they overlap. Two l-mers p and q overlap if the first l &#8211; 1 letters of p coincide with the last l &#8211; 1 letters of q. However, with complicated examples, there may be more than one Hamiltonian path, each corresponding with a different possible reconstruction.</p>
<p>In section 8.8, the book discusses SBH as an Eulerian path problem. This technique is more fruitful than the Hamiltonian path problem, since it can be solved using a linear time algorithm. This time, a graph is constructed whose edges, not vertices, correspond to l-mers from the spectrum. Then a path must be found that visits every edge exactly once. The book thoroughly describes how to produce such a graph on page 273. Finally, after proving that a connected graph is Eulerian if and only if each of its vertices is balanced, it describes the algorithm that is used to find an Eulerian cycle.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/vertexmath.wordpress.com/717/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/vertexmath.wordpress.com/717/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/vertexmath.wordpress.com/717/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/vertexmath.wordpress.com/717/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/vertexmath.wordpress.com/717/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/vertexmath.wordpress.com/717/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/vertexmath.wordpress.com/717/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/vertexmath.wordpress.com/717/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/vertexmath.wordpress.com/717/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/vertexmath.wordpress.com/717/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/vertexmath.wordpress.com/717/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/vertexmath.wordpress.com/717/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/vertexmath.wordpress.com/717/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/vertexmath.wordpress.com/717/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=717&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://vertexmath.wordpress.com/2011/08/16/sections-8-6-8-8/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/505c56f4fd30fb44191427c9049645ff?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">calcnerd76</media:title>
		</media:content>
	</item>
		<item>
		<title>Sections 8.3-8.5</title>
		<link>http://vertexmath.wordpress.com/2011/08/09/sections-8-3-8-5/</link>
		<comments>http://vertexmath.wordpress.com/2011/08/09/sections-8-3-8-5/#comments</comments>
		<pubDate>Tue, 09 Aug 2011 21:04:39 +0000</pubDate>
		<dc:creator>lajo9062</dc:creator>
				<category><![CDATA[Math]]></category>

		<guid isPermaLink="false">http://vertexmath.wordpress.com/?p=709</guid>
		<description><![CDATA[Hey everyone. Sorry for the late reply, I was out of town all weekend. Sections 8.3-8.5 are prettybasic, just giving a bit of a biological background on how scientists go about determining the A, G, C, T letter sequence of some unknown DNA sample. Section 8.3 &#8211; DNA sequencing This section talks about fragment assmebly, a [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=709&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Hey everyone. Sorry for the late reply, I was out of town all weekend. Sections 8.3-8.5 are prettybasic, just giving a bit of a biological background on how scientists go about determining the A, G, C, T letter sequence of some unknown DNA sample.</p>
<p>Section 8.3 &#8211; DNA sequencing</p>
<p>This section talks about <em>fragment assmebly</em>, a method of DNA sequencing in which one can determine an entire genome by cutting it up into thousands of little DNA fragments and then reassembling them based on overlapping patterns. One extremely successful method of DNA sequencing, and probably the most renowned, was developed by Fred Sanger. By leaving out one of the four bases (A, G, C, or T) when making copies of the unknown DNA sequence, and of course running this experiment once for each base, Sanger discovered that he could in fact copy different length fragments of the DNA. He then organized these fragments by length, and read off the sequence accordingly. The way it sort of works is: each location of a particular base (which ever is left out of the experiment) is highlighted by the fragments that are created. For example, the sequence ACGTAAGCTA is cut into the fragments ACG and ACGTAAGC whenever T is left out of the experiment, highlighting that the base following G (in the first fragment) and C (in the second) is a T. If we took the same sequence, ACGTAAGCTA, and left out G, we would get back the fragments AC and ACGTAA. Once we&#8217;ve run the experiment four times, we look at all of our &#8221;ladder&#8221; of fragments and can reconstruct the original sequence based off of the overlapping patterns seen. Though the process seems simple enough, it does become a bit complicated because billions of fragments must be measured in order to read the ladder.</p>
<p>8.4 &#8211; Shortest Superstring Problem</p>
<p>The sequence that contains all of the information from the readings is know as the <em>superstring</em>. There can be more than one superstring for any given set of data, but the one most useful to us is the shortest, in which no information is duplicated. For example, given the set of strings {000, 001, 010, 011, 100, 101, 110, 111}, we can create a superstring by simply linking each fragment one after the other to create the sequence 000 001 010 011 100 101 110 111. However, by taking into account any overlapping info from our original set of eight strings, we can see that the string 0001110100 also contains all information from the orginal set and it is presented in a much more simplified manner. This shortest superstring turns out to be a reasonable first guess at the unknown genomic DNA sequence. </p>
<p>8.5 &#8211; DNA Arrays as an Alternative Sequencing Technique</p>
<p>An alternative sequencing technique, <em>Sequencing by Hybridization</em>, was developed in an attempt to overcome the time-consuming nature of the Sanger method. This method involves constructing a mini <em>DNA array</em>, or <em>chip</em>, that contains thousands of short DNA fragments, called <em>probes</em>. A flourescently labeled strand of DNA is then applied to the array. Those probes which are complimentary to a substring of the added target strand will hybridize, or weakly bond to the target strand at that location. For example, the probe ACCGTGGA would hybridize to the target CCC<strong>TGGCACCT</strong>A where it is bolded, because ACCGTGGA is complimentary to TGGCACCT. This method of sequencing didn&#8217;t really takeoff until1991 with Fodor&#8217;s <em>light-directed polymer synthesis, </em>because many were hesitant that creating a bunch of probes and assembling arrays in order to sequence the DNA might  not be any more efficient than Sanger&#8217;s method. Yet with Fodor&#8217;s technique, building an array with <img title="" src="http://s0.wp.com/latex.php?latex=4^l&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0" alt="" /> probes of length <img title="" src="http://s0.wp.com/latex.php?latex=l&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0" alt="" /> requires only <img title="" src="http://s0.wp.com/latex.php?latex=4l&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0" alt="" /> reactions, rather than the expected <img title="" src="http://s0.wp.com/latex.php?latex=4^l&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0" alt="" />.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/vertexmath.wordpress.com/709/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/vertexmath.wordpress.com/709/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/vertexmath.wordpress.com/709/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/vertexmath.wordpress.com/709/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/vertexmath.wordpress.com/709/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/vertexmath.wordpress.com/709/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/vertexmath.wordpress.com/709/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/vertexmath.wordpress.com/709/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/vertexmath.wordpress.com/709/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/vertexmath.wordpress.com/709/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/vertexmath.wordpress.com/709/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/vertexmath.wordpress.com/709/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/vertexmath.wordpress.com/709/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/vertexmath.wordpress.com/709/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=709&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://vertexmath.wordpress.com/2011/08/09/sections-8-3-8-5/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/9fa0c80f400788e480c3fa9182c02cde?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">lajo9062</media:title>
		</media:content>

		<media:content url="http://s0.wp.com/latex.php?latex=4l&#038;bg=ffffff&#038;fg=4b5d67&#038;s=0" medium="image" />

		<media:content url="http://s0.wp.com/latex.php?latex=l&#038;bg=ffffff&#038;fg=4b5d67&#038;s=0" medium="image" />

		<media:content url="http://s0.wp.com/latex.php?latex=4l&#038;bg=ffffff&#038;fg=4b5d67&#038;s=0" medium="image" />

		<media:content url="http://s0.wp.com/latex.php?latex=4l&#038;bg=ffffff&#038;fg=4b5d67&#038;s=0" medium="image" />
	</item>
		<item>
		<title>sections 8.1-8.2</title>
		<link>http://vertexmath.wordpress.com/2011/08/05/sections-8-1-8-2/</link>
		<comments>http://vertexmath.wordpress.com/2011/08/05/sections-8-1-8-2/#comments</comments>
		<pubDate>Fri, 05 Aug 2011 22:30:36 +0000</pubDate>
		<dc:creator>oreadex</dc:creator>
				<category><![CDATA[Math]]></category>

		<guid isPermaLink="false">http://vertexmath.wordpress.com/?p=705</guid>
		<description><![CDATA[Hello guys, sorry for the late reply. I have been a little bit busy. Section 8.1 is just an introduction to what graphs are, which in our case should be more of a recap. As expected, the book did explain basic terms such as vertices, edges, degree of vertex, indegree, outdegree,and connected and disconnected graphs. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=705&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Hello guys, sorry for the late reply. I have been a little bit busy.</p>
<p>Section 8.1 is just an introduction to what graphs are, which in our case should be more of a recap. As expected, the book did explain basic terms such as vertices, edges, degree of vertex, indegree, outdegree,and connected and disconnected graphs. I do not think there is a need to define any of them here, but if anyone wants me to, just make a comment, and I will be sure to reply.  The book also included very popular problems involving graphs such as the bridge problem, the Hamiltonian cycle problem, and others similar to them. I am guessing most of us have heard and practiced these problems at least once in our lives.</p>
<p>Section 8.2 gave a nice description on how Seymour Benzer used graphs and graph theory, along with his experiment on bacteria to show that genes are linear.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/vertexmath.wordpress.com/705/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/vertexmath.wordpress.com/705/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/vertexmath.wordpress.com/705/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/vertexmath.wordpress.com/705/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/vertexmath.wordpress.com/705/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/vertexmath.wordpress.com/705/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/vertexmath.wordpress.com/705/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/vertexmath.wordpress.com/705/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/vertexmath.wordpress.com/705/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/vertexmath.wordpress.com/705/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/vertexmath.wordpress.com/705/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/vertexmath.wordpress.com/705/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/vertexmath.wordpress.com/705/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/vertexmath.wordpress.com/705/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=705&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://vertexmath.wordpress.com/2011/08/05/sections-8-1-8-2/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ed6c1b119bca55b6243abbc1b30c5055?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">oreadex</media:title>
		</media:content>
	</item>
		<item>
		<title>Sections 7.4-7.5</title>
		<link>http://vertexmath.wordpress.com/2011/08/01/sections-7-4-7-5/</link>
		<comments>http://vertexmath.wordpress.com/2011/08/01/sections-7-4-7-5/#comments</comments>
		<pubDate>Mon, 01 Aug 2011 19:53:39 +0000</pubDate>
		<dc:creator>jonkujawa</dc:creator>
				<category><![CDATA[Math]]></category>

		<guid isPermaLink="false">http://vertexmath.wordpress.com/?p=703</guid>
		<description><![CDATA[In Section 7.4 they talk about using the 4-Russians technique on the longest common subsequence problem.   The short version is that once you subdivide into t by t squares, you need to solve the problem for each vertex on the edges, not just the corners.  But even still you can improve the algorithm from to [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=703&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In Section 7.4 they talk about using the 4-Russians technique on the longest common subsequence problem.   The short version is that once you subdivide into t by t squares, you need to solve the problem for each vertex on the edges, not just the corners.  But even still you can improve the algorithm from <img src='http://s0.wp.com/latex.php?latex=%5Cmathcal%7BO%7D%28n%5E2%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;mathcal{O}(n^2)' title='&#92;mathcal{O}(n^2)' class='latex' /> to <img src='http://s0.wp.com/latex.php?latex=%5Cmathcal%7BO%7D%28n%5E2%2Flog%28n%29%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;mathcal{O}(n^2/log(n))' title='&#92;mathcal{O}(n^2/log(n))' class='latex' /> by taking <img src='http://s0.wp.com/latex.php?latex=t+%3D+log%28n%29%2F4&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='t = log(n)/4' title='t = log(n)/4' class='latex' />.   Lest you think that is not much of an improvement, I used Wolfram Alpha to compute these two functions for n=100, and the first is 10,000 and the second is 2171 (approximately).  That is, for n=100 the first algorithm takes nearly 5 times longer than the second one!   And the difference between the two grows as n gets larger (I&#8217;ll let you sort that out).</p>
<p>&nbsp;</p>
<p>In Section 7.5 is some history, which I&#8217;ll leave to you to read.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/vertexmath.wordpress.com/703/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/vertexmath.wordpress.com/703/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/vertexmath.wordpress.com/703/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/vertexmath.wordpress.com/703/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/vertexmath.wordpress.com/703/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/vertexmath.wordpress.com/703/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/vertexmath.wordpress.com/703/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/vertexmath.wordpress.com/703/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/vertexmath.wordpress.com/703/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/vertexmath.wordpress.com/703/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/vertexmath.wordpress.com/703/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/vertexmath.wordpress.com/703/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/vertexmath.wordpress.com/703/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/vertexmath.wordpress.com/703/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=703&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://vertexmath.wordpress.com/2011/08/01/sections-7-4-7-5/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/217a27f4f1f6787e7f1f926e786559f8?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">jonkujawa</media:title>
		</media:content>
	</item>
		<item>
		<title>Sections 7.1 &#8211; 7.3</title>
		<link>http://vertexmath.wordpress.com/2011/07/29/sections-7-1-7-3/</link>
		<comments>http://vertexmath.wordpress.com/2011/07/29/sections-7-1-7-3/#comments</comments>
		<pubDate>Fri, 29 Jul 2011 12:00:14 +0000</pubDate>
		<dc:creator>alyssaleone</dc:creator>
				<category><![CDATA[Math]]></category>

		<guid isPermaLink="false">http://vertexmath.wordpress.com/?p=645</guid>
		<description><![CDATA[Chapter 7 explores divide-and-conquer algorithms, which are exactly what they sound like: breaking a problem into smaller problems, then finding a solution to the original problem using the solutions to the smaller problems. Secion 7.1 revisits the Sorting problem from chapter 2, when we looked at an algorithm requiring quadratic time.  With the new divide-and-conquer [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=645&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Chapter 7 explores divide-and-conquer algorithms, which are exactly what they sound like: breaking a problem into smaller problems, then finding a solution to the original problem using the solutions to the smaller problems.</p>
<p>Secion 7.1 revisits the Sorting problem from chapter 2, when we looked at an algorithm requiring quadratic time.  With the new divide-and-conquer approach, we will be able to find an algorithm requiring <img src='http://s0.wp.com/latex.php?latex=O%28n%5Clog%7Bn%7D%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(n&#92;log{n})' title='O(n&#92;log{n})' class='latex' /> time.  It looks like there are two keys to this divide-and-conquer sorting algorithm: first, if we&#8217;re given two sorted lists of integers it is relatively easy (<img src='http://s0.wp.com/latex.php?latex=O%28n%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(n)' title='O(n)' class='latex' />) to combine the two into one sorted list; second, a single-element list is, by definition, already sorted. Therefore it makes sense to construct a recursive algorithm which breaks down a list of integers into single-element lists, then take these sorted lists and repeatedly combine them to form the desired sorted list.</p>
<p>The algorithms themselves are fairly straightforward, and I can follow the first explanation of the <img src='http://s0.wp.com/latex.php?latex=O%28n%5Clog%7Bn%7D%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(n&#92;log{n})' title='O(n&#92;log{n})' class='latex' /> running time, but given the recurrence relation on page 230 I don&#8217;t think I could have come up with <img src='http://s0.wp.com/latex.php?latex=O%28n%5Clog%7Bn%7D%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(n&#92;log{n})' title='O(n&#92;log{n})' class='latex' /> on my own.  The second explanation is more intuitive, but the semantics in the last paragraph of section 7.1 seemed inconsistent.  He uses the term &#8220;recursion tree&#8221; to refer to the entire graph (which is not a mathematical tree, based on my understanding) in Figure 7.1, but goes on to state that it has 3 levels, which would indicate that <em>either</em> the Divide <em>or</em> the Conquer portion is the actual recursion tree.   Also, it looks like he uses &#8220;top level&#8221; (line 4) to refer to the level that is, strictly speaking, the second level from the bottom.  For those more familiar with computer science, is this typical?</p>
<p>In 7.2, the goal is to reduce the <em>space</em> complexity of the sequence alignment algorithm.  It is important to note that the algorithm presented will only compute the score of an alignment, not the alignment itself.  The implication seems to be that if we need the actual alignment, the space complexity remains <img src='http://s0.wp.com/latex.php?latex=O%28mn%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(mn)' title='O(mn)' class='latex' />.</p>
<p>Previously, in order to find the longest path in an edit graph, we needed the entire backtracking matrix, which again requires <img src='http://s0.wp.com/latex.php?latex=O%28mn%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(mn)' title='O(mn)' class='latex' /> space.  In order to do this, we need to come up with a way to find the longest path in the edit graph without using a backtracking matrix.  The new technique depends on the fact that we can find the middle vertex of the longest path without actually knowing the longest path &#8211; all we need to do is find the length of the longest path from the source to each vertex in the middle column (<em>prefix</em>(<em>i</em>)<em>, </em>where <em>i </em>indexes each vertex in the middle column), and the length of the longest path from each vertex in the middle column to the sink (<em>suffix</em>(<em>i</em>)).  Then, since the length of the longest path from source to sink passing through vertex <em>i</em> in the middle column is <em>length</em>(<em>i</em>) = <em>prefix</em>(<em>i</em>) + <em>suffix</em>(<em>i</em>), maximizing <em>length</em>(<em>i</em>) over all <em>i </em>gives the length of the longest path from source to sink and, more importantly, determines the middle vertex in this path.</p>
<p>It took me a while to understand why this reduces the space requirement &#8211; i.e., why no backtracking matrix is required &#8211; but I think the key is that in finding this middle vertex, all we need are the lengths of two longest paths.  Since we don&#8217;t need the actual paths, we don&#8217;t need a backtracking matrix.</p>
<p>It&#8217;s easy to see that continuing this process &#8211; repeatedly finding middle vertices &#8211; will eventually give us each vertex in the longest path, hence the path itself.  Since this process must terminate, and each stage requires time equal to the rectangles to the left and right of the current middle column, the time required to compute the longest path is a partial sum of the geometric series with common ration 1/2 and first term equal to the area of the edit graph, <em>nm</em>, hence is less than 2<em>nm</em>.  So this algorithm requires <img src='http://s0.wp.com/latex.php?latex=O%28mn%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(mn)' title='O(mn)' class='latex' /> time.  We&#8217;re also told that it requires <img src='http://s0.wp.com/latex.php?latex=O%28n%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(n)' title='O(n)' class='latex' /> space, since to compute the alignment scores in a column we only use the scores from the preceding column, so we only need 2<em>n</em> values.</p>
<p>Section 7.3 revisits the Longest Common Subsequence problem and develops an algorithm with <img src='http://s0.wp.com/latex.php?latex=O%28%5Cfrac%7Bn%5E2%7D%7B%5Clog%7Bn%7D%7D%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(&#92;frac{n^2}{&#92;log{n}})' title='O(&#92;frac{n^2}{&#92;log{n}})' class='latex' /> running time, using block alignment and the Four-Russians technique.  As a side note, I looked up an article on the <a href="http://www.cl.cam.ac.uk/~hk331/Supervision/Lent09_Cambridge/BioInfo/Gusfield97_FourRussians.pdf" target="_blank">Four-Russians technique</a> to find out why it&#8217;s called that &#8211; it turns out that the technique comes from a paper by four authors, but only one is Russian&#8230;</p>
<p>Block alignment of two DNA sequences &#8211; in our case, both are assumed to have length <em>n</em> &#8211; is simply dividing the grid formed by the sequences into subgrids of size <em>t </em>x <em>t</em>, where <em>t</em> divides <em>n</em>.  A block path is one which enters and leaves each square at the corners.  The block alignment problem is to find the longest block path through the graph.  To do this, we must find the alignment score for each of the squares, which means solving <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7Bn%5E2%7D%7Bt%5E2%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;frac{n^2}{t^2}' title='&#92;frac{n^2}{t^2}' class='latex' /> alignment problems, then piecing together the alignment scores to find the longest path.  Correct me if I&#8217;m wrong, but I think this ends up requiring <img src='http://s0.wp.com/latex.php?latex=O%28n%5E2+%2B+%5Cfrac%7Bn%5E2%7D%7Bt%5E2%7D%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(n^2 + &#92;frac{n^2}{t^2})' title='O(n^2 + &#92;frac{n^2}{t^2})' class='latex' /> time. However, we&#8217;re looking for an algorithm with subquadratic time.</p>
<p>This is where the Four-Russians technique comes in, and in order to apply it we must have <img src='http://s0.wp.com/latex.php?latex=t+%5Capproxeq+%5Clog%7Bn%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='t &#92;approxeq &#92;log{n}' title='t &#92;approxeq &#92;log{n}' class='latex' />.  The key lies in constructing <img src='http://s0.wp.com/latex.php?latex=4%5Et+%5Ctimes+4%5Et&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='4^t &#92;times 4^t' title='4^t &#92;times 4^t' class='latex' /> minialignments and storing each alignment score in a lookup table &#8211; if we assume <img src='http://s0.wp.com/latex.php?latex=t%3D%5Cfrac%7B%5Clog%7Bn%7D%7D%7B4%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='t=&#92;frac{&#92;log{n}}{4}' title='t=&#92;frac{&#92;log{n}}{4}' class='latex' /> (which, since it&#8217;s a scalar multiple I think that for running time this counts as <img src='http://s0.wp.com/latex.php?latex=t+%5Capproxeq+%5Clog%7Bn%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='t &#92;approxeq &#92;log{n}' title='t &#92;approxeq &#92;log{n}' class='latex' />), this lookup table will have only <em>n</em> entries.  The running time calculations make use of <img src='http://s0.wp.com/latex.php?latex=t+%5Capproxeq+%5Clog%7Bn%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='t &#92;approxeq &#92;log{n}' title='t &#92;approxeq &#92;log{n}' class='latex' />:</p>
<p>Computing the score for each <em>t x t</em> minialignment takes <img src='http://s0.wp.com/latex.php?latex=O%28t%5E2%29%3DO%28%28%5Clog%7Bn%7D%29%5E2%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(t^2)=O((&#92;log{n})^2)' title='O(t^2)=O((&#92;log{n})^2)' class='latex' /> time, so with <em>n</em> entries it takes <img src='http://s0.wp.com/latex.php?latex=O%28n%28%5Clog%7Bn%7D%29%5E2%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(n(&#92;log{n})^2)' title='O(n(&#92;log{n})^2)' class='latex' /> running time.  Apparently, though, the overall running time is dominated by the <img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7Bn%5E2%7D%7Bt%5E2%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='&#92;frac{n^2}{t^2}' title='&#92;frac{n^2}{t^2}' class='latex' /> accesses (could someone explain why there&#8217;s that number of accesses?), and each access takes <img src='http://s0.wp.com/latex.php?latex=O%28%5Clog%7Bn%7D%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(&#92;log{n})' title='O(&#92;log{n})' class='latex' /> time, so the overall running time is <img src='http://s0.wp.com/latex.php?latex=O%28%5Clog%7Bn%7D%5Ccdot%5Cfrac%7Bn%5E2%7D%7Bt%5E2%7D%29%3DO%28%5Clog%7Bn%7D%5Ccdot%5Cfrac%7Bn%5E2%7D%7B%28%5Clog%7Bn%7D%29%5E2%7D%29%3DO%28%5Cfrac%7Bn%5E2%7D%7B%5Clog%7Bn%7D%7D%29&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='O(&#92;log{n}&#92;cdot&#92;frac{n^2}{t^2})=O(&#92;log{n}&#92;cdot&#92;frac{n^2}{(&#92;log{n})^2})=O(&#92;frac{n^2}{&#92;log{n}})' title='O(&#92;log{n}&#92;cdot&#92;frac{n^2}{t^2})=O(&#92;log{n}&#92;cdot&#92;frac{n^2}{(&#92;log{n})^2})=O(&#92;frac{n^2}{&#92;log{n}})' class='latex' />.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/vertexmath.wordpress.com/645/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/vertexmath.wordpress.com/645/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/vertexmath.wordpress.com/645/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/vertexmath.wordpress.com/645/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/vertexmath.wordpress.com/645/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/vertexmath.wordpress.com/645/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/vertexmath.wordpress.com/645/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/vertexmath.wordpress.com/645/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/vertexmath.wordpress.com/645/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/vertexmath.wordpress.com/645/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/vertexmath.wordpress.com/645/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/vertexmath.wordpress.com/645/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/vertexmath.wordpress.com/645/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/vertexmath.wordpress.com/645/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=645&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://vertexmath.wordpress.com/2011/07/29/sections-7-1-7-3/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/be5c35fe91c3473e8e38aec35da25ed8?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">alyssaleone</media:title>
		</media:content>
	</item>
		<item>
		<title>Sections 6.13-6.15</title>
		<link>http://vertexmath.wordpress.com/2011/07/27/sections-6-13-6-15/</link>
		<comments>http://vertexmath.wordpress.com/2011/07/27/sections-6-13-6-15/#comments</comments>
		<pubDate>Wed, 27 Jul 2011 17:38:20 +0000</pubDate>
		<dc:creator>calcnerd76</dc:creator>
				<category><![CDATA[Math]]></category>

		<guid isPermaLink="false">http://vertexmath.wordpress.com/?p=697</guid>
		<description><![CDATA[Sorry it took me so long to post. I recently got a new job and I have been considerably busier than I expected. Anyway, in section 6.13 the book discusses the similarity-based approach to gene prediction. In this method, previously sequenced genes are used as a template for recognizing unknown genes. According to the book, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=697&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><span style="font-size:small;"><span style="font-family:Calibri;">Sorry it took me so long to post. I recently got a new job and I have been considerably busier than I expected. Anyway, in section 6.13 the book discusses the similarity-based approach to gene prediction. In this method, previously sequenced genes are used as a template for recognizing unknown genes. According to the book, this boils down to the problem of finding a set of candidate exons whose splicing will best fit the target. </span></span></p>
<p><span style="font-size:small;"><span style="font-family:Calibri;">They model a putative exon with a weighted interval. Its weight is supposed to reflect the actual biological likelihood that the interval is an exon. They display the problem using a graph like the one seen on page 202. Again, the dynamic programming technique is utilized, and the longest path ending at vertex <img src='http://s0.wp.com/latex.php?latex=v_%7Bi%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='v_{i}' title='v_{i}' class='latex' /> is determined. This is repeated for every vertex from 1 to 2n with <img src='http://s0.wp.com/latex.php?latex=v_%7B2n%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='v_{2n}' title='v_{2n}' class='latex' /> being the solution to the problem. </span></span></p>
<p><span style="font-size:small;"><span style="font-family:Calibri;">Of course, a common theme is that this algorithm has problems with its biological application. For instance, the endpoints of the putative exons are not well defined. Also, the optimal chain may not actually be a valid alignment.</span></span></p>
<p><span style="font-size:small;"><span style="font-family:Calibri;">In section 6.14, they discuss the spliced alignment approach. In it, a related protein within one genome is used to reconstruct the exon-intron structure of a gene in another genome. They use the example of “It was brilliant thrilling morning and the slimy, hellish, lithe doves gyrated and gambled nimbly in the waves” as the genome sequence and Lewis Carroll’s line “twas brillig, and the slithy toves did gyre and gimble in the wabe” as the target sequence. </span></span></p>
<p><span style="font-size:small;"><span style="font-family:Calibri;">They then discuss an algorithm that computes the similarity score between the i-prefix of the spliced alignment graph and the j-prefix of the target sequence. The score is computed until the end of the sequence, at which point the maximum over all possible blocks is the solution. Again this has problems, one of which is the mosaic effect. If the number of short blocks is high, then chains of these blocks can replace actual exons in spliced alignments. This occurs because it is more probable to create a long string from many short strings than from the same number of longer strings. To avoid these problems, the book suggests that candidate exons should be subjected to some filtering procedure.</span></span></p>
<p><span style="font-size:small;"><span style="font-family:Calibri;">In section 6.15, they present an interesting history on dynamic programming algorithms. </span></span></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/vertexmath.wordpress.com/697/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/vertexmath.wordpress.com/697/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/vertexmath.wordpress.com/697/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/vertexmath.wordpress.com/697/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/vertexmath.wordpress.com/697/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/vertexmath.wordpress.com/697/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/vertexmath.wordpress.com/697/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/vertexmath.wordpress.com/697/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/vertexmath.wordpress.com/697/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/vertexmath.wordpress.com/697/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/vertexmath.wordpress.com/697/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/vertexmath.wordpress.com/697/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/vertexmath.wordpress.com/697/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/vertexmath.wordpress.com/697/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=697&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://vertexmath.wordpress.com/2011/07/27/sections-6-13-6-15/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/505c56f4fd30fb44191427c9049645ff?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">calcnerd76</media:title>
		</media:content>
	</item>
		<item>
		<title>Sections 6.11-6.12</title>
		<link>http://vertexmath.wordpress.com/2011/07/21/sections-6-11-6-12/</link>
		<comments>http://vertexmath.wordpress.com/2011/07/21/sections-6-11-6-12/#comments</comments>
		<pubDate>Thu, 21 Jul 2011 00:05:12 +0000</pubDate>
		<dc:creator>lajo9062</dc:creator>
				<category><![CDATA[Math]]></category>

		<guid isPermaLink="false">http://vertexmath.wordpress.com/?p=636</guid>
		<description><![CDATA[So, it seems to me, that the basic problem at hand in these sections is trying to figure out the best way to predict a particular gene&#8217;s location within a genome. I will go ahead now and admit that, though short and not all that complex, I found the information in these sections to be presented sort [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=636&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>So, it seems to me, that the basic problem at hand in these sections is trying to figure out the best way to predict a particular gene&#8217;s location within a genome. I will go ahead now and admit that, though short and not all that complex, I found the information in these sections to be presented sort of as a jumbled mess. None the less, I&#8217;ll do my best to pick out the main ideas and piece them together as best I can. However, I would greatly appreciate any additional input or comments (especially about the splicing signals/HMM).</p>
<p>6.11 &#8211; Gene Prediction</p>
<p>During transcription, DNA is used as a  template to create a corresponding strand of mRNA. Like DNA, mRNA is simply a string of nucleotides (the only difference being that Uracil is used instead of Thymine). In this mRNA strand, each segment of 3 nucleotides is referred to as a triplet or <em>codon</em>. In 1961, Brenner and Crick found that each of these codons, or 3-nucleotide segments, codes for 1 amino acid. These amino acids then basically string together to form a protein. Yanofsky went on to prove that the gene and its protein product are colinear (meaning that the first codon codes the first amino acid, the second codon codes for the second amino acid, and so on).</p>
<p>And so it was up until Sharp and Roberts&#8217; discovery of <em>split genes </em>in 1977, that most everyone in the biology world thought that proteins were merely encoded by a long string of triplets, each one directly after the other. Sharp and Roberts were studying a particular protein, found in adenovirus, known as hexon. They discovered that this protein was built from 4 separate fragments of the adenovirus genome. These 4 continuous fragments are known as <em>exons, </em>and contain all of the gene&#8217;s to-be-expressed information. The segments between these exons are called <em>introns, </em>and contain all of the information not expressed, or the &#8220;junk&#8221;. The book compares these split genes to a magazine article that begins on the cover page and then continues on page 13 and then picks up again on, say, page 40.</p>
<p>Now, this intron-exon model does not exist in prokaryotic organisms which makes gene prediction for prokaryotes considerably easier  than predicting gene location in eukaryotes. To complicate things even further for gene prediction in eukaryotes: 1) the complexity of the organism is not a direct indication of the complexity of its genome (as the book mentioned,  the salamander has genome 10 times larger than the human, indicating that the salamander&#8217;s genome contains a great deal more of the &#8220;junk DNA&#8221;) and    2) the jumps between different parts of split genes differ from species to species (i.e. the information found on one exon in the human genome may be found split across two exons in the related gene of a mouse).</p>
<p>As for methods of going about predicting gene location, we are introduced to two main approaches. First is the <em>statistical approach</em>, which looks for features that appear frequently in genes and infrequently elsewhere. We will take a closer look at this approach shortly. Second (and perhaps the more promising of the two), we have the<em> similarity-based approach</em>, which is based on the idea that a newly sequenced gene has a decent chance of being related to an already known gene. The similarity-based approach is covered in greater depth in section 6.13.</p>
<p>Section 6.12 &#8211; Statistical Approaches to Gene Prediction</p>
<p>In this section, we look at attempting to locate a gene by detecting small statistical variations between the exons and introns in our mRNA strand. One method, and seemingly the simplest, is to look at Open Reading Frames (ORFs). An ORF is any segment of the mRNA strand that begins with the start codon, ATG, and ends with one of the three “stop” codons (TAA, TAG, or TGA). ORFs can overlap due to the three different “reading frames” (starting at positions 1, 2, and 3) and the additional three “reading frames” on the reverse strand. Typically, “junk DNA” contains a considerably greater amount of the stop codons than does the coding-region of DNA. Thus, any ORF that is reasonably long enough (meaning there were not a considerable amount of stop codons present) could indicate a potential gene. Unfortunately, searching for lengthy ORFs may cause any short genes or genes with short exons to go unnoticed.</p>
<p>Biases in codon usage is one of the statistical features used in many of the statistical gene prediction algorithms. With 64 possible triplet combinations and only 20 standard amino acids, it is obvious that more than one triplet or codon will code for a particular amino acid. As such, merely knowing the sequence of amino acids tells us very little about the sequence of nucleotides composing our mRNA. This is where using the statistical biases comes into play. The codon usage array (page 199) shows which of those triplets is most likely to occur. Note that the codon usage arrays differ for coding and noncoding-DNA, as well as from species to species. As I understand it, if we are given an ORF containing the amino acid Arg, and by consulting our usage array we see that CGC codes for Arg 38% of the time and AGG codes for Arg only 3% of the time, then it is reasonable to assume that our mRNA strand more likely contains the triplet CGC. Sliding down our genomic sequence, we can often relate peaks in likelihood to genes. Unfortunately, however, this and other methods, such as Borodovsky’s in-frame hexamer count, though beneficial for gene prediction in prokaryotes, are often greatly complicated by the eukaryote’s exon-intron model. Often exons in eukaryotes are too short to produce any reliable peaks in the likelihood plot. As a result, many researchers have turned to studying splicing signals at exon-intron junctions using Hidden Markov Models, such as GENSCAN developed in 1997 by Burge and Karlin. This too, though, is not always completely accurate.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/vertexmath.wordpress.com/636/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/vertexmath.wordpress.com/636/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/vertexmath.wordpress.com/636/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/vertexmath.wordpress.com/636/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/vertexmath.wordpress.com/636/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/vertexmath.wordpress.com/636/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/vertexmath.wordpress.com/636/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/vertexmath.wordpress.com/636/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/vertexmath.wordpress.com/636/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/vertexmath.wordpress.com/636/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/vertexmath.wordpress.com/636/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/vertexmath.wordpress.com/636/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/vertexmath.wordpress.com/636/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/vertexmath.wordpress.com/636/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=636&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://vertexmath.wordpress.com/2011/07/21/sections-6-11-6-12/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/9fa0c80f400788e480c3fa9182c02cde?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">lajo9062</media:title>
		</media:content>
	</item>
		<item>
		<title>sections 6.9 &#8211; 6.10</title>
		<link>http://vertexmath.wordpress.com/2011/07/17/sections-6-9-6-10/</link>
		<comments>http://vertexmath.wordpress.com/2011/07/17/sections-6-9-6-10/#comments</comments>
		<pubDate>Sun, 17 Jul 2011 22:45:50 +0000</pubDate>
		<dc:creator>oreadex</dc:creator>
				<category><![CDATA[Math]]></category>

		<guid isPermaLink="false">http://vertexmath.wordpress.com/?p=613</guid>
		<description><![CDATA[Firstly, I would like to make  apologies before hand for any delays, and how short this summary is going to be. I have a really really bad cold, and I do not seem to be showing any sign of relief yet. If anyone needs further explanation on anything about these sections, feel free to add a comment. Anyway, back [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=613&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Firstly, I would like to make  apologies before hand for any delays, and how short this summary is going to be. I have a really really bad cold, and I do not seem to be showing any sign of relief yet. If anyone needs further explanation on anything about these sections, feel free to add a comment. Anyway, back to business.</p>
<p>Section 6.9 touches on affine gap penalties as an example of a nicer approach to penalizing gaps in alignments. According to the book, it is a linearly weighted score for large gaps. It is easy and common sense to make adjustments on edit graphs (discussed in previous sections) for alignments with affine gap penalties by adding vertical and horizontal edges the length of the gap score from any vertex to another south or east of it. Also, increasing the number of edges in the edit graph does increase the running time for the alignment algorithm but not by a whole lot. Lastly, the book does explain how to reduce alignments with affine gap penalties to the Manhattan Tourist problem.</p>
<p>Section 6.10 introduces the idea of multiple alignment as opposed to pairwise alignment in cases where pairwise alignment is not always the best way to find biologically related sequences, especially when sequences have weak similarities between them.  To be honest, it was not necessary to include that much to define multiple alignment, I would have understood it as well if they said it was similar to pairwise alignment, but with more  than two sequences, and no column should contain only spaces. The rest of this section shows different methods to finding out the string with the most common characters in each column of the alignment, their flaws, and any improvements.  A programming algorithm involving <em>k-</em>dimensional edit graph would solve it, but the running time increases by a lot as<em> k</em> gets bigger. Computing optimal pairwise alignments between every pair of strings and then combine them together into a multiple alignment would not really work because some pairwise alignments are incompatible.</p>
<p>In fact, all the approaches discussed in this section have one issue or the other, but at least they work. This just shows that it is difficult to attain perfection when it comes to algorithms. However, there is always room for improvement, and as time goes on, someone always comes up with a new idea that makes everything better.</p>
<h4></h4>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/vertexmath.wordpress.com/613/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/vertexmath.wordpress.com/613/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/vertexmath.wordpress.com/613/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/vertexmath.wordpress.com/613/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/vertexmath.wordpress.com/613/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/vertexmath.wordpress.com/613/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/vertexmath.wordpress.com/613/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/vertexmath.wordpress.com/613/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/vertexmath.wordpress.com/613/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/vertexmath.wordpress.com/613/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/vertexmath.wordpress.com/613/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/vertexmath.wordpress.com/613/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/vertexmath.wordpress.com/613/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/vertexmath.wordpress.com/613/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=613&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://vertexmath.wordpress.com/2011/07/17/sections-6-9-6-10/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/ed6c1b119bca55b6243abbc1b30c5055?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">oreadex</media:title>
		</media:content>
	</item>
		<item>
		<title>Sections 6.6-6.8</title>
		<link>http://vertexmath.wordpress.com/2011/07/15/sections-6-6-6-8/</link>
		<comments>http://vertexmath.wordpress.com/2011/07/15/sections-6-6-6-8/#comments</comments>
		<pubDate>Fri, 15 Jul 2011 06:20:40 +0000</pubDate>
		<dc:creator>lmaingi</dc:creator>
				<category><![CDATA[Math]]></category>

		<guid isPermaLink="false">http://vertexmath.wordpress.com/?p=608</guid>
		<description><![CDATA[Sorry for the delay. I&#8217;ve been busier than expected these past few days, and I found I had way more to say about this than expected. The overarching goal of these 3 sections is to find a way to measure how &#8216;similar&#8217; two strings of either nucleotides or of amino acids is, continuing the ideas [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=608&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Sorry for the delay. I&#8217;ve been busier than expected these past few days, and I found I had way more to say about this than expected.</p>
<p>The overarching goal of these 3 sections is to find a way to measure how &#8216;similar&#8217; two strings of either nucleotides or of amino acids is, continuing the ideas from the previous sections. The real mathematical difficulty here is to capture the notion of similarity with a mathematical definition which is both robust and efficiently computable.</p>
<p>In sections 6.4 and 6.5 we considered adding &#8216;gaps&#8217;  to the Hamming distance to get a better notion of the distance between two strings. In 6.6, we consider a generalization of this method. Let δ be a (k+1) x (k+1) matrix, called the scoring matrix, where k is the number of letters in the alphabet we are using (whether they be nucleotides or amino acids). We&#8217;ll abuse notation a bit and write δ(x,y) for the entries of δ, where x and y are either letters or the gap character &#8216;-&#8217;.<strong></strong><strong></strong></p>
<p>The idea is that we&#8217;ll parse through the strings as we did before, either diagonally or treating the character as a gap, and at each step we add δ(x,y) to a running total, where x and y are characters extracted from the respective strings, or the gap. For a particular way to parse the strings, we get a number, which I&#8217;ll call the similarity of the two strings, and then we minimize that number over all possible paths. Algorithmically, there&#8217;s a nice way to do this, which I won&#8217;t discuss since it&#8217;s rather prominently placed in the chapter.</p>
<p>While this certainly is better than any of the previous methods, and is probably in a form that is usable as is, it isn&#8217;t terribly satisfying to me. At the very least, using it to predict the probability of a particular mutation occurring over a long period of time is certainly not very accurate, so there&#8217;s a question as to whether this represents the actual &#8220;biological similarity&#8221; of the sequences (if such a thing actually exists). To explain better, I&#8217;d need to draw an analogy to statistical mechanics, which I expect won&#8217;t be familiar to most people here. If there&#8217;s any interest in this analogy (which I think is pretty interesting, but extremely tangential) then I can post it in the comments. In the analogy, the similarity we have come up with corresponds to the limit that the &#8216;temperature&#8217; (analogous to mutation rate) goes to 0. On the other hand, my idea is really of only theoretical interest, since it doesn&#8217;t lend itself to an algorithmic solution.</p>
<p>A special case of this solution is the  case where δ has diagonal elements 1 in all but the &#8216;-&#8217; row and column, -μ off-diagonal elements in these rows and columns, and -σ in the last row and column, for real μ, σ (typically nonnegative). This doesn&#8217;t seem to actually speed up the algorithm significantly, but it certainly reduces the amount of memory required. It&#8217;s useful for DNA sequence comparisons, but less so for protein sequences because these can often change quite a lot and still be qualitatively very similar.</p>
<p>An observation I made when reading is that, if the diagonal elements (other δ(-,-), which is unimportant) are all equal and positive, and if δ is symmetric, and all the off-diagonal elements are less than the diagonal elements, then given the set of strings of fixed length n, we can do an affine transformation on the similarity function to make it into a metric space. If we allow the affine function do also be a function of the lengths of the two strings (affine in each of these parameters as well) then we can make the set of all strings into a metric space. The assumptions all seem somewhat reasonable, but I&#8217;m no expert in biology. I don&#8217;t know if this is useful, but metric spaces are nice enough things that it&#8217;s plausible that it has some application.</p>
<p>Section 6.7 deals with the problem of comparing protein sequences. They first explain an algorithm that one might think up to compute good values of δ. The big problem with this &#8220;algorithm&#8221; is that it needs δ to compute δ. So if we don&#8217;t already know what δ should be, the algorithm won&#8217;t actually give us δ. We might expect that this means it&#8217;s useless, but in fact in many cases we can start with a guess, and if it&#8217;s good enough the algorithm will make it better. We can think of the algorithm as a continuous function <img src='http://s0.wp.com/latex.php?latex=f%3A+M_%7Bk%2B1%7D+%5Crightarrow+M_%7Bk%2B1%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='f: M_{k+1} &#92;rightarrow M_{k+1}' title='f: M_{k+1} &#92;rightarrow M_{k+1}' class='latex' />, and study the dynamical system generated by repeated application of <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='f' title='f' class='latex' />. Our desired δ is a fixed point of <img src='http://s0.wp.com/latex.php?latex=f&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='f' title='f' class='latex' />. If it is attracting on some &#8220;large&#8221; subset of <img src='http://s0.wp.com/latex.php?latex=M_%7Bk%2B1%7D&amp;bg=ffffff&amp;fg=4b5d67&amp;s=0' alt='M_{k+1}' title='M_{k+1}' class='latex' /> then the algorithm applied iteratively will converge to δ if we start with a member of this &#8220;large&#8221; subset. There are several interesting analytical issues here (most notably proving that the fixed point is attracting), but this is digressing a lot. I did try to prove this for about 5 minutes, but wasn&#8217;t successful.</p>
<p>Section 6.7 also gives an alternate way to get a good δ, which is essentially based on statistical analysis of protein sequences. It should be fairly straightforward to understand, and I don&#8217;t have much to say about it.</p>
<p>Section 6.8 discusses the problem of finding substrings of two given strings which are very similar. In many cases, we can&#8217;t expect the strings to be similar overall, but certain parts should be very similar. This is called the local alignment problem (as opposed to the global problem). The problem has the same complexity as the global problem. I honestly didn&#8217;t find that too surprising, but apparently the authors did. It only requires a small modification of the method from section 6.6, so I&#8217;ll leave out the in-depth description since everyone will read it in the book.</p>
<p>In all honesty, this solution isn&#8217;t terribly mathematically elegant to me, because I don&#8217;t see any reason to expect that the similarities of pairs of strings of different lengths are comparable the way we&#8217;ve defined it. If anyone sees a good argument as to why they should be, that&#8217;d be interesting.</p>
<p>These algorithms are probably robust enough to work for biological data, but I expect more specialized algorithms will outperform them most of the time.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/vertexmath.wordpress.com/608/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/vertexmath.wordpress.com/608/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/vertexmath.wordpress.com/608/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/vertexmath.wordpress.com/608/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/vertexmath.wordpress.com/608/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/vertexmath.wordpress.com/608/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/vertexmath.wordpress.com/608/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/vertexmath.wordpress.com/608/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/vertexmath.wordpress.com/608/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/vertexmath.wordpress.com/608/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/vertexmath.wordpress.com/608/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/vertexmath.wordpress.com/608/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/vertexmath.wordpress.com/608/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/vertexmath.wordpress.com/608/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=vertexmath.wordpress.com&amp;blog=7721056&amp;post=608&amp;subd=vertexmath&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://vertexmath.wordpress.com/2011/07/15/sections-6-6-6-8/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/15ba29b95750c2801daa19f4879376e9?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">lmaingi</media:title>
		</media:content>
	</item>
	</channel>
</rss>
