<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>bio+tech</title>
	<atom:link href="http://www.bioplustech.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.bioplustech.com</link>
	<description>monthly meetup in SF</description>
	<lastBuildDate>Sat, 19 May 2012 22:07:32 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>The Day-long Think Tank</title>
		<link>http://www.bioplustech.com/2012/05/the-day-long-think-tank/</link>
		<comments>http://www.bioplustech.com/2012/05/the-day-long-think-tank/#comments</comments>
		<pubDate>Mon, 14 May 2012 02:07:25 +0000</pubDate>
		<dc:creator>Mo</dc:creator>
				<category><![CDATA[bioinformatics]]></category>
		<category><![CDATA[hackathon]]></category>

		<guid isPermaLink="false">http://www.bioplustech.com/?p=87</guid>
		<description><![CDATA[With the summer blockbuster film season starting we are getting to see teams of superheroes come together and save the earth by putting aside their differences and working together for brief periods of time. The traditional view of the life sciences &#8230; <a href="http://www.bioplustech.com/2012/05/the-day-long-think-tank/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>With the summer blockbuster film season starting we are getting to see teams of superheroes come together and save the earth by putting aside their differences and working together for brief periods of time. The traditional view of the life sciences is one where slow, steady progress is made over the lifespan of ones career. This is how innovation and discoveries are made, or so we were taught <a href="http://www.des.emory.edu/mfp/kuhnobit.html" target="_blank">[1]</a>.</p>
<div id="attachment_91" class="wp-caption aligncenter" style="width: 470px"><a href="http://www.salon.com/2011/10/12/atom_bomb_cheeseburger/singleton/"><img class="size-full wp-image-91 " title="manhattan-project" src="http://www.bioplustech.com/wp-content/uploads/2012/05/manhattan-project-png-460x307.png" alt="" width="460" height="307" /></a><p class="wp-caption-text">Manhattan Project Scientists in 1940 Meeting at UC Berkeley</p></div>
<p>However, recent advances in other fields, have shown us that this is not the only plausible model. Hackathons, are usually a day or two day long gatherings where software engineers come together to either work on a guided problem or on their own projects. These gatherings have had very notable successes of late, spinning off many companies, new tools, and significant discoveries <a href="http://techcrunch.com/2010/08/26/inception-a-hackday-dream-the-story-of-groupme/" target="_blank">[2]</a>. It has been said, more frequently of late, that more than just in technology the information, consumer, and social web have advanced the way technical organizations operate. In many ways not only are the products of these companies evolving at a faster rate than the traditional life sciences, the organizational structures and operational protocols themselves leave much to be desired in the traditional sciences.</p>
<div id="attachment_96" class="wp-caption aligncenter" style="width: 860px"><a href="http://assemblathon.org/"><img class=" wp-image-96 " title="original" src="http://www.bioplustech.com/wp-content/uploads/2012/05/original.png" alt="" width="850" height="200" /></a><p class="wp-caption-text">The Assemblathon is a set of periodic collaborative efforts that all help improve methods of genome assembly.</p></div>
<p>Hackathons are just the tip of this iceberg, one which we discuss to a relative depth in this piece. As mentioned above, the idea of bringing together the best minds in a particular field for a day or two in order to solve guided or open-ended problems is a stark contrast to the gradual nature of scientific research as usual. Of course the most obvious counter-point to applying this form of problem solving structure to life science research is the nature of working with hardware, and materials, &#8220;wet science&#8221; as it has been dubbed. Here it is prudent to distinguish and acknowledge data generation and analysis as the bottlenecks in life science research today. Whether we are speaking of genomics, pharmacology, or the large hadron collider, researchers are being drowned by a tsunami of data <a href="http://www.scientificcomputing.com/articles-HPC-Confronting-the-Data-Tsunami-040710.aspx" target="_blank">[3]</a>. Additionally, the previously adequate research dissemination structure of scientific publishing within established journals is being exposed as a highly monopolized and inaccessible medium <a href="http://www.michaeleisen.org/blog/?p=1058" target="_blank">[4]</a>. This is but a glimpse into the problems we are faced with. However, it is exactly the information related nature of these obstacles that make them ideal for the concentrated and unpredictable undertakings of a hackathon.</p>
<div id="attachment_100" class="wp-caption aligncenter" style="width: 727px"><a href="http://www.devhouse.org/"><img class="size-full wp-image-100" title="003" src="http://www.bioplustech.com/wp-content/uploads/2012/05/003.jpg" alt="" width="717" height="361" /></a><p class="wp-caption-text">SHDH is a party for hackers and thinkers, combining serious and not-so-serious productivity</p></div>
<p>There are two approaches in guiding solutions via this method, hands on and hands off, just like most other things. The hands on approach involves setting goals, determining metrics of success, and stratifying the participant population. At the  polar end is a <em>laissez</em>-<em>faire</em> approach, where a time, place, and beer are provided, the rest is left up to probability <a href="http://petridishtalk.com/2011/04/17/decided-no-we-just-finished-saying-good-morning-sage-congress-2011/" target="_blank">[5]</a>. The key in both approaches is gathering the right people for the right problem. Though in the latter approach it is of upmost importance, in addition to assuring those involved have a thorough understanding of the nuances of the particular field of focus. In some ways it can be argued that the Manhattan Project was a progenitor of what may yet be the science hackathon, although much more dramatic in scope and time, but perhaps not in what is at stake.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bioplustech.com/2012/05/the-day-long-think-tank/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Closing The Gap Between Computational &amp; Pharmaceutical Innovation</title>
		<link>http://www.bioplustech.com/2012/04/closing-the-gap-between-computational-pharmaceutical-innovation/</link>
		<comments>http://www.bioplustech.com/2012/04/closing-the-gap-between-computational-pharmaceutical-innovation/#comments</comments>
		<pubDate>Tue, 03 Apr 2012 18:39:48 +0000</pubDate>
		<dc:creator>Mo</dc:creator>
				<category><![CDATA[bioinformatics]]></category>
		<category><![CDATA[pharmacogenomics]]></category>

		<guid isPermaLink="false">http://www.bioplustech.com/?p=74</guid>
		<description><![CDATA[When confronted with the mortality of life, it becomes painfully clear that medicine has not been able to keep up with information and computational innovations. At the heart of the problem stands  the drug development process, where an average of &#8230; <a href="http://www.bioplustech.com/2012/04/closing-the-gap-between-computational-pharmaceutical-innovation/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>When confronted with the mortality of life, it becomes painfully clear that medicine has not been able to keep up with information and computational innovations. At the heart of the problem stands  the drug development process, where an average of 5 to 10 years of research and billions of dollars worth of investment often fails to produce a product.</p>
<div id="attachment_923"><a href="http://mokaspetridish.files.wordpress.com/2012/01/screen-shot-2012-01-29-at-11-00-58-pm.png"><img class="aligncenter" title="Drug Probability of Success to Market" src="http://mokaspetridish.files.wordpress.com/2012/01/screen-shot-2012-01-29-at-11-00-58-pm.png?w=500&amp;h=257" alt="Drug Probability of Success to Market" width="500" height="257" /></a></div>
<div><em>Figure 1 | Probability of success to market from key milestones. Data: cohort of 14 companies.</em></div>
<div><em></em>In the past few years, molecules in development have seen a frightening rate of attrition. The most capital and resource intensive period comes during the clinical trials, which can be broken-down into the following stages: Phase I trials evaluate if a new drug is safe, Phase II and Phase III trials assess a drug’s efficacy, monitor side effects, and compare the drug to similar compounds already on market. Recent studies by the Centre for Medicines Research, places Phase II success rates at 18%, lower than at any other time during drug development [1]. Spending on average of $300 million to $1 Billion up until this point of research is par for the course [2].</div>
<div id="attachment_933"><a href="http://mokaspetridish.files.wordpress.com/2012/01/screen-shot-2012-01-29-at-11-38-39-pm.png"><img class="aligncenter" title="Successful Discovery Strategies" src="http://mokaspetridish.files.wordpress.com/2012/01/screen-shot-2012-01-29-at-11-38-39-pm.png?w=300&amp;h=257" alt="Successful Discovery Strategies" width="300" height="257" /></a></div>
<div><em>Figure 2 | Computer-assisted screenings and traditional discovery strategy distributions of new molecular entities (NME). Followers are in the same class as previously approved drugs.</em></div>
<div><em></em>By contrast, computational drug design strategies have made tremendous advances in the new millennia with new tools to identify targets and virtual screening assays. These include structure-based tools to lead identification and optimization utilizing X-ray crystallography. As well as, high-throughput target-based screenings of key protein families like G protein-coupled receptors. Promising indicators of computational drug designs are encouraging new companies to court Big Pharma, who to-date have relied on academia or internal projects for computation. For a company like GeneDrop, even a fraction of the development budget would be adequate to deliver favorable results.</div>
<p>Drug development’s addressable market-size for global corporations such as Novartis or Roche, which have between 20-100 molecules in the pipeline at a given time, is estimated at  $1.11 Trillion in 2011; down from $1.24 Trillion in 2001 [2]. There are approximately ten large pharmaceutical companies and many small ones with one or two late-stage molecules in development.</p>
<div id="attachment_951"><a href="http://mokaspetridish.files.wordpress.com/2012/01/wiki_clustering.png"><img class="aligncenter" title="Early-Stage Computational Drug Design" src="http://mokaspetridish.files.wordpress.com/2012/01/wiki_clustering.png?w=500&amp;h=375" alt="Early-Stage Computational Drug Design" width="500" height="375" /></a></div>
<div><em>Fig 3 | Early-stage computational drug design flow</em></div>
<div><em></em>To-date, most computation in the space has been limited to early-stage research on the discovery of molecules prior to the clinical trial phases. However, the fall in market cap has sent drug companies scrambling as patents on existing blockbuster drugs near expiration, and those in development see increasingly high failure rates. This begs the question: why are computational resources being spent in the early-stage, when most failures occur in the late-stage, during Phase II</div>
<div id="attachment_962"><a href="http://mokaspetridish.files.wordpress.com/2012/01/screen-shot-2012-01-30-at-10-50-10-am.png"><img class="aligncenter" title="Pharmacogenomics" src="http://mokaspetridish.files.wordpress.com/2012/01/screen-shot-2012-01-30-at-10-50-10-am.png?w=300&amp;h=182" alt="Pharmacogenomics" width="300" height="182" /></a></div>
<div><em>Fig 4 | Pharmacogenomics attempts to correlate how individuals will respond to drugs based genomic variability.</em></div>
<div><em></em>As always, cost has been a primary factor. Late-stage computation has meant analysis of bio-metric data, which has been limited to blood-work and questionnaires of trial subjects. The pie in the sky of course, has always been genomics, the price of which was deemed too high. Even up to a couple of years ago, it would cost over $10,000 to sequence an individual. With Phase II and III trials consisting of hundreds to thousands of patients, the method was rarely used. As of the last few months this is no longer the case, with the cost hovering around $5,000 and quickly approaching $1000 per patient.</div>
<p>So, we are faced with an enticing opportunity for information technology to rescue a high-capital, old-world industry. Threading this needle however is no easy task; entrenched industries with high quarterly revenues are notoriously conservative when adopting innovation, especially from the outside. Adding to this is the high barrier of the technical languages of the hard-sciences and the networking culture of global corporations. Luckily both are boundaries which have been broken before in other industries and we can be optimistic; if anyone can break it, it is the passionate and talented.</p>
<p style="text-align: center;"><em><strong>A cross-post by Mo from <a href="http://petridishtalk.com/2012/04/03/closing-the-gap-between-computational-pharmaceutical-innovation/" target="_blank">petridishtalk.com</a></strong></em></p>
<p><em>Citations:</em></p>
<p><em>[1] Trial watch: Phase II failures: 2008–2010 by J. Arrowsmith – Nature Reviews Drug Discovery 10, 328-329 (May 2011) | <abbr title="Digital Object Identifier">doi</abbr>:10.1038/nrd3439</em></p>
<p><em>[2] - Fig 1- A decade of change by J. Arrowsmith – Nature Reviews Drug Discovery 11, 17-18 (January 2012) | <abbr title="Digital Object Identifier">doi</abbr>:10.1038/nrd3630</em></p>
<p><em>[3] – Fig 2- How were new medicines discovered? by David C. Swinney &amp; Jason Anthony - Nature Reviews Drug Discovery 10, 507-519 (July 2011) | <abbr title="Digital Object Identifier">doi</abbr>:10.1038/nrd3480</em></p>
<p><em>[4] – Fig 4 - Genomics in drug discovery and development by Dimitri Semizarov, Eric Blomme (2008) ISBN 0470096047, 9780470096048</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.bioplustech.com/2012/04/closing-the-gap-between-computational-pharmaceutical-innovation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>DNA &amp; Iterated Function Systems</title>
		<link>http://www.bioplustech.com/2012/02/dna-iterated-function-systems/</link>
		<comments>http://www.bioplustech.com/2012/02/dna-iterated-function-systems/#comments</comments>
		<pubDate>Mon, 27 Feb 2012 19:00:17 +0000</pubDate>
		<dc:creator>Mo</dc:creator>
				<category><![CDATA[bioinformatics]]></category>
		<category><![CDATA[genomics]]></category>
		<category><![CDATA[fractals]]></category>
		<category><![CDATA[IFS]]></category>
		<category><![CDATA[informatics]]></category>

		<guid isPermaLink="false">http://www.bioplustech.com/?p=55</guid>
		<description><![CDATA[H. Sapiens Genomic code that makes us is made up of four letters, ATGC. Billions of these letters together creates a lifeform. Iterated function systems (IFS) are anything that can be made by repeating the same simple rules over and over. The &#8230; <a href="http://www.bioplustech.com/2012/02/dna-iterated-function-systems/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<h2 style="text-align: center;"><a href="http://mokaspetridish.files.wordpress.com/2012/02/hscmm5.gif"><img title="HsCmM5" src="http://mokaspetridish.files.wordpress.com/2012/02/hscmm5.gif?w=500" alt="" /></a></h2>
<div>
<div id="attachment_1004">
<p style="text-align: center;">H. Sapiens</p>
</div>
<p style="text-align: left;">Genomic code that makes us is made up of four letters, ATGC. Billions of these letters together creates a lifeform. <a href="http://en.wikipedia.org/wiki/Iterated_function_system" target="_blank">Iterated function systems</a> (IFS) are anything that can be made by repeating the same simple rules over and over. The easiest example being tree branches, add a simple structure repeatedly ad-infinitum and before you know it we have complex and beautiful systems; the popular example being the <a href="http://en.wikipedia.org/wiki/Sierpinski_triangle" target="_blank">Sierpinski Triangle</a> or “triforce” for the Zelda fans. As the cost of DNA sequencing becomes cheaper day by day we are confronted with a tsunami of data and it has become exceedingly difficult to derive meaningful answers from all the information contained within us.</p>
<h2 style="text-align: center;"><a href="http://en.wikipedia.org/wiki/Sierpinski_triangle"><img title="sierpinski" src="http://mokaspetridish.files.wordpress.com/2012/02/sierpinski.gif?w=210&amp;h=181" alt="" width="210" height="181" /></a></h2>
<div>
<div id="attachment_1004">
<p style="text-align: center;">Triforce Power</p>
</div>
</div>
<p>Finding any advantage in ways to organize and view the data helps us discover minute differences between individuals or say a normal cell versus a cancer cell. This is where <a href="http://emboss.sourceforge.net/apps/cvs/emboss/apps/chaos.html" target="_blank">Chaos Game Representation </a>(CGR) becomes helpful, CGR is just a form of IFS that is helpful in mapping <em>seemingly </em>random information, that we suspect or know to have some sort of underlying structure.</p>
<p><a href="http://mokaspetridish.files.wordpress.com/2012/02/screen-shot-2012-02-27-at-1-11-08-am.png"><img class="aligncenter" title="Screen shot 2012-02-27 at 1.11.08 AM" src="http://mokaspetridish.files.wordpress.com/2012/02/screen-shot-2012-02-27-at-1-11-08-am.png?w=175&amp;h=162" alt="" width="175" height="162" /></a></p>
<p>In our case this would be the human genome. Although when looking at the letters coming from our DNA it seems like billions of random babbles, it is of course organized in a manner to give the blueprint for our bodies.  So let’s roll the dice-  do we get any sort of meaningful structure when applying CGR to DNA? If you are so inclined, something fun to try is the following:</p>
<pre><code>genome = Import["c:\data\sequence.fasta", "Sequence"];
genome = StringReplace[ToString[genome], {"{" -&gt; "", "}" -&gt; ""}];
chars = StringCases[genome, "G" | "C" | "T" | "A"];
f[x_, "A"] := x/2;
f[x_, "T"] := x/2 + {1/2, 0};
f[x_, "G"] := x/2 + {1/2, 1/2};
f[x_, "C"] := x/2 + {0, 1/2};
pts = FoldList[f, {0.5, 0.5}, chars];
Graphics[{PointSize[Tiny], Point[pts]}]</code></pre>
<p style="text-align: center;"><a href="http://classes.yale.edu/fractals/IntroToFrac/DrivenIFS/DNADrIFS/DNAIFS/DNAIFS.html"><img title="g1346a094" src="http://mokaspetridish.files.wordpress.com/2012/02/g1346a094.gif?w=500" alt="" /></a></p>
<p style="text-align: center;">g1346a094 on Chromosome 7</p>
<p>For example, reading the sequence in order, apply T1 whenever C is encountered, apply T2 whenever A is encountered, apply T3 whenever T is encountered, and apply T4 whenever G is encountered. Really though any transformations to C, A, T, and G can be used and multiple methods can be compared. Self-similarity is immediately noticeable in these maps, which isn’t all that surprising since <em><a href="http://en.wikipedia.org/wiki/Fractal" target="_blank">fractals</a> </em>are abundant in nature and DNA after all, is a natural syntax. Being aware that these patterns exist within our data, opens us up to some new questions to evaluate if IFS, CGR and fractals in general are helpful tools in the interpretation of genomic data.</p>
<p style="text-align: center;"><a href="http://mokaspetridish.files.wordpress.com/2012/02/163965394_cgr.png"><img title="163965394_CGR" src="http://mokaspetridish.files.wordpress.com/2012/02/163965394_cgr.png?w=150&amp;h=150" alt="" width="150" height="150" /></a></p>
<p style="text-align: center;">Signal transducer 5B (STAT5B), on chromosome 17</p>
<p>Since the mapping is 1-1 and we see patterns emerge, we are hinted that there may be biological relevance; especially because different genes yield different patterns. But what exactly are the correlations between the patterns and the biological functions? It would also be very interesting to see mappings of introns/exons colored differently or color amino acids and various codons. One thing is for sure, genomes aren’t just endless columns and rows of letters, they are pictures. It is much easier to compare pictures and discover variations, which can ultimately allow us to find meaningful interpretations from this invaluable data.</p>
<p style="text-align: center;"><em><strong>A cross-post by Mo from <a href="http://petridishtalk.com/2012/02/27/chaos-game-analysis-of-genomes/" target="_blank">petridishtalk.com </a></strong></em></p>
<p><em>Citations:</em></p>
<p><em>Jeffrey, H. J., “Chaos game visualization of sequences,” Computers &amp; Graphics 16 (1992), 25-33.</em></p>
<p><em>Ashlock, D. Golden, J.B., III. Iterated function system fractals for the detection and display of DNA reading frame (2000) ISBN: 0-7803-6375-2</em></p>
<p><em>VV Nair, K Vijayan, DP Gopinath ANN based Genome Classifier using Frequency Chaos Game Representation (2010)</em></p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.bioplustech.com/2012/02/dna-iterated-function-systems/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title></title>
		<link>http://www.bioplustech.com/2012/02/31/</link>
		<comments>http://www.bioplustech.com/2012/02/31/#comments</comments>
		<pubDate>Wed, 22 Feb 2012 02:25:56 +0000</pubDate>
		<dc:creator>Mo</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.bioplustech.com/?p=31</guid>
		<description><![CDATA[Biotech for Hackers Many software developers have extensive experience and interest in dealing with large data sets, finding correlations  and creating meaningful solutions. However, much of our generation has had little exposure to these problems. Often resulting in the bandwagon effect. Fig &#8230; <a href="http://www.bioplustech.com/2012/02/31/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<h1>Biotech for Hackers</h1>
<div>
<p>Many software developers have extensive experience and interest in dealing with large data sets, finding correlations  and creating meaningful solutions. However, much of our generation has had little exposure to these problems. Often resulting in the bandwagon effect. Fig 1 shows something very similar to social interaction maps one comes across at places like Facebook.</p>
<div id="attachment_471"><a href="http://mokaspetridish.files.wordpress.com/2011/03/picture-1.png"><img title="Picture 1" src="http://mokaspetridish.files.wordpress.com/2011/03/picture-1.png?w=500&amp;h=473" alt="" width="500" height="473" /></a>Fig 1: Interaction map of genes implicated in Alzheimer&#8217;s. Genes were grouped by those that have similar functions (squares) and those with different functions (circles). Modules with a red border have high confidence interactions. While the weight of the connecting green lines corresponds to the number of interactions between two sets.</p>
</div>
<p>The map above is of individual gene relationships where an algorithm began with 12 <em>seed genes </em>that previous experiments have shown to play a role in Alzheimer’s disease. These seeds were compared with 185 new candidate genes from regions deemed susceptible to carrying Alzheimer’s genes. From here, both experimental and computational data was combined to generate Fig 1, which the authors dubbed AD-PIN (<em>Alzheimer’s Disease Protein Interaction Network</em>).</p>
<p>A low hurdle to entry along with the ability to iterate rapidly is key to taking on problems &amp; creating solutions. What do these solutions look like in genomics and why can hackers lead the way? Progress has often been linked to literacy, from books to programming, being able to read and write in life-code just might be the next stage.</p>
<blockquote><p>A cross-post by Mo from <a href="http://petridishtalk.com">petridishtalk.com</a></p>
<p><em>Original published study: <a href="http://mokaspetridish.files.wordpress.com/2011/03/genome-res-2011-soler-lc3b3pez-364-76.pdf">Interactome mapping suggests new mechanistic details underlying Alzheimer’s disease by Soler-Lopez et al.</a></em></p></blockquote>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.bioplustech.com/2012/02/31/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Try It, You&#8217;ll Like It</title>
		<link>http://www.bioplustech.com/2012/02/try-it-youll-like-it/</link>
		<comments>http://www.bioplustech.com/2012/02/try-it-youll-like-it/#comments</comments>
		<pubDate>Tue, 21 Feb 2012 02:31:47 +0000</pubDate>
		<dc:creator>Mo</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.bioplustech.com/?p=38</guid>
		<description><![CDATA[Normal Conference VS Developer Conference. SHDH Illustrated by Derek Yu Attending gatherings for software developers in silicon valley, their hackathons leave much to be desired at bio events like Sagecon, the least of which being the beer. Hopefully this will &#8230; <a href="http://www.bioplustech.com/2012/02/try-it-youll-like-it/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p><a href="http://superhappydevhouse.org/"><img title="2126533900_30e4e1cd4f_z" src="http://mokaspetridish.files.wordpress.com/2011/04/2126533900_30e4e1cd4f_z.jpeg?w=500&amp;h=426" alt="" width="500" height="426" /></a></p>
<p>Normal Conference VS Developer Conference. <a href="http://superhappydevhouse.org">SHDH</a> Illustrated by Derek Yu</p>
<p>Attending gatherings for software developers in silicon valley, their hackathons leave much to be desired at bio events like <a href="http://sagebase.org/">Sagecon</a>, the least of which being the beer.</p>
<p>Hopefully this will be a fun tool for folks not well acquainted with genomics/programming to sandbox and explore in.</p>
<p>The National Center for Biotechnology Information (NCBI) provides a command line based standalone Basic Local Alignment Search Tool (BLAST) package known as BLAST+ to analyze and play with genomic sequence data. Although, the legacy web based BLAST can perform a range of functions, BLAST+ as a command line tool is much better to understand and analyze large amounts of nucleotide data. It may be best to get an idea of what sort of data we’re dealing with by getting into the government’s database:</p>
<blockquote>
<pre>mokas$ ftp ftp.ncbi.nlm.nih.gov
Connected to ftp.wip.ncbi.nlm.nih.gov.
220-
 Warning Notice!

 This is a U.S. Government computer system, which may be accessed and used
 only for authorized Government business by authorized personnel.
 Unauthorized access or use of this computer system may subject violators to
 criminal, civil, and/or administrative action.

 All information on this computer system may be intercepted, recorded, read,
 copied... There is no right of privacy in this system.</pre>
</blockquote>
<p><a href="http://www.ncbi.nlm.nih.gov/"><img title="US-NLM-NCBI" src="http://mokaspetridish.files.wordpress.com/2011/07/us-nlm-ncbi-logo.png?w=109&amp;h=135" alt="" width="109" height="135" /></a>Don’t worry about the scary message, this is all public data… well <a href="http://www.ncbi.nlm.nih.gov/About/news/09may2011">until the funding stops</a>. Take a look in the blast/db directory for many pre-formatted databases NCBI has provided, i.e. genomic &amp; protein reference sequences, patent nucleotide sequence databases from USPTO &amp; EU/Japan Patent Agencies. Get yourself the latest BLAST+ from blast/executables/LATEST , I used <a href="ftp://ftp.ncbi.nlm.nih.gov/blast/executables/LATEST/ncbi-blast-2.2.25+-universal-macosx.tar.gz" target="_blank">ncbi-blast-2.2.25+-universal-macosx.tar.gz</a> .</p>
<p>Installation:</p>
<blockquote>
<pre>mokas$ tar zxvpf ncbi-blast-2.2.25+-universal-macosx.tar.gz 
mokas$ PATH=/Users/mokas/Desktop/ncbi-blast-2.2.25+/bin
mokas$ export PATH
mokas$ echo $PATH
/Users/mokas/Desktop/ncbi-blast-2.2.25+/bin
mokas$ mkdir ./blast-2.2.25+/db
mokas$ blastn -help
USAGE
  blastn [-h] [-help] [-import_search_strategy filename]
...</pre>
</blockquote>
<p>Databases should be loaded directly into /db directory created above with the mkdir command. The last thing that needs to be done is to make a “.ncbirc” text file in the main directory containing the following:</p>
<pre>[BLAST]
BLASTDB=/Users/mokas/Desktop/ncbi-blast-2.2.25+/db</pre>
<p>This will guide the program to where data is being kept. At the end of the day we should hope to get something like this:</p>
<blockquote>
<pre>mokas$ blastn -query Homo_sapiens.NCBI36.apr.rna.fa -db refseq_rna
BLASTN 2.2.25+
...
Query=  ENST00000361359 ncrna:Mt_rRNA chromosome:NCBI36:MT:650:1603:1
gene:ENSG00000198714
Length=954
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

ref|XR_109154.1|  PREDICTED: Homo sapiens hypothetical LOC1005054...   464    5e-128

&gt;ref|XR_109154.1| PREDICTED: Homo sapiens hypothetical LOC100505479 (LOC100505479),
partial miscRNA
Length=266

 Score =  464 bits (251),  Expect = 5e-128
 Identities = 255/257 (99%), Gaps = 0/257 (0%)
 Strand=Plus/Minus

Query  334  CACCTGAGTTGTAAAAAACTCCAGTTGACACAAAATAGACTACGAAAGTGGCTTTAACAT  393
            |||||||||||||||||||||||||||| |||||||| ||||||||||||||||||||||
Sbjct  257  CACCTGAGTTGTAAAAAACTCCAGTTGATACAAAATAAACTACGAAAGTGGCTTTAACAT  198</pre>
</blockquote>
<div id="attachment_666"><a href="http://mokaspetridish.files.wordpress.com/2011/07/picture-2.png"><img title="Screenshot" src="http://mokaspetridish.files.wordpress.com/2011/07/picture-2.png?w=500&amp;h=312" alt="" width="500" height="312" /></a></div>
<div>BLAST+ in action.</div>
<p>Much thanks are in order to Dr. Tao Tao of NCBI</p>
<blockquote><p>A cross-post by Mo from <a href="http://petridishtalk.com/">petridishtalk.com</a></p>
<p>Citations: <a href="http://www.ncbi.nlm.nih.gov/books/NBK52640/" target="_blank">Standalone BLAST Setup for Unix – BLAST® Help – NCBI Bookshelf</a></p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.bioplustech.com/2012/02/try-it-youll-like-it/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>October 2011 bio+tech</title>
		<link>http://www.bioplustech.com/2011/10/5/</link>
		<comments>http://www.bioplustech.com/2011/10/5/#comments</comments>
		<pubDate>Wed, 19 Oct 2011 22:19:50 +0000</pubDate>
		<dc:creator>Brad</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.bioplustech.com/?p=5</guid>
		<description><![CDATA[Tuesday, October 18, 2010 Time: 7:00pm Event Page: Eventbrite Location: Giordano Bros. 3108 16th St, SF, CA (note, this is a brand new Giordano Bros – NOT the one on Columbus Ave in North Beach!) Contact: windmiller[at]gmail[dot]com Please RSVP!  The Scoop: On September 13th we had &#8230; <a href="http://www.bioplustech.com/2011/10/5/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<h2></h2>
<p><strong>Tuesday, October 18, 2010</strong><br />
<strong>Time</strong>: 7:00pm<br />
<strong>Event Page: </strong><a href="http://bioplustech.eventbrite.com/">Eventbrite</a><strong><br />
</strong><strong>Location</strong>: Giordano Bros. <a href="http://maps.google.com/maps?q=3108+16th+St,+SF,+CA&amp;hl=en&amp;sll=37.0625,-95.677068&amp;sspn=84.142201,191.513672&amp;vpsrc=0&amp;z=17">3108 16th St, SF, CA</a> (note, this is a brand new Giordano Bros – NOT the one on Columbus Ave in North Beach!)<br />
<strong>Contact</strong>: windmiller[at]gmail[dot]com<br />
<a href="http://bioplustech.eventbrite.com/">Please RSVP! </a></p>
<p><strong>The Scoop:</strong> On September 13th we had a fantastic kick-off for Bio+Tech at Giordano Bros in the Mission in SF! What a great turnout, and from eveyone I interacted with, I’d have to say that I was very impressed with the diversity of ideas, skill sets and people. Once again in October we’ll be teaming with our friends from Hackers and Founders to bring even more great people in to the mix! We hope you can make it – come, grab a beer and meet some fantastic new folks or even future collaborators!</p>
<p>As a note, many of you asked if there could be a medical and/or healhtcare slant to the meet up – of course! It’s all one big ecosystem – from consumer health, to new biopharma, to delivery models, and genomics – there’s tons of room for all.</p>
<p>We’ll be hosting this at the new Giordano Bros in the Mission – they have a shop in North Beach already, but this place is brand new!  Nothing like having an entrepreneur event actually hosted in a start-up!</p>
<p>Please <a href="http://bioplustech.eventbrite.com/">RSVP here at Eventbrite</a> so we know how many people to expect!  Thanks!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.bioplustech.com/2011/10/5/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

