<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: The Perfect DB Storage Array</title>
	<atom:link href="http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/feed/" rel="self" type="application/rss+xml" />
	<link>http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/</link>
	<description>Thought stream from SmugMug's CEO &#38; Chief Geek</description>
	<pubDate>Sun, 20 Jul 2008 11:31:06 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6-bleeding2</generator>
		<item>
		<title>By: Glen Shok</title>
		<link>http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-95699</link>
		<dc:creator>Glen Shok</dc:creator>
		<pubDate>Wed, 26 Dec 2007 17:30:05 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-95699</guid>
		<description>Hey, although I am a marketing puke (with a highly technical background) I came by this blog post and found it interesting. We have an array that has been used by very large MySQL database shops and we have implemented features specifically for DB (1MB stripes) among others. I know this is an obvious plug, but you may want to take a look at it. http://www.pillardata.com

If you want to yell at me for posting this: gshok@pillardata.com.  :-)

And to add to Tao's post - I worked with a defense contractor to implement something similar, using cheap 1U servers and a GFS like Lustre...this solution also works well and makes the storage and servers disposable when implemented correctly. Although I work for an array vendor, this is a very elegant solution that is resilient, but needs a lot of care and feeding when using open systems stuff like Lustre. There are some "private" GFS vendors out there that have better uptime.

Good luck. And Happy New Year!</description>
		<content:encoded><![CDATA[<p>Hey, although I am a marketing puke (with a highly technical background) I came by this blog post and found it interesting. We have an array that has been used by very large MySQL database shops and we have implemented features specifically for DB (1MB stripes) among others. I know this is an obvious plug, but you may want to take a look at it. <a href="http://www.pillardata.com" rel="nofollow">http://www.pillardata.com</a></p>
<p>If you want to yell at me for posting this: <a href="mailto:gshok@pillardata.com">gshok@pillardata.com</a>.  <img src='http://blogs.smugmug.com/don/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p>And to add to Tao&#8217;s post - I worked with a defense contractor to implement something similar, using cheap 1U servers and a GFS like Lustre&#8230;this solution also works well and makes the storage and servers disposable when implemented correctly. Although I work for an array vendor, this is a very elegant solution that is resilient, but needs a lot of care and feeding when using open systems stuff like Lustre. There are some &#8220;private&#8221; GFS vendors out there that have better uptime.</p>
<p>Good luck. And Happy New Year!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Looking for the perfect DB storage array for your hot Web 2.0 company? &#171; Hardware</title>
		<link>http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-72382</link>
		<dc:creator>Looking for the perfect DB storage array for your hot Web 2.0 company? &#171; Hardware</dc:creator>
		<pubDate>Fri, 19 Oct 2007 17:16:58 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-72382</guid>
		<description>[...] read more &#124; digg story [...]</description>
		<content:encoded><![CDATA[<p>[...] read more | digg story [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: SmugBlog: Don MacAskill &#187; Blog Archive &#187; Dell MD3000 - Great DAS DB Storage</title>
		<link>http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-70270</link>
		<dc:creator>SmugBlog: Don MacAskill &#187; Blog Archive &#187; Dell MD3000 - Great DAS DB Storage</dc:creator>
		<pubDate>Tue, 02 Oct 2007 00:02:34 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-70270</guid>
		<description>[...] So I&#8217;ve written about storage before, specifically our quest for The Perfect DB Storage Array and how Sun&#8217;s storage didn&#8217;t stack up with their excellent servers. As you can probably tell, I spend a lot of my time thinking about and investigating storage - both small-and-fast for our DBs and huge-and-slower (like S3) for our photos. [...]</description>
		<content:encoded><![CDATA[<p>[...] So I&#8217;ve written about storage before, specifically our quest for The Perfect DB Storage Array and how Sun&#8217;s storage didn&#8217;t stack up with their excellent servers. As you can probably tell, I spend a lot of my time thinking about and investigating storage - both small-and-fast for our DBs and huge-and-slower (like S3) for our photos. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tao Shen</title>
		<link>http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-52472</link>
		<dc:creator>Tao Shen</dc:creator>
		<pubDate>Tue, 19 Jun 2007 02:17:50 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-52472</guid>
		<description>Don:

one last thing: you should look into  Iwill H4103 barebone...plug 4 dual core Opterons in there gives you 8 cores, 16 slots mem, and use the PCI-X expansion for your fibre channel connection to DAS.  I think it's cheaper than buying Sun's boxes, and smaller 1U profile.</description>
		<content:encoded><![CDATA[<p>Don:</p>
<p>one last thing: you should look into  Iwill H4103 barebone&#8230;plug 4 dual core Opterons in there gives you 8 cores, 16 slots mem, and use the PCI-X expansion for your fibre channel connection to DAS.  I think it&#8217;s cheaper than buying Sun&#8217;s boxes, and smaller 1U profile.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tao Shen</title>
		<link>http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-52385</link>
		<dc:creator>Tao Shen</dc:creator>
		<pubDate>Mon, 18 Jun 2007 09:30:35 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-52385</guid>
		<description>oh just to add to my post above, no RAID DAS boxes needed in that setup, just cheap drives.  You are basically running an architecture similar to google's fs and bigtable over 1000s of cheap 1U nodes.  In terms of cost and scalability, which way is better.

Since google bought youtube, I was wondering if google will convert youtube's db architecture to use GoogleFS and Bigtable instead of master-N slave mysql clusters.</description>
		<content:encoded><![CDATA[<p>oh just to add to my post above, no RAID DAS boxes needed in that setup, just cheap drives.  You are basically running an architecture similar to google&#8217;s fs and bigtable over 1000s of cheap 1U nodes.  In terms of cost and scalability, which way is better.</p>
<p>Since google bought youtube, I was wondering if google will convert youtube&#8217;s db architecture to use GoogleFS and Bigtable instead of master-N slave mysql clusters.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tao Shen</title>
		<link>http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-52384</link>
		<dc:creator>Tao Shen</dc:creator>
		<pubDate>Mon, 18 Jun 2007 09:25:51 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-52384</guid>
		<description>Hi,Don:

Nice blog, and nice to see someone with experience posting their quest for nice database IO storage subsystems.

I have been thinking about this same problem for a while now, what do you think about the following architecture?

Using a OpenSSI cluster for your compute nodes and run mysqld on the SSI over say 24 nodes of cheap commodity 1Udual core CPUs(don't know if MySQL or PostgreSQL would scale horizontally that way) 

and then have another cluster of cheap 1U dual core system with 4 commodity hotswappable hard drives(SATA even) and run either lustre fs, or hadoop file system.  You can choose to use Infiniband switch if you want, but dual GigE over say 24 nodes will give you 250MB/s*24=6GB/s theoretical IO bandwidth.  In the case of TSUBAME(japan's supercomputer) they use lustrefs, and hadoop file system is simply a open source google file system.  I don't know how much IO smugmug currently require, but linearly scaling IO bandwidth over GigE sounds a good solution

oh, only if I had 1000 nodes to test this idea LOL</description>
		<content:encoded><![CDATA[<p>Hi,Don:</p>
<p>Nice blog, and nice to see someone with experience posting their quest for nice database IO storage subsystems.</p>
<p>I have been thinking about this same problem for a while now, what do you think about the following architecture?</p>
<p>Using a OpenSSI cluster for your compute nodes and run mysqld on the SSI over say 24 nodes of cheap commodity 1Udual core CPUs(don&#8217;t know if MySQL or PostgreSQL would scale horizontally that way) </p>
<p>and then have another cluster of cheap 1U dual core system with 4 commodity hotswappable hard drives(SATA even) and run either lustre fs, or hadoop file system.  You can choose to use Infiniband switch if you want, but dual GigE over say 24 nodes will give you 250MB/s*24=6GB/s theoretical IO bandwidth.  In the case of TSUBAME(japan&#8217;s supercomputer) they use lustrefs, and hadoop file system is simply a open source google file system.  I don&#8217;t know how much IO smugmug currently require, but linearly scaling IO bandwidth over GigE sounds a good solution</p>
<p>oh, only if I had 1000 nodes to test this idea LOL</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mfc</title>
		<link>http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-51798</link>
		<dc:creator>mfc</dc:creator>
		<pubDate>Wed, 13 Jun 2007 23:25:50 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-51798</guid>
		<description>Our write to read ratio is 2:1.
We are also considering flash drives for redo logs.
Sandisk (M-Systems) FFD35 U3S 8 P80 which is
around 2k a pop!</description>
		<content:encoded><![CDATA[<p>Our write to read ratio is 2:1.<br />
We are also considering flash drives for redo logs.<br />
Sandisk (M-Systems) FFD35 U3S 8 P80 which is<br />
around 2k a pop!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Timmy J</title>
		<link>http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-51772</link>
		<dc:creator>Timmy J</dc:creator>
		<pubDate>Wed, 13 Jun 2007 17:40:44 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-51772</guid>
		<description>I've read through this 3 times; great stuff.  I'm curious:  What is your approximate ratio of writes to reads?

(I have a situation where we have tons of writes compared to much fewer reads.  But it's critical the reads have the lowest possible latency.  We're considering flash drives.  Any thoughts?)</description>
		<content:encoded><![CDATA[<p>I&#8217;ve read through this 3 times; great stuff.  I&#8217;m curious:  What is your approximate ratio of writes to reads?</p>
<p>(I have a situation where we have tons of writes compared to much fewer reads.  But it&#8217;s critical the reads have the lowest possible latency.  We&#8217;re considering flash drives.  Any thoughts?)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jimmy B</title>
		<link>http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-50927</link>
		<dc:creator>Jimmy B</dc:creator>
		<pubDate>Wed, 06 Jun 2007 01:03:56 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-50927</guid>
		<description>Hi Don, good stuff.  It seems the question of disk size comes up quite a bit.  With disk sizes getting upwards of 300 GB now, what are your thoughts on the performance on these larger drives....Vendors will tell you that they are just as fast but I don't entirely trust the larger spindles...

What size and speed of drives do you guys lean towards?</description>
		<content:encoded><![CDATA[<p>Hi Don, good stuff.  It seems the question of disk size comes up quite a bit.  With disk sizes getting upwards of 300 GB now, what are your thoughts on the performance on these larger drives&#8230;.Vendors will tell you that they are just as fast but I don&#8217;t entirely trust the larger spindles&#8230;</p>
<p>What size and speed of drives do you guys lean towards?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Hostgator &#187; Blog Archive &#187; Looking for the perfect DB storage array for your hot Web 2.0 company?</title>
		<link>http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-50184</link>
		<dc:creator>Hostgator &#187; Blog Archive &#187; Looking for the perfect DB storage array for your hot Web 2.0 company?</dc:creator>
		<pubDate>Mon, 28 May 2007 09:43:05 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.smugmug.com/don/2007/04/27/the-perfect-db-storage-array/#comment-50184</guid>
		<description>[...] We do billions of queries per day in our DBs, so we have very specific requirements for what our storage does. It turns out YouTube has very similar requirements, so I wrote up a nice, detailed list of things to watch for if you&#8217;re a startup looking to scale your web application.read more &#124; digg story [...]</description>
		<content:encoded><![CDATA[<p>[...] We do billions of queries per day in our DBs, so we have very specific requirements for what our storage does. It turns out YouTube has very similar requirements, so I wrote up a nice, detailed list of things to watch for if you&#8217;re a startup looking to scale your web application.read more | digg story [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>
