<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>RightScale Blog &#187; AWS</title>
	<atom:link href="http://blog.rightscale.com/tag/aws/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.rightscale.com</link>
	<description>Cloud Management News &#38; Conversations</description>
	<lastBuildDate>Fri, 03 Feb 2012 19:06:00 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='blog.rightscale.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>RightScale Blog &#187; AWS</title>
		<link>http://blog.rightscale.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://blog.rightscale.com/osd.xml" title="RightScale Blog" />
	<atom:link rel='hub' href='http://blog.rightscale.com/?pushpress=hub'/>
		<item>
		<title>Ending the Year with a bang!  5 new clouds managed by RightScale</title>
		<link>http://blog.rightscale.com/2011/12/19/ending-the-year-with-a-bang-5-new-clouds-managed-by-rightscale/</link>
		<comments>http://blog.rightscale.com/2011/12/19/ending-the-year-with-a-bang-5-new-clouds-managed-by-rightscale/#comments</comments>
		<pubDate>Mon, 19 Dec 2011 17:42:28 +0000</pubDate>
		<dc:creator>Shivan Bindal</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[Releases]]></category>
		<category><![CDATA[Datapipe]]></category>
		<category><![CDATA[MultiCloud]]></category>
		<category><![CDATA[Rackspace]]></category>
		<category><![CDATA[Release]]></category>
		<category><![CDATA[Softlayer]]></category>

		<guid isPermaLink="false">http://blog.rightscale.com/?p=1092</guid>
		<description><![CDATA[What a year it&#8217;s been!  We&#8217;ve released a lot of really cool features, including a MultiCloud API and many MultiCloud ServerTemplates.  To round out the year, last week, we launched 5 new public clouds that are available on the RightScale &#8230; <a href="http://blog.rightscale.com/2011/12/19/ending-the-year-with-a-bang-5-new-clouds-managed-by-rightscale/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=1092&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>What a year it&#8217;s been!  We&#8217;ve released a lot of really cool <a href="http://support.rightscale.com/18-Release_Notes/01-RightScale_Dashboard">features</a>, including a <a href="http://support.rightscale.com/18-Release_Notes/01-RightScale_Dashboard/2011-08-25#MultiCloud_API">MultiCloud API</a> and many <a href="http://support.rightscale.com/18-Release_Notes/ServerTemplates_and_RightImages/Current">MultiCloud ServerTemplates</a>.  To round out the year, last week, we launched 5 new public clouds that are <a href="http://support.rightscale.com/18-Release_Notes/01-RightScale_Dashboard/Current">available</a> on the RightScale MultiCloud Management Platform: AWS South America in São Paulo, Datapipe, Logicworks, SoftLayer and Rackspace UK.  These new clouds offer choice for our users when they ask where workloads should be launched on the cloud.  With these latest additions, we span a total of 8 geographic areas with additional presence in Amsterdam, Dallas, Hong Kong, London, New York, Sao Paolo, Seattle, Singapore and Washington DC.</p>
<p><a href="http://rightscale.files.wordpress.com/2011/12/screen-shot-2011-12-16-at-1-10-19-pm.png"><img class="aligncenter size-full wp-image-1110" title="Public Clouds" src="http://rightscale.files.wordpress.com/2011/12/screen-shot-2011-12-16-at-1-10-19-pm.png?w=600" alt=""   /></a>These clouds have been in the works for a little while, and I&#8217;m pleased they are now available in the RightScale platform for our customers.  When we integrate with a given cloud, we work hard to ensure a seamless experience across all the clouds we support.  We provide a generic interface to each of the clouds integrated within RightScale.  This is not to limit functionality from the clouds themselves; but rather to ensure all that cool functionality is usable.  If I&#8217;m using SoftLayer and Datapipe, I don&#8217;t want to deal with different storage solutions like volumes or instance based storage (or at least not until I&#8217;m ready to optimize the storage).  Likewise, keep networking off my plate&#8230;I don&#8217;t care whether it&#8217;s security groups or ip tables.  Just make that infrastructure stuff work so that my app can run.</p>
<p>As a user, I want to  easily port what I have in one resource pool to another resource pool.  For this purpose, RightScale has generic constructs for things like instances, instance types, images, volumes, volume snapshots, etc, that are exposed in our dashboard.  Then, in our ServerTemplates (stay tuned by the way, a release is imminent), we use <a href="http://support.rightscale.com/09-Clouds/Multi_Cloud/Chef/Introduction_to_Chef">chef</a> to abstract features for individual ServerTemplates that work, albeit very differently, across different resource pools.  Using the above example, someone launching servers in SoftLayer&#8217;s Amsterdam cloud and Datapipe&#8217;s Hong Kong cloud doesn&#8217;t have to worry about the differences between network configuration and storage management.  You can launch an entire 3-tier PHP architecture on both environments using ServerTemplates from the <a href="http://www.rightscale.com/library/">MultiCloud Marketplace</a>.  We&#8217;ll take care of dealing with instance based storage in Amsterdam and set up the proper security groups for you in Hong Kong through the platform.</p>
<h3>Why does RightScale spend so much time touting &#8216;MultiCloud&#8217; and why should anyone care?</h3>
<p>It&#8217;s a good question to ask actually.  I spend a lot of my time working with service providers and various companies looking to deliver infrastructure as a service for public consumption.   A number of people, our existing customers included, come to us and say &#8220;hey, I know I will have multiple clouds (if I don&#8217;t already)…help me make that happen.&#8221;  Analysts also agree &#8211; Forrester&#8217;s <a href="http://www.forrester.com/rb/analyst/holger_kisker">Holger Kisker</a> touts &#8220;<a href="http://blogs.forrester.com/holger_kisker/11-12-13-10_cloud_predictions_for_2012">multi cloud becomes the norm</a>&#8221; as his number 1 cloud computing prediction for 2012.</p>
<p>It&#8217;s real.  And it&#8217;s great validation for being the leader in &#8216;MultiCloud Management&#8221;.</p>
<p>Perhaps even more interesting (and contradictory if you think about it) is that the service providers say the same thing!  We describe how RightScale offers clouds to consumers and the choice consumers have to use what works best for their business needs.  And, IaaS providers are more than happy (okay, some take it as a challenge to deliver an even better service for their users. <img src='http://s1.wp.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> ).  In truth though, they recognize that cloud is a heterogeneous environment.  A single customer will use more than one cloud offering in a single environment.  Cost is one factor, and another I hear often is performance.  In some cases, geographic location is important and they &#8220;can&#8217;t get there with their current IaaS provider.&#8221;  It&#8217;s an opportunity for some to seize, and we&#8217;re partnering with the best to deliver the multi-cloud solutions our customers want.</p>
<p>Within RightScale, you can use any or all of the following clouds &#8211; all the Amazon regions, SoftLayer, Rackspace Cloud across US and UK, Datapipe, Logicworks as well as private cloud management with CloudStack and Eucalyptus.</p>
<p>I encourage you to <a href="http://support.rightscale.com/18-Release_Notes/01-RightScale_Dashboard/Current">click</a> and <a href="https://my.rightscale.com/session/new">try</a> the new clouds on RightScale.  Use a new app or an existing one that&#8217;s already in cloud and as always, let us know <a href="http://feedback.rightscale.com/forums/94975-customer-feedback">what you think</a>.</p>
<br />Filed under: <a href='http://blog.rightscale.com/category/aws/'>AWS</a>, <a href='http://blog.rightscale.com/category/cloud-computing/'>Cloud Computing</a>, <a href='http://blog.rightscale.com/category/ec2/'>EC2</a>, <a href='http://blog.rightscale.com/category/releases/'>Releases</a> Tagged: <a href='http://blog.rightscale.com/tag/aws/'>AWS</a>, <a href='http://blog.rightscale.com/tag/cloud-computing/'>Cloud Computing</a>, <a href='http://blog.rightscale.com/tag/datapipe/'>Datapipe</a>, <a href='http://blog.rightscale.com/tag/ec2/'>EC2</a>, <a href='http://blog.rightscale.com/tag/multicloud/'>MultiCloud</a>, <a href='http://blog.rightscale.com/tag/rackspace-2/'>Rackspace</a>, <a href='http://blog.rightscale.com/tag/release/'>Release</a>, <a href='http://blog.rightscale.com/tag/softlayer/'>Softlayer</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/1092/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/1092/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/1092/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/1092/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/1092/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/1092/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/1092/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/1092/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/1092/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/1092/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/1092/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/1092/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/1092/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/1092/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=1092&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2011/12/19/ending-the-year-with-a-bang-5-new-clouds-managed-by-rightscale/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/5286276bd31176caab87da92fed6a73d?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">shivanbindal</media:title>
		</media:content>

		<media:content url="http://rightscale.files.wordpress.com/2011/12/screen-shot-2011-12-16-at-1-10-19-pm.png" medium="image">
			<media:title type="html">Public Clouds</media:title>
		</media:content>
	</item>
		<item>
		<title>RightScale Launches 3 Millionth Server</title>
		<link>http://blog.rightscale.com/2011/09/20/rightscale-launches-3-millionth-server/</link>
		<comments>http://blog.rightscale.com/2011/09/20/rightscale-launches-3-millionth-server/#comments</comments>
		<pubDate>Tue, 20 Sep 2011 14:58:02 +0000</pubDate>
		<dc:creator>Michael Crandell</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Cloud.com]]></category>
		<category><![CDATA[Eucalyptus]]></category>
		<category><![CDATA[Rackspace]]></category>

		<guid isPermaLink="false">http://blog.rightscale.com/?p=1017</guid>
		<description><![CDATA[Here at RightScale, we&#8217;ve just passed the 3 million server milestone.  Driven by our growing customer and free-user base, and their ever-increasing cloud usage, the 3M mark represents a benchmark in the industry, and is noteworthy in three different ways. &#8230; <a href="http://blog.rightscale.com/2011/09/20/rightscale-launches-3-millionth-server/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=1017&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Here at RightScale, we&#8217;ve just passed the 3 million server milestone.  Driven by our growing customer and free-user base, and their ever-increasing cloud usage, the 3M mark represents a benchmark in the industry, and is noteworthy in three different ways.</p>
<p>First, 3 million is impressive in the data center business.  Many well-known hosting companies house between 50,000 and 100,000 servers, and estimates for the world&#8217;s largest computer companies with large data centers range up to 1 million.  (See the DataCenterKnowledge report <a href="http://www.datacenterknowledge.com/archives/2011/08/15/ovh-is-largest-web-host-with-100000-servers/">here</a>.)  It&#8217;s difficult to compare our statistic with these installations, since many may be running largely under a pre-cloud operational model.  Nevertheless, launching 3 million is quite a number by any comparative metric, and there’s no question that it was achieved only with new levels of automation and dynamic configuration that are core to RightScale.</p>
<p>The second reason 3M is worth noting has to do with how fast we got there.   After our founding in 2007, it took us about 27 months to reach 1M, another 12 months to reach 2M, and then just 6 months to reach 3M.  That&#8217;s more than twice as fast for each subsequent 1M servers.  Likewise, one year ago in Sept. 2010, we had launched 1.5M servers – and we doubled in the last 12 months.</p>
<p>The third reason this milestone matters is that the servers our users launch have increased in power, and persist for a longer duration, as each month passes.  In fact, since January this year server runtime has increased on average 30%. So the trend is clear: companies are running &#8220;bigger iron&#8221; in the cloud &#8212; and keeping it running longer &#8212; than ever before.  Here is a graph of the size distribution we recorded this summer:</p>
<p style="text-align:center;"><a href="http://rightscale.files.wordpress.com/2011/09/servers1.jpg"><img class="size-full wp-image-1023 aligncenter" title="Servers" src="http://rightscale.files.wordpress.com/2011/09/servers1.jpg?w=600" alt=""   /></a></p>
<p>Certainly, the growth rate we’re tracking for the quantity, power and longevity of servers launched on RightScale remains quite healthy and mirrors the broad adoption of cloud services industry-wide. But equally important is the range of customers driving this growth, representing a wide variety of industries, use cases and services powered by RightScale on the cloud. For example, during the last year:</p>
<ul>
<li>media giant <a href="http://www.rightscale.com/info_center/videos/testimonial-pearson-education.php">Pearson</a> converted a traditional educational software offering to a SaaS based model that allowed faster onboarding of new customers;</li>
<li>consumer goods company <a href="http://www.rightscale.com/info_center/videos/use-case-american-girl.php">American Girl</a> (a division of Mattel) launched their virtual world with a major advertising push behind it and sailed smoothly through the holiday season;</li>
<li>online game company <a href="http://www.rightscale.com/customers/zynga-grows-to-1-social-gaming-site-with-rightscale.php">Zynga</a> launched new games that consistently broke records;</li>
<li>and companies like <a href="http://www.rightscale.com/info_center/videos/nyc2011-gali-reznik-amdocs.php">Amdocs</a> and <a href="http://www.rightscale.com/info_center/videos/nyc2011-tim-jones-trader-media.php">Trader Media</a> spoke at our <a href="http://www.rightscale.com/conference/">User Conference</a> last June about new enterprise services launched on both public and hybrid clouds.</li>
</ul>
<p>All of these RightScale customers contributed toward the 3M milestone, and we continue to be dazzled by the solutions they achieve using cloud infrastructure. We’re looking forward to the next million servers launched by our customers, and the amazing services they’ll power with them.</p>
<br />Filed under: <a href='http://blog.rightscale.com/category/aws/'>AWS</a>, <a href='http://blog.rightscale.com/category/cloud-computing/'>Cloud Computing</a>, <a href='http://blog.rightscale.com/category/cloud-com/'>Cloud.com</a>, <a href='http://blog.rightscale.com/category/eucalyptus/'>Eucalyptus</a>, <a href='http://blog.rightscale.com/category/rackspace/'>Rackspace</a> Tagged: <a href='http://blog.rightscale.com/tag/aws/'>AWS</a>, <a href='http://blog.rightscale.com/tag/cloud-computing/'>Cloud Computing</a>, <a href='http://blog.rightscale.com/tag/cloud-com/'>Cloud.com</a>, <a href='http://blog.rightscale.com/tag/eucalyptus/'>Eucalyptus</a>, <a href='http://blog.rightscale.com/tag/rackspace-2/'>Rackspace</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/1017/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/1017/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/1017/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/1017/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/1017/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/1017/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/1017/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/1017/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/1017/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/1017/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/1017/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/1017/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/1017/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/1017/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=1017&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2011/09/20/rightscale-launches-3-millionth-server/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/7e16a9f13a9f29a326915856217df94d?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">mcrandell</media:title>
		</media:content>

		<media:content url="http://rightscale.files.wordpress.com/2011/09/servers1.jpg" medium="image">
			<media:title type="html">Servers</media:title>
		</media:content>
	</item>
		<item>
		<title>Performing Security Testing in the Cloud</title>
		<link>http://blog.rightscale.com/2011/08/04/performing-security-testing-in-the-cloud/</link>
		<comments>http://blog.rightscale.com/2011/08/04/performing-security-testing-in-the-cloud/#comments</comments>
		<pubDate>Thu, 04 Aug 2011 15:00:02 +0000</pubDate>
		<dc:creator>Phil Cox</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[Security]]></category>

		<guid isPermaLink="false">http://blog.rightscale.com/?p=889</guid>
		<description><![CDATA[[This is Phil Cox's first blog post since he joined us as Director of Security and Compliance. We hope to have more from him to post in the near future! -Thorsten] Security testing is one aspect of a security program &#8230; <a href="http://blog.rightscale.com/2011/08/04/performing-security-testing-in-the-cloud/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=889&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>[This is Phil Cox's first blog post since he joined us as Director of Security and Compliance. We hope to have more from him to post in the near future! -Thorsten]</p>
<p>Security testing is one aspect of a security program that is often overlooked. Organizations who take security seriously understand that testing systems and applications is just smart business. We felt that one way we could help our customers is to describe the process, and nuances, that we go through during our testing. Since RightScale runs in the cloud, the information should help any RightScale customer accomplish the same tasks on their environment.</p>
<p>Our process is basically broken down into the following steps:</p>
<ol>
<li>Identify instances and applications that will be tested</li>
<li>Select tools and systems that will be used to perform the testing</li>
<li>Coordinate with the cloud service provider to get authorization for testing</li>
<li>Execute the test</li>
<li>Communicate the results</li>
</ol>
<p>Below I have outlined some of the practical details of each of these steps.</p>
<p><strong>Identify Targets</strong></p>
<p>Before we start testing, we identify what we want to test. For this particular test, we decided that we would include all of the systems that make up our platform, as well as the main dashboard application. Since we use RightScale to manage RightScale, and one of the main functions of our service is using ServerTemplates™ and RightScripts™ to ensure that systems are deployed consistently, there was a temptation to select a representative sample.</p>
<p>Since this was my first time testing RightScale since becoming the Director of Security and Compliance, we decided to test them all. We figured it is good practice, and provided a “validation” of sorts that we were following the practices we champion. We did however decide to limit the testing to publicly addressable AWS IP addresses. (Note: Anyone trying to be PCI compliant in AWS will likely need to test private IPs as well.)</p>
<p>As for the application, we decided on the entire dashboard, and not just a portion (mostly because I wanted a good overview to have as a baseline).</p>
<p><strong>Select Testing Tools</strong></p>
<p>Along with determining which systems/instances and applications we were testing, we selected tools that would help us automate the testing. We had agreed that a primarily automated vulnerability test (with manual validation) was acceptable, but that the application scanning would require a more manual approach given the complexity of our application. To that end, we had the following basic selection criteria:</p>
<ul>
<li>Vulnerability scanner: Number one criterion was its ability to appropriately identify vulnerabilities. We did not want a lot of false positives, but felt that false negatives would be much worse. A second criterion for the vulnerability scanner, was the flexibility of its reporting mechanism.</li>
<li>Application testing: Number one criterion was our ability to use it, not what others think of it. A second criterion for the application testing tool was its ability to test against the framework of our application.</li>
</ul>
<p>Given those “requirements” we chose three vulnerability scanners that we wanted to evaluate, in hopes of selecting one as the foundation for our ongoing testing program. Those were SAINT, NeXpose, and OpenVAS. Many will point out that there are other tools out there, and I agree, but these were tools I personally have history with, and one is free. We had to start somewhere.</p>
<p>As far as the application testing, I have used Burp Pro for a number of years and am a fan of it, and selected that as an application testing tool of choice. It should be noted that a number of other tools have recently come out that may rival Burp Pro in its functionality, but familiarity of use was important. We wanted to test the application, not the tool.</p>
<p><strong>Where to Run Them?</strong></p>
<p>Once we determined the tools that we wanted to use, we had to figure out where we wanted to run them:</p>
<ul>
<li>SaaS</li>
<li>Instance in the same cloud</li>
<li>Instance in a different cloud</li>
<li>Traditional hosting environment</li>
<li>Physical system on our network</li>
</ul>
<p>We chose the “Instance in the same cloud” for a couple of reasons:</p>
<ul>
<li>Flexibility: We were able to install multiple tools to evaluate and test</li>
<li>Eating our own dog food: RightScale is all about configuring and managing systems, so what better way for us to help our customers be able to deploy scanning systems than to do it ourselves</li>
<li>Bandwidth cost: By using an instance within the same availability zones on AWS, bandwidth was not an issue</li>
<li>Access to internal IPs: By running in the same cloud (AWS region) we can test internal IP addresses</li>
</ul>
<p>Once we decided to build our own, we downloaded a trial version of SAINT, the community version of NeXpose, and followed the Ubuntu installation directions for OpenVAS. Then we wrote some RightScripts to automate the majority of the install and we were “cooking with gas” so to speak.</p>
<p><strong>Get Authorization from Cloud Provider</strong></p>
<p>Once we identified all our instances we were going to test, and had our testing sources (one in our case), per the AWS usage agreement, we needed to get authorization from AWS to perform the testing.<br />
AWS provides a <strong><a title="Penetration Testing" href="http://aws.amazon.com/security/penetration-testing/" target="_blank">form</a></strong> that we filled out to request penetration testing of instances. We had to supply the AWS instance IDs and IPs that we obtained earlier, as well as the source of the testing. AWS uses this to create a ticket that AWS security team will get, and subsequently white list the account so the IDS systems are not triggering alerts during the testing. This prevents getting nasty emails about policy violation as well as port blocking, which would affect the test results.</p>
<p>AWS security responded back within a couple of days with approval for the scanning. It is interesting to note that it appears it is the vulnerability scanning that this applies to, for all intents and purposes you should make this request for application-based scanning as well, but it’s been my experience that testing the application does not cause abuse reports to be generated within AWS. During the testing, launching and relaunching of the scanner we did accidentally perform a number of scans from an IP address other than the one we provided to AWS and we did receive two abuse notices.</p>
<p>Probably the biggest point to note with respect to testing instances running in AWS is that instance size must be medium or greater. AWS policy does not allow pen testing, including port/service scanning, of smalls or below, presumably because they want to avoid that the testing degrades the other VMs on the same host. It should be noted, that we were just testing in AWS, depending on your cloud service provider, what you need to provide as far as what you are testing will vary. For AWS, we provided the instance ID as well as the public IP that will be tested, and the source of the testing.</p>
<p>For AWS, the quickest way to get the list of all AWS instance IDs and associated IPs is to use the <strong><a href="http://rubygems.org/gems/rest_connection" target="_blank">rest_connection</a></strong> API. It can be used to programmatically generate a list of the instances and associated IP addresses that will be the targets of testing. We ignored the security groups in this test and hit all the “well known ports” that the tools scan. An alternative would be to only test the accessible ports.</p>
<p><strong>Execute the Test</strong></p>
<p>Once we obtained the authorization for the testing, we coordinated with the ops team to make sure they were ready for any potential problems. Once we got their “we are a go” signal, we commenced the testing. The general methodology looked something like this:</p>
<ol>
<li>A sequential vulnerability scan, using each of the scanners. For both SAINT and NeXpose, we utilized the “exploit” portion of the tools (when it existed) on any noted vulnerability. (Note that we performed multiple scans with each scanner over the course of our 3 weeks of testing.)</li>
<li>General walk through and Burp Pro “passive” testing of the entire dashboard. Attempting to get an overall feel for the testing tool with the dashboard, and basically doing a full manual spider of the site.</li>
<li>Next we specifically performed testing of our session state mechanism, looking for entropy, manipulation, and injection flaws.</li>
<li>We then stepped through each of the dashboard’s main function areas, “Reports,” “Manage,” “Design,” “Clouds” and “Settings,” looking for well-known attack vectors. In particular focusing on identifying Cross Site Scripting and Request Forgers (XSS and CSRF), Injection, parameter manipulation, and other common web app exposures. See the <strong><a title="OWASP testing guide" href="https://www.owasp.org/index.php/OWASP_Testing_Guide_v3_Table_of_Contents" target="_blank">OWASP testing guide</a></strong> for a good discussion of things that should be tested for in web applications.</li>
</ol>
<p>Note that all testing we performed was done in both an authenticated state as well as an unauthenticated state.</p>
<p>As stated earlier, we made the decision that the vulnerability scanning portion of our testing would be mostly automated, and the application testing mostly manual. It took us approximately 3 weeks to identify the systems, get the authorization, and perform the testing. About 2 weeks of that was dedicated to the manual app testing.</p>
<p><strong>A Bit More on the Application</strong></p>
<p>It could be argued, that the bulk of “cloud” security testing should revolve around the application. This is not to say that making sure supporting services like Apache and MySQL versions are patched is not important (it is, just ask Sony), but meaning that much of the exposure to your data will come through the application. Taking the time to assess the mechanisms protecting the application is critical. For example:</p>
<ul>
<li>Are the security groups appropriate?</li>
<li>Do you have appropriate controls on who can access API calls or make security related changes via the UI?</li>
<li>Does your authorization mechanism enforce appropriate controls via all interfaces?</li>
</ul>
<p>Items like these are things that will be critical for long-term protection of information. Make sure that you include them in your testing regiment.</p>
<p><strong>Communicate Results</strong></p>
<p>We are an Agile shop, so frequent communication is part of our culture, and we leveraged that to provide feedback from the testing to the appropriate engineering or ops teams as we uncovered potential threats. This allowed us to create records of our testing results, as well as provided timely information to be fed into our sprint process. At the completion of the testing, we wriote a summary report and included details of the vulnerabilities from each of the tools as appendices. Even though the information is already fed into the appropriate groups, including details along with the final report allowed stakeholders the ability to review the overall testing methodology and findings, as well as dig down into the details of any vulnerabilities found.</p>
<p>Your process may vary, and you may have a much more formal reporting requirement. The most important part is to get the appropriate information to the people who can get the system services or applications fixed in a timely manner.</p>
<p><strong>Summary</strong></p>
<p>The process of identifying targets, maintaining testing tools, coordinating with cloud service providers, and communicating those results should be formalized within your organization. Security testing should become an integral part of the IT culture. There will always be issues, as nothing is absolutely secure, but trying to stay ahead of the curve is a worthy cause. With a formal process, you can make it a regular occurrence, thus enhancing your security program and likely meeting many practical as well as compliance requirements.</p>
<p>One side note about the testing is that for all practical purposes, it was exactly the same methodology and tools that I have used previously in non-cloud environments. So I encourage you to roll up your sleeves and implement a testing program for your infrastructure and applications.</p>
<br />Filed under: <a href='http://blog.rightscale.com/category/aws/'>AWS</a>, <a href='http://blog.rightscale.com/category/cloud-computing/'>Cloud Computing</a>, <a href='http://blog.rightscale.com/category/ec2/'>EC2</a> Tagged: <a href='http://blog.rightscale.com/tag/aws/'>AWS</a>, <a href='http://blog.rightscale.com/tag/cloud-computing/'>Cloud Computing</a>, <a href='http://blog.rightscale.com/tag/ec2/'>EC2</a>, <a href='http://blog.rightscale.com/tag/security/'>Security</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/889/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/889/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/889/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/889/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/889/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/889/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/889/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/889/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/889/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/889/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/889/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/889/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/889/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/889/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=889&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2011/08/04/performing-security-testing-in-the-cloud/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/87686a76345286ba2c47e76dc2eb7dea?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">philcoxrs</media:title>
		</media:content>
	</item>
		<item>
		<title>AWS outage follow-up: if you wanted details, you got details!</title>
		<link>http://blog.rightscale.com/2011/05/02/aws-outage-follow-up-if-you-wanted-details-you-got-details/</link>
		<comments>http://blog.rightscale.com/2011/05/02/aws-outage-follow-up-if-you-wanted-details-you-got-details/#comments</comments>
		<pubDate>Mon, 02 May 2011 16:04:13 +0000</pubDate>
		<dc:creator>Thorsten</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[Outage]]></category>

		<guid isPermaLink="false">http://blog.rightscale.com/?p=794</guid>
		<description><![CDATA[A week after the April 21st 2011 outage AWS posted a detailed post mortem explanation of what happened. It&#8217;ll be interesting to see how everyone digests the very detailed account. Since AWS did not provide an executive summary I&#8217;ll try &#8230; <a href="http://blog.rightscale.com/2011/05/02/aws-outage-follow-up-if-you-wanted-details-you-got-details/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=794&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A week after the April 21st 2011 outage AWS posted a <a href="http://aws.amazon.com/message/65648/">detailed post mortem explanation</a> of what happened. It&#8217;ll be interesting to see how everyone digests the very detailed account. Since AWS did not provide an executive summary I&#8217;ll try my hand at one:</p>
<blockquote><p>The outage was triggered by an operator error during a router upgrade which funneled very high-volume network traffic into a low-bandwidth control network used by EBS (Elastic Block Store). The resulting flooding of the control network caused a large number of EBS servers to be effectively isolated from one another, which broke the volume replication, and caused these servers to start re-replicating the data to fresh servers. This large-scale re-replication storm in turn had two effects: it failed in many cases causing the volumes to go offline for manual intervention, and it flooded the EBS control plane with re-replication events that affected its operation across the entire us-east region.</p>
<p>The steps taken by AWS to regain control started by stopping the re-replication attempts to quiesce the system and prevent new volumes from being drawn into the outage. AWS then isolated the affected availability zone from the EBS control plane to restore normal operation in other zones. Finally, AWS started to recover volumes by adding storage capacity to allow the re-replication to succeed where possible, by restoring data from snapshots on S3, and finally by manually restoring data. Ultimately 0.07% of the volumes could not be restored to a consistent state.</p>
<p>The Relational Database Service RDS was also affected by the outage. 45% of single-availability-zone databases in the affected availability zone went down because each database server stripes data across multiple EBS volumes with the result that one stuck volume halts the entire database. A number of multi-AZ  RDS databases whose  master server was in the affected zone failed to fail-over because of a bug in the fail-over process.</p>
<p>The post mortem lists a number of system improvements that AWS is working on. These primarily target improving the resiliency of EBS when replication fails as well as improving the tools created and used during the outage to recover from the situation. Customer communication improvements, especially regarding the frequency of updates, are also listed and AWS is crediting affected users a significant fraction of this month&#8217;s charges, this way beyond anything covered in its SLAs.</p></blockquote>
<p>It is interesting to see how a network configuration error caused such a chain reaction within the EBS system. The outage trigger really is pretty incidental, a similar set of events could have probably been triggered by something else as well. The measures taken by AWS to contain and repair the outage highlight the deep technical expertise and full mastery of the entire software and hardware stack at AWS. Clearly deep code changes were made and sophisticated recovery tools were written 24&#215;7 under the pressure of the outage, without which the situation most likely would have spun completely out of control.</p>
<p>The impact of the outage, the public reaction, and the measures necessary to control it show the scale at which AWS operates. It is pretty clear that this type of outage is part of growing the service to unprecedented scale. I find it amazing that this type of outage, where the sophisticated systems necessary to provide cloud computing at scale fail massively hasn&#8217;t happened years ago. This is a testament to AWS&#8217;s sophistication.</p>
<p>The outage summary exposes interesting technical details about the architecture of the services that AWS has kept confidential until now, however, more than providing information to competitors I believe that it provides education to cloud customers. All cloud providers who are planning world-wide cloud roll-outs absolutely must understand the power of and the need for availability zones in a region and isolation between regions (or equivalent constructs to &#8220;differentiate&#8221; from AWS). Without that redundancy and isolation, it has now become crystal clear: &#8220;how can we sell that to customers?&#8221;</p>
<p>An aspect of EBS durability which is not often mentioned is the role of snapshots during recovery. The <a href="http://aws.amazon.com/ebs/">EBS product description</a> states &#8220;the durability of your volume depends both on the size of your volume and the percentage of the data that has changed since your last snapshot.&#8221; Here&#8217;s what this means. Suppose there are two copies of the volume (i.e. mirroring) and one fails, then a fresh mirror can fetch data contained in snapshots from S3 (which is itself replicated) but must retrieve other data from the single remaining copy, which may itself fail or become unreachable. Sadly the performance impact of taking a snapshot is such that most of our customers with high volume database cannot snapshot the master DB volume. Please fix that AWS!</p>
<p>An item missing from the remedies list in my opinion is EBS performance improvement. Better performance would have helped in the outage. Specifically I&#8217;d like AWS to reduce the impact of snapshots on volume performance so customers can actually snapshot high-volume servers and improve the performance of volumes so customers don&#8217;t have to stripe across multiple volumes which reduces availability (as it did with RDS).</p>
<p>I also am not satisfied with the communication improvements AWS proposes. I was fine with the frequency of status updates because it was clear that the EBS team was on top of it and didn&#8217;t have much new to report. I would like to see improved responsiveness so we don&#8217;t have to open a ticket before something shows up on the status page. But foremost I would like better content in the status updates. I&#8217;d like to be constructive, so I&#8217;ll make it concrete. Here is some of what I would have liked to see (I naturally have to make some assumptions about what was concluded when within AWS):</p>
<ul>
<li>explicit mention that the initial network event was contained, status updates kept talking about &#8220;increased latencies&#8221;, which made it unclear whether there was a general ongoing network issue</li>
<li>clear statement that the outage revolved around EBS and noting the impact on launching servers from EBS images, but also stating that there was no impact on servers not using EBS</li>
<li>clear statement that certain API calls were disabled instead of vaguely referring to &#8220;increased error rates affecting EBS CreateVolume API calls&#8221;</li>
<li>timely reporting, e.g., the post mortem states &#8220;by 5:30 AM PDT, error rates and latencies again increased for EBS API calls across the Region&#8221; while the status updates only mentioned this at 7am</li>
<li>the fact that the outage was due to failed EBS volumes as opposed to just connectivity or latency issues accessing the volumes was only reported at 8:54am, yet this is crucial piece of information</li>
<li>the status updates never made it clear that EBS volumes continued to fail after the initial event, nor did they mention when this infection was halted</li>
<li>the isolation of the other availability zones from the &#8220;affected one&#8221; was reported several hours after it was put in place</li>
<li>it would have been useful to see some relative numbers, such as % of volumes deemed operational, % being recovered automatically soon, % slated for later manual recovery; best would have been emails to users with specific volume IDs</li>
</ul>
<p>I&#8217;m sure that some of the items above weren&#8217;t quite as obvious at the time and in the heat of the moment it&#8217;s always difficult to determine what to say. But there is no question that the status updates were filled with vague terms, such as &#8220;increased latencies&#8221;, &#8220;moderate increase in error rates&#8221;, &#8220;affected availability zone&#8221;, &#8220;a network event&#8221;, etc. Perhaps foremost it&#8217;s not until 8 hours after the onset of the outage that AWS made it clear that volumes in the affected zone weren&#8217;t going to return to normal for hours to come. Up to that point it seemed that everything could return to normal any minute. This lack of clarity made it much harder for users to take the right decisions promptly.</p>
<p>On the public reaction front, while I understand it, I&#8217;m still baffled by reporters stating that the loss of 0.07% of volumes as not recoverable is a fundamental problem. This is equivalent to complaining about users losing data because their RAID array failed (happens all the time from operator error to 6ft drop due to earthquake). Users that lost data and were not aware of the risk they were taking need to seriously reflect on what they&#8217;re doing (and get help as appropriate).</p>
<p>This episode provides a key lesson to all cloud companies regarding architecting to withstand failure, and communicating with customers when failures do occur. While RightScale got through the outage relatively unscathed, we are working to improve on both those fronts ourselves. And we intend to continue to work with customers to enable AWS as well as other providers with independent, best-practice solutions that are resilient and highly available.</p>
<br />Filed under: <a href='http://blog.rightscale.com/category/aws/'>AWS</a>, <a href='http://blog.rightscale.com/category/cloud-computing/'>Cloud Computing</a>, <a href='http://blog.rightscale.com/category/ec2/'>EC2</a> Tagged: <a href='http://blog.rightscale.com/tag/aws/'>AWS</a>, <a href='http://blog.rightscale.com/tag/cloud-computing/'>Cloud Computing</a>, <a href='http://blog.rightscale.com/tag/ec2/'>EC2</a>, <a href='http://blog.rightscale.com/tag/outage/'>Outage</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/794/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/794/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/794/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/794/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/794/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/794/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/794/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/794/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/794/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/794/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/794/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/794/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/794/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/794/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=794&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2011/05/02/aws-outage-follow-up-if-you-wanted-details-you-got-details/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0b359e9ce1a8fb5f3315fbec8beda697?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">TvE</media:title>
		</media:content>
	</item>
		<item>
		<title>Amazon EC2 outage: summary and lessons learned</title>
		<link>http://blog.rightscale.com/2011/04/25/amazon-ec2-outage-summary-and-lessons-learned/</link>
		<comments>http://blog.rightscale.com/2011/04/25/amazon-ec2-outage-summary-and-lessons-learned/#comments</comments>
		<pubDate>Mon, 25 Apr 2011 15:27:47 +0000</pubDate>
		<dc:creator>Thorsten</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[Outage]]></category>

		<guid isPermaLink="false">http://blog.rightscale.com/?p=765</guid>
		<description><![CDATA[Last Thursday&#8217;s Amazon EC2 outage was the worst in cloud computing&#8217;s history. It made the front page of many news pages, including the New York Times, probably because many people were shocked by how many web sites and services rely &#8230; <a href="http://blog.rightscale.com/2011/04/25/amazon-ec2-outage-summary-and-lessons-learned/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=765&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Last Thursday&#8217;s Amazon EC2 outage was the worst in cloud computing&#8217;s history. It made the front page of many news pages, including the New York Times, probably because many people were shocked by how many web sites and services rely on EC2. Seeing so much affected was a very graphical illustration of how pervasive cloud computing has become.</p>
<p>I will try to summarize what happened, what worked and didn&#8217;t work, and what to learn from it. I&#8217;ll do my best to add signal to all the noise out there, in that respect I liked <a href="http://twitter.com/#!/Beaker/status/61552829613092865">a tweet by Beaker</a> (Christofer Hoff): &#8220;Happy with my decision NOT to have written a blog about the misfortune of AWS, stating nothing but the obvious &amp; sounding like a muppet&#8221;.</p>
<h2>Executive summary</h2>
<ul>
<li>The Amazon cloud proved itself in that sufficient resources were available world-wide such that many well-prepared users could continue operating with relatively little downtime. But because Amazon&#8217;s reliability has been incredible, many users were not well-prepared leading to widespread outages. Additionally, some users got caught by unforseen failure modes rendering their failure plans ineffective.</li>
<li>Some ripple effects within EC2 and in particular EBS caused by the initial failure should not have happened. There&#8217;s important work Amazon needs to do to prevent such occurrences.</li>
<li>Amazon&#8217;s communication, while better than during previous outages still earns an F. This is probably the #1 threat to AWS&#8217;s business.</li>
<li>The cloud architecture provides ample opportunities to design systems to withstand failures. The material cost of such designs is a fraction of what comparative measures would cost using traditional hosting means. However, designing, building, and testing everything is not cheap. Many of our customers who used our best practices fared well (I&#8217;m not claiming we&#8217;re perfect or that everything is automatic!) and we got numerous calls from other companies that were wholly unprepared.</li>
<li>Overall this is just one of many bumps in the cloud computing road. It reminds us that this is still &#8220;day one&#8221; of the cloud and that we all have much to learn about building and operating robust systems on a large scale. We are receiving a stream of calls from EC2 users that realize they need help in setting up a more robust architecture for their systems.</li>
</ul>
<h2>Outage analysis</h2>
<p>At the time of writing Amazon has not yet posted a root cause analysis. I will update this section when they do. Until then, I have to make some educated guesses.</p>
<p>We got the first alerts at 1:01am on Thursday, the proverbial Christmas lights lit up indicating I/O issues on a large number of our servers. We started failing servers over and opened a ticket with Amazon. They finally posted a status message at 1:41am containing no useful details, sadly this is a typical sequence of events.</p>
<p>It appears that a major network failure was the initial cause of problems but that the real damage happened when EBS (Elastic Block Store) volume replication was disrupted. We did some extrapolations and concluded that there must have been on the order of 500k EBS storage volumes in the affected availability zone. It appears that a significant fraction of the volumes concluded that the replication mirroring was out-of-sync and started re-replicating causing further havoc, including an overload of the EBS control plane. It is also possible that the EBS replication problem was the root cause and that the network issues were a consequence, hopefully Amazon&#8217;s root cause analysis will shed light on this.</p>
<p>The biggest problem, from my point of view, was that more than one availability zone was affected. We didn&#8217;t see servers or volumes fail in other zones but we were unable to create fresh volumes elsewhere, which of course makes it difficult to move services. This is &#8220;not supposed to happen&#8221; and is an indication that the EBS control plane has dependencies across zones. Amazon did manage to contain the problem to one zone approx 3 hours after the onset.</p>
<p>After Amazon managed to contain the problems to one zone, it took a very long time to get the EBS machinery under control and to recover all the volumes. Given the extrapolated number of volumes it would not be surprising that an event of this scale exceeded the design parameters and was never tested (or able to be tested). I&#8217;m not sure there is any system of comparable scale in operation anywhere.</p>
<p>I do want to state that while &#8220;something large&#8221; clearly failed, namely the EBS system as a whole, the real big failure is that multiple availability zones were affected for ~3 hours. I also want to mention two important things that didn&#8217;t fail: we didn&#8217;t see capacity constraints in relaunching servers in other zones after the initial cross-zone issues and we didn&#8217;t see other regions affected at all. This is clearly good news!</p>
<h2>Amazon communication failure</h2>
<p>In my opinion the biggest failure in this event was Amazon&#8217;s communication, or rather lack thereof. The status updates were far too vague to be of much use and there was no background information whatsoever. Neither the official AWS blog nor Werner Vogels&#8217; blog had any post whatsoever 4 days after the outage! Here is a list of improvements for Amazon:</p>
<ul>
<li>Do not wait 40 minutes to post the first status message!</li>
<li>Do not talk about &#8220;a small percentage of instances/volumes/&#8230;&#8221;, give actual percentages! Those of us with many servers/volumes care whether it&#8217;s 1% or 25%, we will take different actions.</li>
<li>Do not talk about &#8220;the impacted availability zone&#8221; or &#8220;multiple availability zones&#8221;, give each zone a name and refer to them by name (I know that zone 1a in each account refers to a different physical zone, so give each zone a second name so I can look it up).</li>
<li>Provide individualized status information: use email (or other means) to tell us what the status of our instances and volumes is. I don&#8217;t mean things I can get myself like cpu load or such, but information like &#8220;the following volumes (by id) are currently recovering and should be available within the next hour, the following volumes will require manual intervention at a later time, &#8230;&#8221;. That allows users to plan and choose where to put their efforts.</li>
<li>Make predictions! We saw volumes in the &#8220;impacted availability zone&#8221; getting taken out many hours after the initial event. I&#8217;m sure you knew that the problem was still spreading and could have warned everyone. Something like: &#8220;we recommend you move all servers and volumes that are still operating in the impacted availability zone [sic] to a different zone or region as the problem is still spreading.&#8221;</li>
<li>Provide an overview! Each status update should list which functions are still affected and which have been repaired, don&#8217;t make everyone scan back through the messages and try to infer what the status of each function is.</li>
<li>Is it so hard to write a blog post with an apology and some background information, even if it&#8217;s preliminary? AWS tweeters that usually send multiple tweets per day remained silent. I&#8217;m sure there&#8217;s <em>something</em> to talk about 24 hours after the event! Don&#8217;t you want to tell everyone what they should be thinking instead of having them make it up???</li>
</ul>
<h2>Coverage from around the web</h2>
<p>Since Amazon did not communicate much of substance beyond the rather sparse and obscure status updates everyone else was left to speculate. Most of the blog posts or news articles contained little information. Here&#8217;s a list of blog posts that I found interesting:</p>
<ul>
<li><a href="http://blogs.gartner.com/lydia_leong/2011/04/21/amazon-outage-and-the-auto-immune-vulnerabilities-of-resiliency/">Amazon outage and the auto-immune vulnerabilities of resiliency</a> by Lydia Leong at Gartner. An early post during the outage that has a good overall analysis.</li>
<li><a href="http://www.geekwire.com/2011/amazoncoms-real-problem-outage-communication">Amazon.com’s real problem isn’t the outage, it’s the communication </a>by Keith Smith from BigDoor.</li>
<li><a href="http://justinsb.posterous.com/aws-down-why-the-sky-is-falling">AWS is down: Why the sky is falling </a>by Justin Santa Barbara. One of the early posts with technical information.</li>
<li><a href="http://blog.pagerduty.com/2011/04/22/standing-on-the-shoulders-of-giants-and-stumbling-with-them-the-amazon-aws-outages-pain-statistics/">Standing on the shoulders of giants and stumbling with them – the Amazon AWS outage’s “pain” statistics </a>by PagerDuty (a service we happily use) with nice stats about alerts going out during the outage.</li>
<li><a href="http://freedb2.com/2011/04/21/cloud-crash-has-a-silver-lining/">Cloud crash has a silver lining </a>by Leon Katsnelson from IBM about DB2 and replication.</li>
<li><a href="http://news.ycombinator.com/item?id=2472120">nostromo comment on news.ycombinator.com</a> (look for nostromo&#8217;s comment if it&#8217;s not at the top anymore), a brief description of the pain with RDS.</li>
<li><a href="http://agilesysadmin.net/ec2-outage-lessons">Today’s EC2 / EBS Outage: Lessons learned</a> by Stephen Nelson-Smith. One of the first good lessons learned posts I saw.</li>
<li><a href="http://www.twilio.com/engineering/2011/04/22/why-twilio-wasnt-affected-by-todays-aws-issues/">Why Twilio Wasn’t Affected by Today’s AWS Issues</a> has some interesting recommendations on how to architect for failure.</li>
<li><a href="http://broadcast.oreilly.com/2011/04/the-aws-outage-the-clouds-shining-moment.html">The AWS Outage: The Cloud&#8217;s Shining Moment</a> by George Reese of Enstratus has a very nice analysis of what designing for failure means and how it contrasts with more traditional approaches.</li>
<li><a href="http://www.nytimes.com/2011/04/23/technology/23cloud.html">Amazon’s Trouble Raises Cloud Computing Doubts</a> on the front-page of the New York Times business section.</li>
<li><a href="http://blog.dotcloud.com/working-around-the-ec2-outage">Working around the EC2 outage</a> by Jérôme Petazzoni of dotcloud with an interesting account of the issues they faced.</li>
</ul>
<h2>Lessons learned</h2>
<p>Our services team handled 4x the incident volume last Thursday compared to a normal Thursday. A large number of callers needed help in assessing the situation or in bringing their servers back up. A typical request was: &#8220;It looks like my db server is down due to the outage, can you help confirm and assist with a migration?&#8221; Unfortunately we also heard from a good number of users who were using a single availability zone or didn&#8217;t set up redundancy properly. Hindsight is always 20-20.</p>
<p>A clear lesson for everyone is obviously that backup and replication have to be taken seriously (duh). In EC2 this means live replication across multiple availability zones and backups to S3 (and ideally elsewhere also). It has also become clear that a minimum of replicas must be running and a certain degree of over-provisioning is necessary to handle the load spike after a massive failure. Adrian Cockroft from Netflix summarized their strategy in <a href="https://twitter.com/#%21/adrianco/status/61089202229624832">a tweet</a> a while ago: &#8220;Deploy in three AZ with no extra instances &#8211; target autoscale 30-60% util. You have 50% headroom for load spikes. Lose an AZ -&gt; 90% util.&#8221; (Also see the <a href="http://news.ycombinator.com/item?id=2470773">discussion around the tweet</a>.) Users that relied on launching fresh servers or on creating fresh volumes from snapshots were not able to do so for several hours. The only previous event that I remember where multiple availability zones were affected was the <a href="http://status.aws.amazon.com/s3-20080720.html">July 20th 2008 S3 outage</a> that took down S3 in the US and EU (multiple <em>regions</em>!).</p>
<p>A number of blogs mention NoSQL databases as a solution to the replication and failure difficulties with traditional relational databases. While we&#8217;ve started to use <a href="http://cassandra.apache.org/">Cassandra</a> ourselves it has become pretty clear to me that this is not a silver bullet by a long shot. When a single node fails the built-in replication and recovery functions well, although the extra load on remaining nodes is high when the failing node is repaired and resynchronizes. But when large numbers of nodes in the cluster lock-up one-by-one over the course of an hour, I&#8217;d be hesitant to make a prediction about the outcome both in terms of the cluster&#8217;s availability and its consistency. We have two applications that make very different use of Cassandra and the behavior of the database is very different in both cases. My conclusion from what I have observed thus far is that clusters of replicated eventually-consistent NoSQL stores have pretty complex dynamics that can easily lead to unpleasant surprises. Sometimes it&#8217;s nice to have a comparatively simple MySQL master-slave set-up that experiences some downtime during the fail-over but acts very predictably.</p>
<p>I can&#8217;t help but feel uncomfortable about the performance of Amazon&#8217;s RDS &#8220;database-as-a-service&#8221; in that some databases that were replicated across multiple availability zones did not fail-over properly. It evidently took more than 12 hours to recover a number of the multi-az databases. The obvious failure here is compounded by the fact that Amazon has made it difficult for users to backup their databases outside of RDS, leaving them no choice but to wait for someone at Amazon to work on their database. This lock-in is one reason many of our customers prefer to use our MySQL master-slave setup or to architect their own.</p>
<p>The biggest lesson we learned abut operating RightScale itself is that we have to continue pushing hard on reducing the load on our central MySQL database and distributing our service. The database has grown too big and failover consequently takes too long because it takes forever to load the working set (over 30GB) into memory. We have some short-term measures we will be implementing to reduce the failover time, but more is needed. We also need to provide our users a choice of RightScale systems located in different regions and clouds: users operating primarily out of one region need to be able to use RightScale in an independent region or cloud. Ironically the first thing every public cloud operator and every company with a private cloud asks us is whether we can run RightScale inside their cloud: that seems pretty misguided to me!</p>
<p>We also were confused by Amazon&#8217;s status messages. In hindsight we should have intentionally failed-over our master database which was operating in the &#8220;impacted availability zone&#8221; early on at a time where we could minimize downtime. We were lucky that it didn&#8217;t get affected until about 12 hours after the start of the outage but we didn&#8217;t connect one and one. A clear message from Amazon that more and more volumes were continuing to fail in the zone would have been really helpful.</p>
<h2>What&#8217;s next?</h2>
<p>With Amazon&#8217;s overall stellar operating reliability it is easy to become complacent. This outage was a wake-up call for many of us. What remains to be seen is whether Amazon decides to take a lead and provide more granular descriptions of failure modes and recommended actions or whether they will leave it to everyone else to guess and figure it out. I see this as being one of the main long-term problems of cloud computing, namely that it is extremely difficult for users to list the possible failure modes and even more difficult to actually test any of them.</p>
<p>In the big picture I find <a href="http://www.rackspace.com/information/leadership/lmoorman.php">Lew Moorman</a>&#8216;s analogy in the <a href="http://www.nytimes.com/2011/04/23/technology/23cloud.html?_r=1">NYT article</a> very appropriate: &#8220;The Amazon interruption was the computing equivalent of an airplane crash. It is a major episode with widespread damage. But airline travel is still safer than traveling in a car — analogous to cloud computing being safer than data centers run by individual companies. Every day, inside companies all over the world, there are technology outages, each episode is smaller, but they add up to far more lost time, money and business.” Most of the articles that predict a run away from cloud computing fail to explain where to run to. Unless you can hire superman to run your private datacenters my experience tells me that you&#8217;ll be worse off.</p>
<br />Filed under: <a href='http://blog.rightscale.com/category/aws/'>AWS</a>, <a href='http://blog.rightscale.com/category/cloud-computing/'>Cloud Computing</a>, <a href='http://blog.rightscale.com/category/ec2/'>EC2</a> Tagged: <a href='http://blog.rightscale.com/tag/aws/'>AWS</a>, <a href='http://blog.rightscale.com/tag/cloud-computing/'>Cloud Computing</a>, <a href='http://blog.rightscale.com/tag/ec2/'>EC2</a>, <a href='http://blog.rightscale.com/tag/outage/'>Outage</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/765/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/765/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/765/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/765/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/765/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/765/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/765/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/765/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/765/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/765/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/765/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/765/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/765/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/765/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=765&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2011/04/25/amazon-ec2-outage-summary-and-lessons-learned/feed/</wfw:commentRss>
		<slash:comments>49</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0b359e9ce1a8fb5f3315fbec8beda697?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">TvE</media:title>
		</media:content>
	</item>
		<item>
		<title>RightScale Release: 25 ServerTemplates across 66 RightImages</title>
		<link>http://blog.rightscale.com/2011/03/29/rightscale-release-25-servertemplates-across-66-rightimages/</link>
		<comments>http://blog.rightscale.com/2011/03/29/rightscale-release-25-servertemplates-across-66-rightimages/#comments</comments>
		<pubDate>Tue, 29 Mar 2011 13:00:36 +0000</pubDate>
		<dc:creator>Darryl Eaton</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[Cloud.com]]></category>
		<category><![CDATA[Rackspace]]></category>
		<category><![CDATA[Releases]]></category>
		<category><![CDATA[RightImage]]></category>
		<category><![CDATA[RightScale]]></category>
		<category><![CDATA[ServerTemplate]]></category>

		<guid isPermaLink="false">http://blog.rightscale.com/?p=665</guid>
		<description><![CDATA[Is it Christmas? It seems the RightScale elves have been very busy. Luckily for you, many of our elves have PhDs or masters degrees in Computer Science&#8230; or just many years behind a unix prompt. Lucky for us, they don&#8217;t &#8230; <a href="http://blog.rightscale.com/2011/03/29/rightscale-release-25-servertemplates-across-66-rightimages/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=665&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Is it Christmas? It seems the RightScale elves have been very busy. Luckily for you, many of our elves have PhDs or masters degrees in Computer Science&#8230; or just many years behind a unix prompt. Lucky for us, they don&#8217;t look like elves.</p>
<p><em>Let&#8217;s see what&#8217;s under the tree&#8230;</em></p>
<h3>New Database Managers</h3>
<p><a href="http://rightscale.files.wordpress.com/2011/03/mysql1.gif"><img class="alignright size-thumbnail wp-image-697" title="mysql" src="http://rightscale.files.wordpress.com/2011/03/mysql1.gif?w=120&#038;h=66" alt="" width="120" height="66" /></a>Our <a title="MySQL Cloud Database Manager" href="http://www.rightscale.com/library/server_templates/All?search%5Badvanced_search%5D=&amp;search%5Bfilter_value%5D=Database+Manager+with+MySQL&amp;x=18&amp;y=16&amp;search%5Bfilter_type%5D=title&amp;search%5Bprice%5D=&amp;search%5Border%5D=date_desc">Database Manager for MySQL</a> is one of our most popular ServerTemplates, serving as the rock-solid foundation of many of our customer&#8217;s deployments. With this release, we&#8217;ve expanded our MySQL 5.1 manager to support CentOS as well as Ubuntu.</p>
<p><a href="http://rightscale.files.wordpress.com/2011/03/sql_server_2008_logo.png"><img class="alignright size-thumbnail wp-image-698" title="sql_server_2008_logo" src="http://rightscale.files.wordpress.com/2011/03/sql_server_2008_logo.png?w=120&#038;h=74" alt="Microsoft SQL Server 2008" width="120" height="74" /></a>We&#8217;re also happy to bring this same foundation to our Microsoft Windows customers, with our new <a title="Microsoft SQL Server Cloud Database Manager" href="http://www.rightscale.com/library/server_templates/All?search%5Badvanced_search%5D=&amp;search%5Bfilter_value%5D=Database+Manager+with+MS+SQL+Server&amp;x=20&amp;y=7&amp;search%5Bfilter_type%5D=title&amp;search%5Bprice%5D=&amp;search%5Border%5D=date_desc">Database Manager for MS SQL Server</a>. This new ServerTemplate automates the provisioning and operation of SQL Server, and includes scheduled backups, assisted restoration, and database monitoring. The template supports both SQL Server 2005 on Windows 2003 and SQL Server 2008 on Windows 2008.</p>
<h3>Windows ServerTemplates on Rackspace</h3>
<p>We have been working closely with Rackspace to expand our support to include ServerTemplates. We&#8217;re making Windows ServerTemplates available first. Our <a title="RightScale Windows ServerTemplate" href="http://www.rightscale.com/library/server_templates/All?search%5Badvanced_search%5D=&amp;search%5Bfilter_value%5D=Base+ServerTemplate+for+Windows&amp;x=0&amp;y=0&amp;search%5Bfilter_type%5D=title&amp;search%5Bprice%5D=&amp;search%5Border%5D=date_desc">Base ServerTemplate for Windows</a> includes RightImages for both AWS and Rackspace and can be launched on either cloud. This is a <em>public beta</em>, and we are very interested in your feedback &#8211; please let your account representative know if you plan to experiment with Windows on Rackspace.</p>
<p><a href="http://rightscale.files.wordpress.com/2011/03/amazon_and_rackspace_servertemplate.png"><img class="aligncenter size-full wp-image-703" title="amazon_and_rackspace_servertemplate" src="http://rightscale.files.wordpress.com/2011/03/amazon_and_rackspace_servertemplate.png?w=600&#038;h=165" alt="Amazon and Rackspace RightScale Windows ServerTemplate" width="600" height="165" /></a></p>
<h3>Base and LAMP ServerTemplates for Cloud.com CloudStack</h3>
<p>New CentOS 5.4 RightImages have been released for Cloud.com private cloud customers. We have also released a <a href="http://www.rightscale.com/library/server_templates/LAMP-All-In-One-with-MySQL-5-0/18248">LAMP All-In-One</a> ServerTemplate based on these images. The LAMP template uploads a backup of the database to either AWS S3 or Rackspace CloudFiles. Enterprise customers can contact their account representative for access to these images.</p>
<h3>RightImages and ServerTemplates for EC2 AP-Tokyo</h3>
<p><a href="http://blog.rightscale.com/2011/03/02/rightscale-global-japan/">Following our Dashboard support</a> for the new region, we have released a full set of RightImages (Ubuntu &amp; CentOS) and ServerTemplates (Database Manager for MySQL, HAProxy Load Balancer, App Servers for PHP/Rails/Tomcat, and more) for EC2 AP-Tokyo.</p>
<h3>New Free All-In-One Developer ServerTemplates</h3>
<p>Developer templates are designed for you to quickly plug in your code &amp; database and get up and running on the cloud with a single all-in-one server.</p>
<ul>
<li><a href="http://www.rightscale.com/library/server_templates/All?search%5Badvanced_search%5D=&amp;search%5Bfilter_value%5D=LAMP+All-In-One+with+MySQL+5&amp;x=0&amp;y=0&amp;search%5Bfilter_type%5D=title&amp;search%5Bprice%5D=&amp;search%5Border%5D=date_desc">LAMP All-In-One</a> with MySQL 5.0 or 5.1</li>
<li><a href="LAMP All-In-One WordPress">WordPress All-In-One</a> example</li>
<li><a href="http://www.rightscale.com/library/server_templates/All?search%5Badvanced_search%5D=&amp;search%5Bfilter_value%5D=Rails+AIO+Developer+with+Nginx&amp;x=0&amp;y=0&amp;search%5Bfilter_type%5D=title&amp;search%5Bprice%5D=&amp;search%5Border%5D=date_desc">Rails All-In-One</a> with Nginx and Passenger</li>
<li><a href="http://www.rightscale.com/library/server_templates/All?search%5Badvanced_search%5D=&amp;search%5Bfilter_value%5D=Rails+All-In-One+Mephisto&amp;x=0&amp;y=0&amp;search%5Bfilter_type%5D=title&amp;search%5Bprice%5D=&amp;search%5Border%5D=date_desc">Mephisto All-In-One</a> example</li>
</ul>
<h3>Our First Compatibility Release</h3>
<p>Our first &#8220;<a title="RightScale Compatibility Release" href="http://support.rightscale.com/12-Guides/RightScale_Methodologies/Compatibility_Release">Compatibility Release</a>&#8221; is now generally available (GA). What is a compatibility release, you might ask? Let&#8217;s break it down to two words: <em>compatible</em> and <em>release</em>. We test and <em>release</em> all of our most popular ServerTemplates and RightImages at the same time with the same <a title="RightScale Frozen Repositories" href="http://support.rightscale.com/12-Guides/RightScale_Methodologies/Freezing_Software_Repositories">software repository date</a>. This helps ensure that RightScripts developed for one template and operating system should be <em>compatible</em> with other templates and operating systems that are also part of the release.</p>
<p><a href="http://rightscale.files.wordpress.com/2011/03/rightscale-library-servertemplates-compatible.png"><img class="alignright size-full wp-image-678" title="RightScale Library ServerTemplates Compatible" src="http://rightscale.files.wordpress.com/2011/03/rightscale-library-servertemplates-compatible.png?w=600" alt=""   /></a>Why did we do this? We found that customers mixed and matched pieces from our ServerTemplates to create their own. Great! But without guaranteeing the same software repository or operating system version, their mileage varied. Now customers and partners can develop within a compatibility release, and take full advantage of all of the pre-built configurations we offer. You can see all of the ServerTemplates in the 11H1 Compatibility Release <a title="RightScale Compatibility Release" href="http://www.rightscale.com/library/server_templates/11H1%20Compatible">in the Library</a>. (Hint to publishers: If you build a template within the Compatibility Release, be sure to add the &#8220;11H1 Compatible&#8221; category.)</p>
<h3>New Base ServerTemplates</h3>
<p>It&#8217;s actually easy for you to get most of the benefits of the compatibility release without even thinking about it. The trick is to start with one of our <a href="http://www.rightscale.com/library/server_templates/All?search%5Badvanced_search%5D=&amp;search%5Bfilter_value%5D=base+servertemplate&amp;x=0&amp;y=0&amp;search%5Bfilter_type%5D=title&amp;search%5Bprice%5D=&amp;search%5Border%5D=date_desc">Base ServerTemplates</a> for any custom ServerTemplate development. You&#8217;ll automatically get monitoring, the latest production RightImages, and the ability to pull compatible scripts from any related Compatibility Release templates.</p>
<p><em>and finally, while it was hard to wrap&#8230;</em></p>
<h3>RightScale Experience and Expertise in the Cloud</h3>
<p>About a month ago, a few of our customers started reporting sudden crashes of Ubuntu servers. RightScale worked closely with Canonical to <a href="https://bugs.launchpad.net/ubuntu/+source/linux-ec2/+bug/708920">identify the obscure kernel bug</a> and brainstorm potential solutions. We then ran heavy tests on the Ubuntu 10.04 release with a variety of other kernels. <em>Heavy testing</em> in RightScale equates to multiple automated runs of hundreds of servers performing a variety of different workloads (database, load balancer, application server, etc.). We finally found a kernel that worked, and it is included with the latest Ubuntu RightImages.</p>
<p>With our current scale, we have the <a href="http://www.rightscale.com/news_events/press_releases/2011/rightscale-customers-expand-cloud-workloads.php">visibility</a> and <a href="http://www.rightscale.com/partners/">partnerships</a> to identify and solve major cloud issues. We&#8217;re happy to package this expertise up for all of our customers as we continue to invest in the future of cloud computing.</p>
<p><em>Enjoy the new toys!</em></p>
<p>For a complete description of what was released, visit the <a href="http://support.rightscale.com/18-Release_Notes/ServerTemplates_and_RightImages/Current">Current Release Notes for ServerTemplates and MultiCloud Images</a>.</p>
<br />Filed under: <a href='http://blog.rightscale.com/category/aws/'>AWS</a>, <a href='http://blog.rightscale.com/category/cloud-com/'>Cloud.com</a>, <a href='http://blog.rightscale.com/category/rackspace/'>Rackspace</a>, <a href='http://blog.rightscale.com/category/releases/'>Releases</a> Tagged: <a href='http://blog.rightscale.com/tag/aws/'>AWS</a>, <a href='http://blog.rightscale.com/tag/cloud-com/'>Cloud.com</a>, <a href='http://blog.rightscale.com/tag/rightimage/'>RightImage</a>, <a href='http://blog.rightscale.com/tag/rightscale/'>RightScale</a>, <a href='http://blog.rightscale.com/tag/servertemplate/'>ServerTemplate</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/665/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/665/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/665/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/665/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/665/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/665/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/665/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/665/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/665/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/665/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/665/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/665/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/665/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/665/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=665&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2011/03/29/rightscale-release-25-servertemplates-across-66-rightimages/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/c93598e507723cd07d140484e6da750a?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">darryleaton</media:title>
		</media:content>

		<media:content url="http://rightscale.files.wordpress.com/2011/03/mysql1.gif?w=150" medium="image">
			<media:title type="html">mysql</media:title>
		</media:content>

		<media:content url="http://rightscale.files.wordpress.com/2011/03/sql_server_2008_logo.png?w=150" medium="image">
			<media:title type="html">sql_server_2008_logo</media:title>
		</media:content>

		<media:content url="http://rightscale.files.wordpress.com/2011/03/amazon_and_rackspace_servertemplate.png" medium="image">
			<media:title type="html">amazon_and_rackspace_servertemplate</media:title>
		</media:content>

		<media:content url="http://rightscale.files.wordpress.com/2011/03/rightscale-library-servertemplates-compatible.png" medium="image">
			<media:title type="html">RightScale Library ServerTemplates Compatible</media:title>
		</media:content>
	</item>
		<item>
		<title>ServerTemplates: The Key to RightScale Cloud Management</title>
		<link>http://blog.rightscale.com/2011/03/22/servertemplates-the-key-to-rightscale-cloud-management/</link>
		<comments>http://blog.rightscale.com/2011/03/22/servertemplates-the-key-to-rightscale-cloud-management/#comments</comments>
		<pubDate>Tue, 22 Mar 2011 13:10:25 +0000</pubDate>
		<dc:creator>Thorsten</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Cloud.com]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[Cloud Management]]></category>

		<guid isPermaLink="false">http://blog.rightscale.com/?p=634</guid>
		<description><![CDATA[People often ask us what the biggest innovation is that RightScale offers beyond other forms of server management. The real answer is the integrated approach that puts together all the parts needed to architect multi-server deployments, launch them, monitor and &#8230; <a href="http://blog.rightscale.com/2011/03/22/servertemplates-the-key-to-rightscale-cloud-management/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=634&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>People often ask us what the biggest innovation is that RightScale offers beyond other forms of server management. The real answer is the integrated approach that puts together all the parts needed to architect multi-server deployments, launch them, monitor and manage them, and then cycle back to re-architecting, re-launching, etc. But if there is one aspect of the overall integration that is special to RightScale, it&#8217;s the methodology of dynamic server configuration using ServerTemplates and the ability to publish these ServerTemplates and meter their usage.</p>
<p>We recently took a look at how our customers and ISV partners use ServerTemplates and <a href="http://www.rightscale.com/news_events/press_releases/2011/rightscale-customers-expand-cloud-workloads.php">just published</a> some of our findings in a press release. Among the interesting facts that surfaced, our customers have developed a total of 42,500 RightScale ServerTemplates for Cloud Deployments as of 2011, doubling the number from the previous year. Of those, 42% of the customer-developed ServerTemplates were built from RightScale ServerTemplates and 58% were created from scratch or from partner templates. I was myself intrigued by the sheer number of ServerTemplates and looked a bit into what our users are doing. Of course a good number of the derived ServerTemplates come from customers making improvements to the ones we&#8217;ve published, and many of them are good fuel for our roadmap. But the vast majority of customer-created ServerTemplates are variations that are specific to the customer&#8217;s environment or have very customer-specific software. Looking at these numbers is really a good reminder that while most organizations use standard software, and most sysadmins try to adhere to standard ways for installing and operating this software, the reality for the leading companies is that customization is a vital necessity and RightScale supports just that.</p>
<p>The ServerTemplate methodology is also uniquely suited to supporting multiple cloud implementations. This ability to leverage our and our partners&#8217; ServerTemplate investment has led to the just announced <a href="http://www.rightscale.com/news_events/press_releases/2011/rightscale-announces-global-partnership-tata.php">partnership with Tata Communications</a> where we&#8217;ll support their cloud offering with our multi-cloud ServerTemplates. The Tata clouds are based on Cloud.com&#8217;s system, which means that customers will be able to deploy servers based on the same RightScale ServerTemplate in Amazon EC2, Tata&#8217;s InstaCompute cloud, as well as the customer&#8217;s in-house private Cloud.com cloud (if they have one). What really excites us in enabling this level of portability is that it leads to more business for everyone! Most of our larger EC2 customers have plans for other clouds, whether public clouds by other providers or private clouds, and in every single case the result will be more EC2 consumption because the overall goal isn&#8217;t to move EC2 servers back in-house but to enable more parts of the business to leverage cloud computing!</p>
<p><strong>Dynamic Configuration</strong></p>
<p>The dynamic configuration of servers through ServerTemplates is key to all this. The way we see a server coming together within RightScale is:</p>
<ul>
<li><a href="http://rightscale.files.wordpress.com/2011/03/server_templates_illustration.png"><img class="size-full wp-image-669 alignright" style="margin-left:10px;margin-right:10px;" title="server_templates_illustration" src="http://rightscale.files.wordpress.com/2011/03/server_templates_illustration.png?w=600" alt=""   /></a>The cloud provides a virtualization container, the &#8220;hardware,&#8221; with requested resources for processing, memory, disk, and network</li>
<li>The cloud then boots a machine image which sets up the operating system and the RightLink agent</li>
<li>RightScale runs the set of boot scripts identified in the server&#8217;s ServerTemplate, these perform several types of tasks:
<ul>
<li>install the full complement of software that the server needs to operate,</li>
<li>add additional hardware resources to the server, for example, disk volumes restored from snapshot, assignable ip addresses, etc, and</li>
<li>configure all the software and bring the server into an operational state</li>
</ul>
</li>
<li>RightScale starts collecting monitoring data and generates alerts according to the alert specifications of the ServerTemplate</li>
</ul>
<p>Where we&#8217;ve innovated in the face of the new opportunities offered by cloud computing is in moving the boundary between what&#8217;s cast into the machine image vs. what&#8217;s done dynamically at boot time. Traditionally in the virtualization world there is little difference between an image and a server. An image is just a snapshot of a server, or perhaps its embodiment when the server is not running and has no hardware resources allocated to it. While this methodology works in the cloud as well, it doesn&#8217;t allow the benefits of the cloud to be leveraged.</p>
<p><strong>How to Fully Leverage the Cloud</strong></p>
<p>To fully leverage cloud architecture the role of images has to change. An image has to  be the basis for launching and relaunching many servers. This is how  auto-scaling happens, this is how failing servers get replaced quickly,  this is how IT becomes agile and can spin up or rev servers. The  key insight needed is that fewer things should be cast in the image and  more needs to be customized on demand to each server&#8217;s role and configuration. The truth is that to leverage the cloud we have to recover the operating system as an abstraction layer!</p>
<p>What has happened is that the operating system, the application, and often also the application data are mushed together in machine images. Remember that one of the main roles of an operating system is to abstract the specifics of the hardware and present a portable machine abstraction to applications. We need to preserve this in the cloud so we can replicate and move servers and server configurations!</p>
<p>For example, as a server configuration moves from development to production the hardware (really the virtualization container) can change drastically. It may move from an internal dev cloud to an external production cloud or simultaneously to multiple internal and hosted production clouds around the world. Each time the operating system needs to be able to change while the stack on top of it remains the same. Some of the typical OS changes include: 32-bit vs. 64-bit platforms, uniprocessor vs. multiprocessor, slight changes in OS revision/configuration, changes in disk layout, hypervisor changes, etc.</p>
<p>The best way to recover the abstraction layer is to dedicate the machine image to contain only a cloud-optimized version of the operating system. This is exactly what we do with RightImages. These are base OS installs that we&#8217;ve been producing for years that provide effective abstraction across multiple clouds and virtualization containers. They consist of an optimized operating system install, the RightLink agent, and a few other very common supporting software packages. RightImages serve as the foundation upon which ServerTemplates dynamically assemble the rest of the software stack at boot time. This methodology lets us reliably stand up the same server configuration across a large number of clouds and machine types.</p>
<p>Often users get hung up about what happens at boot time versus what is already present on the image from a performance perspective. The performance of installing software at boot is a non-issue in most cases, but not always, and that doesn&#8217;t really break the model. Many of our customers create their own images that simply have more software pre-installed. Some good examples are pre-installing the version of Java a customer uses across many servers, or pre-installing SharePoint on Windows as it takes a long time to install. The advantage is that booting is faster, but the disadvantage is that a larger portion of the stack becomes more difficult to manage. It helps to remember that you&#8217;re effectively adding this software to the operating system abstraction, so to speak.</p>
<p><a href="http://rightscale.files.wordpress.com/2011/03/server_templates_vs_images.png"><img class="alignright size-medium wp-image-668" title="server_templates_vs_images" src="http://rightscale.files.wordpress.com/2011/03/server_templates_vs_images.png?w=300&#038;h=193" alt="" width="300" height="193" /></a>Keeping the application software separate from the operating system adds great runtime and maintenance flexibility. For example, you can test something out on a running server by adding a new script, making sure it works, committing the ServerTemplate, then rolling forward all running servers to that revision of the ServerTemplate to have it applied immediately. With bundled images, you would have to push the script to all machines manually and re-snapshot an image to use for future instances. Or, If you’re an ISV publisher of a solution on RightScale, and you make a simple bug fix to a script on a ServerTemplate, anyone using that solution can pull in the bug fix onto their existing servers where they may have added additional scripts and applications. When ISVs deliver their software through pre-bundled images, customers have to get the new image with any fixes, then reinstall and re-configure any applications and take a new snapshot &#8211; very tedious! In short, ServerTemplates are flexible like playlists on an iPod, where virtual machine images are like burning and shipping a CD.</p>
<p><strong>RightScale Learnings</strong></p>
<p>A related trend that we&#8217;ve found ourselves pursuing is to be more procedural and less declarative. At first blush a declarative template that states &#8220;this server needs X Y and Z&#8221; may seem better than one that cobbles things together in a script, but we&#8217;ve had relatively little success with that beyond simple demo examples. Disk volumes are a case in point. It may seem natural to specify in the ServerTemplate that it needs a 1TB volume, or perhaps even a volume restored from a specific snapshot (this is what the EC2 API supports). But in reality a server will need to restore data from a snapshot backup, generally the most recent consistent backup of the role the server is about to take on. This ends up involving listing recent snapshots, filtering for those pertaining to the role and that are tagged as being consistent (in many cases several volumes have to be backed up and are then tagged as consistent if all the snapshots complete correctly within the allotted timeout), and then selecting the most recent one. After attaching the volume, it needs to be mounted and appropriate recovery procedures often have to be run. All this ends up being procedural, and the best methodologies we&#8217;ve seen are to use configuration languages that don&#8217;t repeat work that has already been performed and are idempotent, such as Puppet and Chef.</p>
<p><strong>Thoughts on AWS&#8217;s CloudFormation</strong></p>
<p>In this context, it’s interesting that Amazon decided to use a declarative approach in the new CloudFormation (CF) templates. The CF templates are mostly at a different level of abstraction from ServerTemplates in that they focus on putting together multiple servers and their associated resources. We&#8217;ve been using our macro feature for that purpose for some time and have published a number of &#8220;getting started&#8221; macros to set up demo deployments as well as partner macros that setup clusters with software from multiple vendors. We&#8217;ve again found that past the simple examples, we really want to have procedural control over the setup of a server cluster, if only to be able to let the user input some parameters, like the number of app servers, and adjust how everything unfolds accordingly, or handle the cases where some resources already exist and should be used as-is.</p>
<p>Another area where we&#8217;ve taken a broader approach: CloudFormation really only targets setup (of course that&#8217;s the most juicy part where $$ start flowing if you’re the underlying cloud provider <img src='http://s1.wp.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' />  ). Servers also need actions performed on them during operation and often their roles and complexity are no different from initial setup. For example, when you run a RightScale operational script to re-mount the latest production snapshot on a staging database, that&#8217;s really no different at the core from what happened when the server originally launched. Or, if you choose to relaunch the staging server to get the latest snapshot, then the operational script to reconnect all the app servers to the new database is the same as the app server to database connection when all the servers were originally launched. A dynamic runtime configuration system is needed really at all three stages of a server&#8217;s lifespan: launch, runtime, and decommissioning.</p>
<p>What I hope becomes clear through these examples is the value of the integrated approach that RightScale offers. We see tremendous leverage in integrating all the pieces that developers and sysadmins need to architect, deploy, manage, track, audit, and recycle servers in one platform. We are leading the cloud industry in that respect, and we also still have a lot of cool new features under development in the pipeline!</p>
<br />Filed under: <a href='http://blog.rightscale.com/category/aws/'>AWS</a>, <a href='http://blog.rightscale.com/category/cloud-computing/'>Cloud Computing</a>, <a href='http://blog.rightscale.com/category/cloud-com/'>Cloud.com</a>, <a href='http://blog.rightscale.com/category/ec2/'>EC2</a> Tagged: <a href='http://blog.rightscale.com/tag/aws/'>AWS</a>, <a href='http://blog.rightscale.com/tag/cloud-computing/'>Cloud Computing</a>, <a href='http://blog.rightscale.com/tag/cloud-management/'>Cloud Management</a>, <a href='http://blog.rightscale.com/tag/cloud-com/'>Cloud.com</a>, <a href='http://blog.rightscale.com/tag/ec2/'>EC2</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/634/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/634/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/634/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/634/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/634/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/634/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/634/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/634/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/634/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/634/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/634/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/634/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/634/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/634/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=634&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2011/03/22/servertemplates-the-key-to-rightscale-cloud-management/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0b359e9ce1a8fb5f3315fbec8beda697?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">TvE</media:title>
		</media:content>

		<media:content url="http://rightscale.files.wordpress.com/2011/03/server_templates_illustration.png" medium="image">
			<media:title type="html">server_templates_illustration</media:title>
		</media:content>

		<media:content url="http://rightscale.files.wordpress.com/2011/03/server_templates_vs_images.png?w=300" medium="image">
			<media:title type="html">server_templates_vs_images</media:title>
		</media:content>
	</item>
		<item>
		<title>Windows in the Cloud with RightScale</title>
		<link>http://blog.rightscale.com/2010/08/17/windows-in-the-cloud-with-rightscale/</link>
		<comments>http://blog.rightscale.com/2010/08/17/windows-in-the-cloud-with-rightscale/#comments</comments>
		<pubDate>Tue, 17 Aug 2010 10:00:32 +0000</pubDate>
		<dc:creator>Thorsten</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[RightImage]]></category>
		<category><![CDATA[RightLink]]></category>
		<category><![CDATA[RightScale]]></category>
		<category><![CDATA[Windows]]></category>

		<guid isPermaLink="false">http://blog.rightscale.com/?p=576</guid>
		<description><![CDATA[As many of you know we&#8217;ve been working on supporting Windows on par with Linux within RightScale for close to a year. Well, the big moment has finally arrived where we can announce the GA (General Availability) of our Windows &#8230; <a href="http://blog.rightscale.com/2010/08/17/windows-in-the-cloud-with-rightscale/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=576&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>As many of you know we&#8217;ve been working on supporting Windows on par with Linux within RightScale for close to a year. Well, the big moment has finally arrived where we can announce the GA (General Availability) of our Windows support! By &#8220;support&#8221; we mean much more than being able to launch a Windows instance on EC2, which has been possible for a long time, so I thought it worthwhile to expand a little on what it took to make Windows behave nicely in the cloud and what the outcome is.  This is also a good opportunity to examine whether there is a &#8220;Windows handicap&#8221; in the cloud, or whether it works just as smoothly as Linux.</p>
<p>First the upshot: across the board we now support Windows 2003 and 2008 on par with Linux. We provide RightImages that work well with RightScale out of the box, they can be managed in the dashboard, we support extensible monitoring, associated alerts and automation, and we support ServerTemplates with Windows and even provide a few sample templates. Plus, if you have existing images, we provide a RightLink installer with documentation and a starter ServerTemplate that gives you monitoring out of the box. This will give you a good dose of the RightScale love (monitoring, scaling arrays, user access control, etc.).</p>
<p>The mountain of work that we&#8217;ve been chipping away at for the past two weeks is being able to release 40 Windows RightImages. This work wasn&#8217;t what we had imagined, it crept up on us little by little. When we release RightImages, whether it is for Windows or Linux, we strive to provide a consistent software environment across the RightImages of a generation and to ensure they work smoothly with RightScale. Both the Linux and Windows RightImages work great without RightScale as well and many EC2 users have been using Linux RightImages for a long time. Here is what we had to do for the Windows RightImages:</p>
<ul>
<li>install RightLink for integration with RightScale</li>
<li>ensure the images are automatable, meaning psexec can be used, which also involved getting consistent settings for the windows firewall, file sharing and a few other details</li>
<li>install PowerShell 2.0 on all images, including the Windows 2003 ones, because according to everyone we&#8217;ve talked to it&#8217;s a must</li>
<li>ensure all the right DLLs are installed to manage SQL server from PowerShell</li>
<li>increase the size of the root disk for the 2008 images because the std 40GB are insufficient for a lot of interesting software installs (SharePoint being one example), and increase the pagefile to 1.5x ram size (up to 8GB)</li>
<li>ensure the server&#8217;s clock is successfully synchronized to NTP at boot and periodically thereafter</li>
<li>install logic to determine when the server is fully ready after boot before starting any automation, which is not as simple as it may sound due to sysprep, double boot, and an ec2-specific service that runs at boot and sets a few things up</li>
<li>ensure complete install of asp.net on Windows 2003</li>
</ul>
<p>This list kept growing as we discovered issues. In addition, we are supporting all four EC2 regions (us-east, us-west, eu-west, ap-southeast) and it turns out we have to build each image from scratch in each region because it&#8217;s not possible to copy a Windows image from one region to another (licensing restrictions). We are supporting 10 images in each region:</p>
<ul>
<li>Windows 2003 /2008 across i386/x64</li>
<li>Windows w/SQLServer Express 2003 /2008 across i386/x64</li>
<li>Windows w/SQLServer 2003 /2008 on x64</li>
</ul>
<p>Getting to the final images involved a fair number of iterations (we don&#8217; know how many images we built total, and frankly, we&#8217;d rather not be reminded) so we built automation to crank out the images. This makes them consistent and reproducible. Unfortunately, the AWS images we had to start from are not all automatable, so we also had to build some &#8220;intermediate&#8221; images by hand with just a few settings tweaked so we could target them with our automation. Some of our developers ended up with nightmares about clicking through endless circular install dialog boxes!</p>
<p>While we were launching lots of Windows instances for testing we noticed that a fair number had their clock off by a large amount, like days. Digging into the issue we discovered that, unlike with stock AWS Linux images, the Windows wall clock is not synchronized to the virtualization host&#8217;s time and that the initial NTP synchronization doesn&#8217;t always succeed, partially because the Windows NTP service is &#8220;challenged&#8221; but also because it is pointed at a public pool of servers. Since performing automation on a server that thinks it&#8217;s yesterday or tomorrow is a non-starter we concluded that we had to beef up the time synchronization by ensuring that we get an NTP sync before proceeding with any automation and also run our own set of NTP servers so we can ensure our customers always have in-cloud NTP servers available to synchronize with. We think this really is part of the famous &#8220;muck / undifferentiated heavy lifting&#8221; that Amazon prides itself to take care of, but they politely declined, which is really a shame. We also noticed that the new HVM Linux images don&#8217;t lock their clocks to the host, so we will probably switch the way the clock sync works across all RightImages.</p>
<p>Another interesting issue we discovered during testing was that automation at boot time would frequently fail. The root cause ended up being that our service, which was configured to launch at boot, was starting too early and that the server just wasn&#8217;t ready yet. The way Windows boots in the cloud is quite different from Linux or from the &#8220;normal&#8221; world. With Windows, each OS install generates a server key that is embedded in the registry and uniquely identifies the server, so when an image is booted many times a fresh key needs to be generated for each instance and a number of things need to be updated (&#8220;sysprep&#8221;) and the server rebooted once. Towards the end of the second boot a special Ec2Config service finalizes the config, including admin password and hostname. This means that any automation has to wait for the reboot plus the ec2 service to complete its changes, which is not trivial due to an oversight by AWS. It&#8217;s interesting how the security details of Windows (i.e. the server key) ripple down into the whole boot process, making it take twice as long as it should. The net is that while Linux instance boot times on EC2 have come down from a typical 6-8 minutes back in 2006 to under a minute now when using EBS images the Windows boot times are starting out around 10-15 minutes. Hopefully Microsoft can be sensitized to the notion that fast boot times are an important asset in the cloud because they enable a lot of automation that is very painful if one has to wait so long for additional capacity or replacement servers to come online.</p>
<p>The final and biggest piece is implementing all the automation support we offer in Linux for Windows as well. As part of this we ported Chef to Windows (we&#8217;re working with Opscode to feed the changes back into the mainline) and we built out support for PowerShell. This means that software can be installed and configured on Windows servers similar to the way it&#8217;s done in Linux. We find that larger software packages often need to be installed manually, but even in that case it&#8217;s nice to be able to automate as much as possible and choose which portions to run &#8220;ahead of time&#8221; and bake into a custom image and which portions to leave off for the actual server launches. To round out the automation we wrote a nice little monitoring plugin that speaks the collectd protocol and that offers a simple meta-language that can be used to query many of the WMI statistics available on Windows servers. And all this is available out of the box to all our customers.</p>
<p>One of the open question we have concerns Windows updates. We will most certainly republish fresh RightImages when AWS updates its base set. What is not clear to us is whether we should be publishing &#8220;fully updated&#8221; RightImages on a regular schedule. Most of the customers we asked told us that they really want to carefully manage the exact set of updates on their servers. It&#8217;s going to be interesting to see how the whole update management with Windows servers plays out and how it will affect the amount of image rebuilding that everyone will have to do. We will definitely build our ServerTemplates directly on RightImages or RightImages augmented by just base installs of larger software packages and do all the config using Chef/PowerShell. This way we minimize the work we have to do when we swap out the underlying Windows install or update level.</p>
<p>I must say that overall I&#8217;m very happy with where we&#8217;re ending up, which is that we&#8217;re getting Windows to a point where it definitely is usable in cloud-style. What I mean by that is not just migrating a set of traditional servers into a equivalent set of servers in the cloud, but rather automating Windows servers for the cloud and leveraging the flexibility of the cloud to enable the business. The friction along the way certainly is higher than with Linux, whether it&#8217;s from license questions that crop up everywhere to the mechanics that currently require double-booting, but it is totally possible and Microsoft can, if it focuses on it, make it a lot better yet!</p>
<br />Filed under: <a href='http://blog.rightscale.com/category/aws/'>AWS</a>, <a href='http://blog.rightscale.com/category/ec2/'>EC2</a>, <a href='http://blog.rightscale.com/category/rightimage/'>RightImage</a>, <a href='http://blog.rightscale.com/category/rightlink/'>RightLink</a> Tagged: <a href='http://blog.rightscale.com/tag/aws/'>AWS</a>, <a href='http://blog.rightscale.com/tag/ec2/'>EC2</a>, <a href='http://blog.rightscale.com/tag/rightimage/'>RightImage</a>, <a href='http://blog.rightscale.com/tag/rightlink/'>RightLink</a>, <a href='http://blog.rightscale.com/tag/rightscale/'>RightScale</a>, <a href='http://blog.rightscale.com/tag/windows/'>Windows</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/576/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/576/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/576/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/576/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/576/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/576/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/576/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/576/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/576/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/576/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/576/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/576/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/576/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/576/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=576&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2010/08/17/windows-in-the-cloud-with-rightscale/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0b359e9ce1a8fb5f3315fbec8beda697?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">TvE</media:title>
		</media:content>
	</item>
		<item>
		<title>Accelerating Cloud Adoption</title>
		<link>http://blog.rightscale.com/2010/04/28/accelerating-cloud-adoption/</link>
		<comments>http://blog.rightscale.com/2010/04/28/accelerating-cloud-adoption/#comments</comments>
		<pubDate>Thu, 29 Apr 2010 01:00:40 +0000</pubDate>
		<dc:creator>Thorsten</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[EC2 Cloud Adoption]]></category>

		<guid isPermaLink="false">http://blog.rightscale.com/?p=540</guid>
		<description><![CDATA[Over the past weeks we&#8217;ve been releasing a number of new features in response to numerous requests from our user base to broaden the reach of the RightScale platform.  Just today we launched support for Amazon&#8217;s new Singapore region, see &#8230; <a href="http://blog.rightscale.com/2010/04/28/accelerating-cloud-adoption/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=540&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Over the past weeks we&#8217;ve been releasing a number of new features in response to numerous requests from our user base to broaden the reach of the RightScale platform.  Just today we launched support for Amazon&#8217;s new Singapore region, see <a href="http://www.allthingsdistributed.com/2010/04/aws_asia_pacific_singapore_region.html">Werner&#8217;s blog post</a> as well as <a href="http://aws.typepad.com/aws/2010/04/now-open-aws-region-in-asia-pacific.html">Jeff Barr&#8217;s</a>. We&#8217;ve worked closely with the EC2 team so we can support the new region right out of the gate in our dashboard as well as with RightImages and the whole ServerTemplate, monitoring and automation stack. We&#8217;ll soon re-release our popular ServerTemplates in our <a href="http://www.rightscale.com/library">library</a> such that Singapore shows up in the list of supported clouds in each template. We&#8217;ve gotten a fair number of questions around who exactly has been asking for RightScale in south-east Asia and interestingly there isn&#8217;t a single answer.  Given that we support a large number of social gaming companies we&#8217;ve certainly heard requests from there. Some even were ready to set-up a Eucalyptus cloud managed with RightScale. Obviously latency, specially with mobile devices, has been an issue so being able to spin up servers in the region is a major enabler. Additionally, enterprises have been asking for ways to provide better cloud services to their development and research locations in Asia. How soon until it&#8217;s easier to list world regions where there <em>isn&#8217;t</em> an EC2 cloud than to list where they are?</p>
<p>Last week we released RightLink install packages for Linux and Windows enabling our users to take almost any image and RightScale-enable it. Most of you are probably well aware that I&#8217;m not a fan of creating images: it&#8217;s a slow process that results in hard to maintain monsters. This hasn&#8217;t changed with the RightLink installer, but what has changed is that we wanted to make it easier for our partners to build RightScale support for their flavor of Linux and we wanted to make it easier for users that already have servers up in the cloud to benefit from RightScale. But we didn&#8217;t just put the RightLink packages out in the wild, we also published the <a href="https://my.rightscale.com/library/server_templates/VM%20Image%20Tools">ServerTemplates we use to build our RightImages</a>. We&#8217;ve been publishing these images since 2007, the idea being that we provide a clean base OS install for CentOS and Ubuntu. In RightScale these serve as the base boot image on top of which custom software gets installed and configured dynamically at boot time. What has set our images apart is that we built them automatically and have published the scripts we used for that purpose. This means they&#8217;re clean and reproducible. More recently we&#8217;ve used ServerTemplates to build the images, basically we boot a server that then builds and saves an image as it comes up. The benefit of using ServerTemplates to build images is that they are easy to maintain and 100% automated, they also make it easy to understand what went into an image later on. This is an excellent framework for anyone else to customize and build their own special image. But I won&#8217;t let an opportunity pass by to recommend that you save yourself a lot of time and grief and simply use the RightImages we already built and maintain!</p>
<p>Last but not least we&#8217;ve introduced public beta support for Windows in RightScale, and that&#8217;s not just launching AMIs, it&#8217;s the full ServerTemplate machinery and associated automation, including initial monitoring support. Our sales guys all asked which of our standard features we don&#8217;t support for windows&#8230; the response is &#8220;uhhh, syslog consolidation?&#8221;. I&#8217;m really interested in seeing how the usage pattern of Windows deployments will differ from Linux. I assume that Windows folks will have to be much more in the image bundling business to create base installs of software packages that are too slow to install at boot time. But I also wonder about things like auto-scaling because it still takes a Windows server a lot longer to boot up. We may have to play more games with stopping and starting servers instead of booting them cold. I&#8217;m sure our users will tell us!</p>
<br />Filed under: <a href='http://blog.rightscale.com/category/aws/'>AWS</a>, <a href='http://blog.rightscale.com/category/cloud-computing/'>Cloud Computing</a>, <a href='http://blog.rightscale.com/category/ec2/'>EC2</a> Tagged: <a href='http://blog.rightscale.com/tag/aws/'>AWS</a>, <a href='http://blog.rightscale.com/tag/cloud-computing/'>Cloud Computing</a>, <a href='http://blog.rightscale.com/tag/ec2-cloud-adoption/'>EC2 Cloud Adoption</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/540/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/540/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/540/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/540/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/540/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/540/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/540/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/540/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/540/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/540/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/540/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/540/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/540/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/540/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=540&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2010/04/28/accelerating-cloud-adoption/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0b359e9ce1a8fb5f3315fbec8beda697?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">TvE</media:title>
		</media:content>
	</item>
		<item>
		<title>Thoughts about the Amazon Simple Notification Service</title>
		<link>http://blog.rightscale.com/2010/04/12/thoughts-about-amazon-simple-notification-service/</link>
		<comments>http://blog.rightscale.com/2010/04/12/thoughts-about-amazon-simple-notification-service/#comments</comments>
		<pubDate>Tue, 13 Apr 2010 04:40:36 +0000</pubDate>
		<dc:creator>Thorsten</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[Cloud Computing]]></category>

		<guid isPermaLink="false">http://blog.rightscale.com/?p=523</guid>
		<description><![CDATA[Amazon just released a new service, the Simple Notification Service (SNS), which is a distributed message delivery service roughly similar to AMQP or JMS message services. It uses a publish-subscribe paradigm and supports push delivery of notifications using HTTP and &#8230; <a href="http://blog.rightscale.com/2010/04/12/thoughts-about-amazon-simple-notification-service/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=523&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Amazon just released a new service, the <a href="https://aws.amazon.com/sns/">Simple Notification Service</a> (SNS), which is a distributed message delivery service roughly similar to AMQP or JMS message services. It uses a publish-subscribe paradigm and supports push delivery of notifications using HTTP and email. It seems mostly targeted at back-end applications, e.g. servers sending notifications to one another, but given that is has an email delivery mechanism it can also be used to deliver notifications to users.</p>
<p>Jeff Barr wrote an <a href="http://aws.typepad.com/aws/2010/04/introducing-the-amazon-simple-notification-service.html">article</a> describing SNS, the key points are:</p>
<ul>
<li>you can create topics and publish messages to these topics</li>
<li>others can subscribe to the topics and they will get messages pushed to them</li>
<li>messages can be pushed over http or smtp (email)</li>
<li>using access control policies one can control who is allowed to subscribe to a topic</li>
<li>the SNS system is redundant and retries message delivery if necessary</li>
<li>the cost at volume is $0.06 per 100,000 messages published and $0.06 per 100,000 HTTP message pushes. Email pushes cost 33x more. Good news is the first 100,000 message publish and pushes are free.</li>
</ul>
<p>It really is great that AWS provides such a service. It&#8217;s relatively easy to fire up a messaging server, like <a href="http://www.rabbitmq.com/">RabbitMQ</a>, but it&#8217;s a different story to set up a redundant scalable messaging system. While this can be done with RabbitMQ, for many users having this provided as a service is the right way to go.</p>
<p>Unfortunately SNS does not use a standard messaging API, it&#8217;s all proprietary. This is a major weakness of SQS, SNS, and SDB: once you use their interface you&#8217;re locked in to using AWS. Granted, the SNS interface isn&#8217;t particularly big, but then why did they have to roll their own?</p>
<p>My biggest beef with SNS is what is being said, or more precisely, not being said about reliability. I have no reason to believe that SNS doesn&#8217;t do all the right things, but AWS isn&#8217;t very forthcoming with specifics. Here is what the SNS docs state:</p>
<ul>
<li>&#8220;Reliable – Amazon SNS runs within Amazon’s proven network  infrastructure and datacenters, so topics will be available whenever  applications need them. To prevent messages from being lost, all  messages published to Amazon SNS are stored  redundantly across multiple servers and data centers.&#8221; <a href="https://aws.amazon.com/sns/">source</a></li>
<li>&#8220;Although most of the time each message will be delivered to your  application exactly once, the distributed nature of Amazon SNS and transient network conditions could result in  occasional, duplicate messages at the subscriber end.&#8221; <a href="https://aws.amazon.com/sns/faqs/#44">source</a></li>
</ul>
<p>So there is talk about redundant storage, at least once delivery, and delivery retries. But what I&#8217;d really want to know is not all this fuzzy feel-good stuff. The question is not that difficult:</p>
<ul>
<li>If SNS returns a HTTP &#8220;200 OK&#8221; to my publish request, what is the probability that each subscriber will receive at least one delivery attempt?</li>
</ul>
<p>ok, I guess I really need to factor in time, which would also give an indication of performance:</p>
<ul>
<li>If SNS returns a HTTP &#8220;200 OK&#8221; to my publish request, what is the probability distribution over time  that each subscriber has received at least one delivery attempt?</li>
</ul>
<p>This would let me reason about what I can use SNS for and what not, or whether I need a back-up synchronization mechanism or not, etc. If the story stays at the warm&amp;fuzzy level, AWS could at least specify when messages are stored redundantly, e.g. is a redundant copy stored by the time I get an HTTP &#8220;200 OK&#8221; response? Also specifics about how long and how often retries are made. (I&#8217;m focusing on the HTTP delivery, I don&#8217;t think it makes much sense talking about email delivery reliability.)</p>
<p>I hope that many others will also ask AWS to be more specific about what in the end is really the SLA offered by SNS (same goes for some of the other AWS services). I&#8217;m not asking for damages if the SLA isn&#8217;t met, I just want to know what AWS is publicly holding itself accountable for and thus what I can design for and apply my &#8220;trust in AWS&#8221; judgement to. While the service is in beta, the SLA might be a target.</p>
<br />Filed under: <a href='http://blog.rightscale.com/category/aws/'>AWS</a>, <a href='http://blog.rightscale.com/category/ec2/'>EC2</a> Tagged: <a href='http://blog.rightscale.com/tag/aws/'>AWS</a>, <a href='http://blog.rightscale.com/tag/cloud-computing/'>Cloud Computing</a>, <a href='http://blog.rightscale.com/tag/ec2/'>EC2</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/523/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/523/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/523/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/523/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/523/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/523/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/523/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/523/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/523/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/523/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/523/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/523/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/523/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/523/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=523&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2010/04/12/thoughts-about-amazon-simple-notification-service/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0b359e9ce1a8fb5f3315fbec8beda697?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">TvE</media:title>
		</media:content>
	</item>
		<item>
		<title>Amazon Consolidated Billing and Reserved Instances</title>
		<link>http://blog.rightscale.com/2010/02/10/amazon-consolidated-billing-reserved-instances/</link>
		<comments>http://blog.rightscale.com/2010/02/10/amazon-consolidated-billing-reserved-instances/#comments</comments>
		<pubDate>Thu, 11 Feb 2010 06:51:37 +0000</pubDate>
		<dc:creator>Thorsten</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[Amazon]]></category>
		<category><![CDATA[Billing]]></category>

		<guid isPermaLink="false">http://blog.rightscale.com/?p=476</guid>
		<description><![CDATA[Amazon added consolidated billing a couple of days ago. It allows you to consolidate multiple accounts onto a single bill so your credit card only gets one hefty charge instead of many smaller ones from all the accounts you might &#8230; <a href="http://blog.rightscale.com/2010/02/10/amazon-consolidated-billing-reserved-instances/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=476&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Amazon added <a href="http://aws.amazon.com/about-aws/whats-new/2010/02/09/announcing-consolidated-billing-for-aws-accounts/">consolidated billing</a> a couple of days ago. It allows you to consolidate multiple accounts onto a single bill so your credit card only gets one hefty charge instead of many smaller ones from all the accounts you might have. What&#8217;s actually nice is that you can download a csv file with the details of all the accounts in one place so you get an overview of where the money is going. For someone like me who has over a dozen accounts that&#8217;s really, really nice!</p>
<p>The way it works is that from the account you use for billing you can send invites to others so they can add their account to your billing method. They can still see their bill, it just doesn&#8217;t hit their credit card, it hits yours. You get to see what they used too, since you&#8217;re paying for it. I created a new account that I&#8217;m not using for any service at all and consolidated all my &#8220;real&#8221; accounts onto it. This way I can pretty freely hand out the credentials in finance &amp; admin so anyone there who wants to see the numbers can log in without having any power over any instances, buckets, or whatnot.</p>
<p>One of the nice benefits of consolidated billing is that you can share some of the savings of reserved instances across accounts, but the details are pretty weird. The part that makes sense is that if account A doesn&#8217;t use a reserved instance &#8220;slot&#8221; then another account B can make use of it and get the discounted hourly rates. Where it gets weird is that if account A purchased a reserved instances in zone us-east-1a then account B gets the benefit also in zone us-east-1a. While this may sound logical it makes no sense because the zone names are permuted between accounts, so A&#8217;s 1a is typically different from B&#8217;s! Amazon: so why do we need to buy reserved instances in an availability zone as opposed to a region since it clearly doesn&#8217;t really matter???</p>
<p>I&#8217;ve actually always been unhappy with reserved instances. They seem to combine two completely different notions that I believe don&#8217;t go together because the use-cases are disjoint. One notion is that of a &#8220;revenue commit&#8221; or &#8220;buying into a discount tier&#8221;: you pay some money up-front to get a lower rate. Makes perfect sense, but why does it have to be tied to an availability zone? Or even to an instance type for that matter? The second notion is that of guaranteed availability, i.e., your reserved instance slot is always guaranteed to be available, you won&#8217;t get an &#8220;insufficient capacity&#8221; error. I understand why that has to be for a specific zone and type.</p>
<p>The reason the two notions don&#8217;t mix for me is that in order to get the discount benefits you have to run your instance virtually all the time, it really only is advisable for instances that always run. Well, in that case the whole notion of reservation is moot since you&#8217;re running all the time anyway! If you&#8217;re looking for the availability guarantee it&#8217;s generally because the instance is *not* running all the time and you want to make sure that the day you need it you can get it. Think disaster recovery scenario. Well, in that case the whole &#8220;discount&#8221; is moot, since you&#8217;re like to actually pay more due to the up-front reservation cost! I wish I could just buy the discount without the availability guarantee and without the tie to a specific zone!</p>
<p><a name="Support"></a></p>
<p>One thing I noticed is that most of our larger customers are not using reserved instances. I wonder why&#8230;</p>
<br />Filed under: <a href='http://blog.rightscale.com/category/aws/'>AWS</a>, <a href='http://blog.rightscale.com/category/ec2/'>EC2</a> Tagged: <a href='http://blog.rightscale.com/tag/amazon/'>Amazon</a>, <a href='http://blog.rightscale.com/tag/aws/'>AWS</a>, <a href='http://blog.rightscale.com/tag/billing/'>Billing</a>, <a href='http://blog.rightscale.com/tag/ec2/'>EC2</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/476/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/476/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/476/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/476/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/476/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/476/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/476/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/476/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/476/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/476/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/476/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/476/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/476/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/476/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=476&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2010/02/10/amazon-consolidated-billing-reserved-instances/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0b359e9ce1a8fb5f3315fbec8beda697?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">TvE</media:title>
		</media:content>
	</item>
		<item>
		<title>Bid for Your Instances!</title>
		<link>http://blog.rightscale.com/2009/12/13/bid-for-your-instances/</link>
		<comments>http://blog.rightscale.com/2009/12/13/bid-for-your-instances/#comments</comments>
		<pubDate>Mon, 14 Dec 2009 05:52:23 +0000</pubDate>
		<dc:creator>Thorsten</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[Cloud Computing]]></category>

		<guid isPermaLink="false">http://blog.rightscale.com/?p=466</guid>
		<description><![CDATA[We&#8217;re clearly witnessing a year-end release finale at AWS with another big release tonight: EC2 Spot instance pricing. Spot instance pricing is the third pricing model introduced by Amazon after the original per-hour price (now called &#8220;on-demand&#8221;), then the &#8220;reserved&#8221; &#8230; <a href="http://blog.rightscale.com/2009/12/13/bid-for-your-instances/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=466&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>We&#8217;re clearly witnessing a year-end release finale at AWS with another big release tonight: EC2 Spot instance pricing. Spot instance pricing is the third pricing model introduced by Amazon after the original per-hour price (now called &#8220;on-demand&#8221;), then the &#8220;reserved&#8221; instance pricing and now a supply and demand driven &#8220;spot&#8221; pricing. As far as I know, this is the first step on a large scale towards &#8220;market pricing&#8221; for computing based on offer and demand. I know many people have been dreaming about something like this and a few startups have started to offer a compute market of some sort. But with Amazon&#8217;s offering it is now available on a large scale to anyone!</p>
<p>How it works is simple yet complex. You can read the <a href="http://aws.amazon.com/ec2/spot-instances/">official product page</a>,  <a href="http://aws.typepad.com/aws/2009/12/ec2-spot-instances-and-now-how-much-would-you-pay.html">Jeff Barr&#8217;s blog</a>, and <a href="http://www.allthingsdistributed.com/2009/12/amazon_ec2_spot_instances.html">Werner&#8217;s blog</a>. Here&#8217;s my attempt at explaining it. AWS publishes a spot price for each instance size in each region. The spot price is the per-hour cost of a server and if you launch a spot price server now that&#8217;s what you pay for the next hour. So instead of $0.10/hr for a small server you might only pay $0.03/hr if that&#8217;s the current spot price. AWS adjusts the spot price periodically based on the idle capacity available, so the price might be low at night or week-ends when many sites auto-scale down and it might be high during the day when everything is busy.</p>
<p>Now comes the complex part. You don&#8217;t just launch a spot instance and forget about it, you actually specify a maximum price you are willing to pay and for each hour you have your server running you pay the spot price current at the start of the hour. As the spot price continues to vary while your instance is running this maximum becomes very important because should the spot price exceed your maximum then your instance will be terminated by AWS! It&#8217;s also possible to work the maximum price in reverse: specify a price lower than the current spot price in the evening and your request stays queued until the spot price drops below what you specified and AWS then automatically launches your instances. You can revise your maximum at any time, so if at 4am the spot price has not dropped enough you can raise your max so your instances get to run before sunrise.</p>
<p>It should be clear from the way the spot pricing functions that this is intended for transient compute capacity. For your database instances you should carefully stay with the on-demand or reserved instances, but for late night batch jobs where it doesn&#8217;t matter whether they run a bit earlier or later the spot pricing can save quite some money.</p>
<p>One thing that is not obvious at the outset is what would motivate Amazon to keep the price down. Part of the answer lies in the fact that instances whose max bid drops below the current spot price get terminated, thus if the price goes up too much, too many instances get terminated which results in less revenue. So there is a balance between more instances at a lower price and fewer instances at a higher price. But I&#8217;m sure it&#8217;s a lot more complex than that.</p>
<p>We will be supporting spot pricing in the RightScale platform over the coming months and we&#8217;re curious about the functionality our customers would like to see in that respect. There are a lot of opportunities for automation here!</p>
<br />Posted in AWS, EC2 Tagged: AWS, Cloud Computing, EC2 <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/466/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/466/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/466/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/466/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/466/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/466/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/466/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/466/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/466/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/466/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/466/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/466/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/466/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/466/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=466&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2009/12/13/bid-for-your-instances/feed/</wfw:commentRss>
		<slash:comments>20</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0b359e9ce1a8fb5f3315fbec8beda697?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">TvE</media:title>
		</media:content>
	</item>
		<item>
		<title>Amazon EC2 &#8211; A New Chapter Begins</title>
		<link>http://blog.rightscale.com/2009/12/03/amazon-ec2-a-new-chapter-begins/</link>
		<comments>http://blog.rightscale.com/2009/12/03/amazon-ec2-a-new-chapter-begins/#comments</comments>
		<pubDate>Thu, 03 Dec 2009 15:22:36 +0000</pubDate>
		<dc:creator>Thorsten</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[EC2]]></category>

		<guid isPermaLink="false">http://blog.rightscale.com/?p=455</guid>
		<description><![CDATA[Tonight Amazon made a milestone release introducing the ability to boot instances from an EBS volume and stop &#38; start instances. In addition, just a few weeks after announcing their plans to expand AWS to the far east, today they&#8217;ve &#8230; <a href="http://blog.rightscale.com/2009/12/03/amazon-ec2-a-new-chapter-begins/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=455&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Tonight Amazon made a milestone release introducing the ability to boot instances from an EBS volume and stop &amp; start instances. In addition, just a few weeks after <a href="http://aws.amazon.com/about-aws/whats-new/2009/11/12/aws-asia/">announcing</a> their plans to expand AWS to the far east, today they&#8217;ve moved west and made a US west coast cloud available. (Do they need a compass?) For the AWS view on all this see <a href="http://www.allthingsdistributed.com/2009/12/amazon_ec2_boot_from_ebs.html">Werner&#8217;s Blog</a> as well as Jeff Barr&#8217;s <a href="http://aws.typepad.com/aws/2009/12/expanding-the-aws-footprint.html">postings</a>. But one thing at a time&#8230;</p>
<h2>Amazon introduces US west coast cloud</h2>
<p>Almost exactly a year after the first geographical expansion of EC2 to Europe today is the second big step to the west coast. What is notable about the EC2 architecture is that each one of these expansions constitutes a new cloud or &#8220;region&#8221; in EC2 speak. This means that now in addition to the US-EAST-1 and EU-WEST-1 regions we have a new US-WEST-1 region. Each region operates autonomously from the others in order to provide failure isolation, which has benefits as well as downsides. A major benefit is obviously the redundancy one can get by operating in more than one region or placing DR in a region other than the one used for one&#8217;s primary service. The downside is that sharing across regions is not as easy as one might imagine. For example, machine images (AMIs) are not shared, so for each image you&#8217;re using in one region you have to copy and re-register the image in the other, and then it has a different id you need to keep track of and reference. We didn&#8217;t plan it this way, but our multi-cloud support turns out to be very helpful in managing operations in multiple EC2 regions. For example, in RightScale you can define ServerTemplates that use different images in different clouds, this means that as you update your ServerTemplate it automatically works across clouds and thus EC2 regions.</p>
<p>For redundant operations the comparison between the cloud and DIY datacenters is becoming ever more lopsided. Who can really afford to lose the man-hours, the cap-ex, the time-to-market, and incur the headaches it takes to set-up a datacenter from scratch, even if it&#8217;s in a traditional colo? And who can afford to go through all that again to set-up a second or DR site? The ease with which it is now possible to set-up a DR site in the cloud that is a faithful replica of the primary site is really remarkable. And the best is that the second site can be extremely low cost because very little needs to be running there: most of it can be fired up on-demand in the case something happens. If you already have your own datacenter/colo set-up then all hope is not lost. Setting up DR in the cloud is one of the common use-cases we see.</p>
<h2>Amazon Instances Boot from EBS</h2>
<p>The real sea change about to occur in EC2 is booting from EBS. Tonight&#8217;s release includes a ton of new features which build on the recently introduced ability to publish EBS snapshots. Here&#8217;s a quick summary:</p>
<ul>
<li>instances can boot from an EBS snapshot instead of a traditional AMI, EC2 creates an EBS volume from the snapshot and makes it the root partition</li>
<li>instances can also boot from an EBS volume, which means that a &#8220;boot from EBS&#8221; instance can effectively be stopped and restarted later by keeping the volume around and launching a fresh instance from the same volume</li>
<li>instances can now be stopped and restarted later, which works almost exactly as described in the bullet above except for the fact that the instance id (the i-12345678 number) remains the same</li>
<li>almost all attributes of an instance can change while stopped, including the instance size (naturally the availability zone is one thing that can&#8217;t change)</li>
<li>EBS snapshots can be registered and published as images, so now we have &#8220;traditional images&#8221; as well as &#8220;EBS images&#8221; (I wonder what AWS will call these)</li>
<li>images can specify snapshots and volumes to be automatically mounted at boot, and they can specify EIPs to be attached at boot, the run-instances API call can add/override these &#8220;image defaults&#8221;</li>
<li>instances can be &#8220;locked&#8221;, which prevents their accidental termination</li>
<li>instances can be bundled into images using an API call (with shutdown or optionally without)</li>
</ul>
<p>That&#8217;s a long list of features to digest! What&#8217;s going on here is that AWS is responding to the needs of enterprise customers who have many &#8216;legacy&#8217; applications that are not designed to scale out or to play nice with the operations agility enabled by the cloud. It&#8217;s for the apps that sysadmins spend weeks setting up and then do their utmost not to touch again. Now they can be installed on an EBS root volume and servers can be launched and relaunched as needed without having to touch the config. Basically this enables the old-school way of managing servers to be applied to EC2.</p>
<p>But these new features are also of great benefit to those operating scalable arrays of servers or web 2.0 web sites. It is now much easier to make changes to a clean server image: mount the image as a volume onto an extra server, edit the software/config on the image (e.g. using chroot and the native packaging system), when happy create an image from the volume and boot a server. Test it out and fix any problems in the original volume. Repeat until happy. If done correctly, this results in clean images that are not polluted by repeated boots and other operations, which is one goal we&#8217;ve always pursued with the RightImages we publish.</p>
<p>The stopping and starting of servers can also make development more cost effective. Developers that use dev &amp; test servers can stop them at the end of the day and start them back up when they next need them. In fact, many servers could be set-up to stop by themselves if there has been no activity for a while. (This reminds me that I saw that the three longest running instances visible by RightScale have been running for over 1000 days and that the account they run in has seen no activity since then, except for credit card charges I assume, impressive and scary at the same time!)</p>
<p>Stopping and starting servers can also be abused. For example, it can be used to implement &#8220;dumb auto-scaling&#8221;: simply stop some servers when the load drops and start them back up later. The good thing is that you don&#8217;t end up with fresh servers on start, so they don&#8217;t have to self-configure, the bad thing is, well, that you don&#8217;t end up with fresh servers, servers come up believing the world hasn&#8217;t changed since they were last stopped. I think of this as abuse because it&#8217;s easy to forget to update one of the stopped servers when making changes to the system, whether these are changes to the software installed on each server or changes to the rest of the system each server needs to communicate with. In other words, the danger of having a zombie come back to life and create mayhem is high. Better keep a basic amount of hygiene and start with fresh servers.</p>
<h2>The Cloud Marches On&#8230;</h2>
<p>It will be interesting to see how EC2 and its user base continue to evolve. With each release Amazon offers more options. That&#8217;s more ways to do interesting stuff, but also more ways to shoot oneself in the foot and more stuff to &#8216;grok&#8217; to get started. Maybe the most important, though, is that the Boot from EBS features rank very high on the &#8220;remove sales objections&#8221; scale: not every application is ready for the former EC2 cloud, not every sysadmin is ready for it either, by far not. I have to admit that all this leaves me with mixed feelings. EC2 used to have a simple &amp; clean model, it required some rethinking but that was for the better. It was clear how to deploy highly scalable, highly redundant applications with a high degree of automation. Now that there are 10 ways to skin the proverbial cat it&#8217;s much harder to stay on track and to leverage automation. Where early customers needed help figuring out how to operate in the world of EC2&#8242;s disposable servers today&#8217;s customers need help just navigating through all the options available in EC2 and which to apply to each application or use-case.</p>
<p>Support for the new features and the new US-WEST region in RightScale will become available with our next release, currently scheduled to go live just before xmas. Full support for booting from EBS will take a little longer as it has far-reaching implications. I&#8217;m sure that many of our customers will be operating in the new west coast region and that  it may even have some appeal to those in the far east and south pacific as &#8220;one step closer&#8221; to a local presence.  As always, we&#8217;d love to hear your thoughts on the new features, how you&#8217;re planning to use them, and how you&#8217;d like to see us support them.</p>
<h2>Updates:</h2>
<ul>
<li>AWS now gives each region a little local character: US-WEST-1 is <a href="http://aws.amazon.com/ec2/#pricing">listed</a> as &#8220;N. California&#8221;, US-EAST-1 as &#8220;N. Virginia&#8221;, and &#8220;EU-WEST-1&#8243; as &#8220;Ireland&#8221;.</li>
<li>Nice <a href="http://clouddevelopertips.blogspot.com/2009/07/boot-ec2-instances-from-ebs.html">blog post</a> on some of the mechanics of using Boot from EBS by Shlomo Swidler (but see comment below)</li>
<li>Some things you can&#8217;t do with traditional AMIs: start &amp; stop instance, create image (new way of bundling)</li>
<li>Some things you can&#8217;t do with EBS-based AMIs: dev pay, protect the content of public AMIs (someone can mount the content as a data volume and pull files off it)</li>
<li>If you plan to create a public EBS-based AMI beware of deleted files: don&#8217;t just &#8220;delete&#8221; files with sensitive data on the volume because they can be &#8220;undeleted&#8221;, you have to erase the blocks, or better, not put anything sensitive there in the first place</li>
</ul>
<br />Posted in AWS, Cloud Computing, EC2 Tagged: AWS, Cloud Computing, EC2 <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/455/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/455/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/455/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/455/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/455/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/455/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/455/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/455/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/455/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/455/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/455/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/455/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/455/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/455/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=455&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2009/12/03/amazon-ec2-a-new-chapter-begins/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0b359e9ce1a8fb5f3315fbec8beda697?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">TvE</media:title>
		</media:content>
	</item>
		<item>
		<title>RightScale ServerTemplate Library and Machine Tags</title>
		<link>http://blog.rightscale.com/2009/10/28/rightscale-servertemplate-library-and-machine-tags/</link>
		<comments>http://blog.rightscale.com/2009/10/28/rightscale-servertemplate-library-and-machine-tags/#comments</comments>
		<pubDate>Wed, 28 Oct 2009 19:35:02 +0000</pubDate>
		<dc:creator>Thorsten</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[Chef]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[Rackspace]]></category>
		<category><![CDATA[RightLink]]></category>
		<category><![CDATA[RightScale]]></category>
		<category><![CDATA[ServerTemplate]]></category>

		<guid isPermaLink="false">http://blog.rightscale.com/?p=448</guid>
		<description><![CDATA[Yesterday&#8217;s release of the RightScale platform introduced two new features that I&#8217;m really excited about: the ServerTemplate Library and the use of Machine Tags on servers. (Ooops, I shouldn&#8217;t forget the new features for RackSpace, but I&#8217;ll talk about those &#8230; <a href="http://blog.rightscale.com/2009/10/28/rightscale-servertemplate-library-and-machine-tags/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=448&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Yesterday&#8217;s release of the RightScale platform introduced two new features that I&#8217;m really excited about: the ServerTemplate Library and the use of Machine Tags on servers. (Ooops, I shouldn&#8217;t forget the new features for RackSpace, but I&#8217;ll talk about those next week.)</p>
<p>We&#8217;ve had rather sophisticated sharing of ServerTemplates in RightScale for over a year now allowing certain users to share ServerTemplates, RightScripts and other design artifacts with other RightScale users. This enables us to publish free ServerTemplates to all our users, premium ones to our customers and it also lets ISVs on our platform publish ServerTemplates for free or for pay to their users and customers. In addition, each of the design artifacts is versioned such that users who have launched servers with a ServerTemplate last year can still launch new servers with exactly the same version of that ServerTemplate.</p>
<p>A result of all this publishing, sharing and versioning is that there&#8217;s a lot to choose from. So much that drop-down menus have become really unwieldy and this is where the new library comes into play. In the past, when adding a server to a deployment one had to find the correct ServerTemplate from the list of all available templates in the RightScale system. Now this has become a two-step process where you first import the ServerTemplates of interest from the library into your account and then only the imported templates are shown in all the drop-down selection menus. Separating the library import/export step will also allow us to significantly upgrade the experience browsing all the design artifacts in the library over the coming releases, stay tuned&#8230;</p>
<p>We introduced Flickr style <a href="http://www.flickr.com/groups/api/discuss/72157594497877875/">machine tags</a> recently and we&#8217;re expanding their use with this release. One of the really exciting new features is that servers now have tags and we&#8217;ve integrated the tags with the routing of messages between servers, with Chef (via the RightLink agents) and with the UI. All this is still in alpha but it&#8217;s starting to take shape. Our first real use-case is the registration of application servers with load balancers. The way it works is that when a load balancer comes up and is ready for operation it adds a &#8220;loadbalancer:lb=www&#8221; tag to say &#8220;I&#8217;m a load balancer for the www vhost&#8221;. When an app server starts up, it requests all servers in the deployment with a &#8220;loadbalancer:lb=www&#8221; tag to run a Chef recipe that adds the app server to the load balancer rotation. This way, the app server doesn&#8217;t need to know which or how many load balancers there are. The tag matching, communication, and running of the Chef recipe are all done by the RightLink agents.</p>
<p>In order to let new load balancers come up when app servers are already running we can do the same tag-location in reverse: app servers announce &#8220;loadbalancer:app=www&#8221; to say &#8220;I&#8217;m an app server serving vhost www&#8221; and load balancers on start-up can add all app servers to their config by querying for all servers with that tag. For overall resiliency it&#8217;s a good idea for load balancers to re-query the set of app servers and to update their config accordingly. This catches race conditions as well as issues where portions of the app servers may be temporarily invisible due to network partitions. The theme here is &#8220;eventual consistency&#8221; and we&#8217;re still evaluating what the best primitives are to support high availability.</p>
<p>You may wonder why the examples above use such long tags and that&#8217;s really where machine tags come in. The &#8220;loadbalancer:&#8221; prefix helps isolate the tags to coordinate the load balancer registration from other tags. Think of &#8220;loadbalancer&#8221; as being the name of the application or feature that uses these tags, e.g. the load balancer registration. The &#8220;lb=www&#8221; and &#8220;app=www&#8221; tag predicate and value can be used to support multiple vhosts. So a load balancer could announce &#8220;loadbalancer:lb=www&#8221; and &#8220;loadbalancer:lb=api&#8221; to indicate that it&#8217;s load balancing the www and api vhosts. And an api app server then would only query for the &#8220;lb=api&#8221; tag and it would only announce the &#8220;app=api&#8221; counterpart.</p>
<p>While all this is happening amongst the servers, the RightScale UI provides access to all the tags, so one can see the servers announce the various tags and one can even intervene and manually modify these tags. We might provide a &#8220;don&#8217;t touch&#8221; notion for some tags, but right now it&#8217;s much more important to us to be able to expose all this machinery. As an ops guy there are few things I loathe more than hidden automation that I can&#8217;t inspect and override when I need to.</p>
<p>Of course there&#8217;s more in the new release than just these two features: more support for RackSpace (monitoring in particular), improved support for Chef, support for new AWS features, <a href="http://support.rightscale.com/18-Release_Notes/01-RightScale_Dashboard/Current">and more</a>&#8230;</p>
<br />Posted in AWS, Chef, Cloud Computing, EC2, Rackspace, RightLink Tagged: AWS, Chef, RightScale, ServerTemplate <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/448/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/448/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/448/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/448/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/448/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/448/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/448/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/448/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/448/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/448/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/448/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/448/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/448/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/448/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=448&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2009/10/28/rightscale-servertemplate-library-and-machine-tags/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0b359e9ce1a8fb5f3315fbec8beda697?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">TvE</media:title>
		</media:content>
	</item>
		<item>
		<title>Amazon takes EC2 to the next level with persistent storage volumes</title>
		<link>http://blog.rightscale.com/2008/04/13/amazon-takes-ec2-to-the-next-level-with-persistent-storage-volumes/</link>
		<comments>http://blog.rightscale.com/2008/04/13/amazon-takes-ec2-to-the-next-level-with-persistent-storage-volumes/#comments</comments>
		<pubDate>Mon, 14 Apr 2008 04:39:59 +0000</pubDate>
		<dc:creator>Thorsten</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[MySQL]]></category>

		<guid isPermaLink="false">http://rightscale.wordpress.com/?p=51</guid>
		<description><![CDATA[The Amazon folks have gone public today with the next EC2 feature: persistent storage. The official information is found in Jeff Barr&#8217;s blog entry and in Matt&#8217;s forum post. Calling the persistent storage a &#8220;feature&#8221; is actually quite an understatement, &#8230; <a href="http://blog.rightscale.com/2008/04/13/amazon-takes-ec2-to-the-next-level-with-persistent-storage-volumes/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=51&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The Amazon folks have gone public today with the next EC2 feature: persistent storage. The official information is found in <a href="http://aws.typepad.com/aws/2008/04/block-to-the-fu.html">Jeff Barr&#8217;s blog entry</a> and in <a href="http://developer.amazonwebservices.com/connect/thread.jspa?threadID=21082&amp;tstart=0">Matt&#8217;s forum post</a>. Calling the persistent storage a &#8220;feature&#8221; is actually quite an understatement, it really revolutionizes EC2 and enables usage patterns that any big-iron SAN user would die for.</p>
<h3>The basics</h3>
<p>What does this persistent storage look like? We&#8217;ve been testing it for awhile and are thoroughly impressed. The Amazon folks are clearly still fine-tuning a lot of the details, but basically you can create storage volumes in the cloud next to the server instances you launch in the cloud. Think of having a really big SAN in the cloud in which you can create volumes of up to 1TB each with a single API call, or with a simple click in the RightScale UI (yes, of course we&#8217;ll have nice support for the storage volumes on our site coupled with some neat automation and an array of pre-packaged solutions). You can mount one or multiple volumes on an instance and they appear just like the other local drives, so you can format them as you like, set-up striping and do other useful things.</p>
<p>The feature that really makes the storage volumes sizzle is the ability to snapshot them to S3 and then create new volumes from the snapshots. The snapshots are great for durability: once a snapshot is taken it is stored in S3 with all the reliability attributes of S3, namely redundant storage in multiple availability zones. This essentially solves the whole backup issue with one simple API call or click in the RightScale UI. You can also easily restore a snapshot by creating a fresh volume from it. This feature is useful beyond just restoring a backup: you may restore to another instance where you now have a clone of the data and can do whatever you want to it. Wow!</p>
<h3>The cool stuff</h3>
<p>There are so many great uses for the storage volumes that it&#8217;s impossible to write them all up in a single blog post, and we obviously haven&#8217;t thought of them all (or even close). The first usage scenario we looked into is running a database. Up to today the only setup for a mission critical database we recommend is using two instances with real-time database replication and frequent backups to S3. We&#8217;ve now installed our <a href="http://www.rightscale.com/m/features.html">Manager for MySQL</a> replicated set-up for many, many customers and it works very well. In short, we use MySQL replication for redundancy and frequent (like every 10 minutes) backups to S3 on the slave to guard against the unlikely event of simultaneous failure of both instances located in different availability zones.</p>
<p>With the storage volumes the Manager for MySQL set-up works even better. Instead of having to tar-up the database files and upload them to S3 we can just take a snapshot. And in order to initialize a slave we simply create a volume for it from the last snapshot of the master and launch the replication: no more rsync of the data is necessary. It&#8217;s really nice to see how all the automation we&#8217;ve built stays in place with the new Amazon capabilities and saves just as many headaches as before, it just gets turbocharged by the storage volumes!</p>
<p>In addition, the storage volumes enable slightly lower-end database offerings. Since the storage volumes are more durable than local instance storage a lot of the risk of losing it all if the instance dies goes away. It is now possible to run a single instance with the database data living on a storage volume and to take frequent snapshots to get backups onto S3. Should the instance die, it is very simple to launch a fresh one using the same storage volume. Typically it would take only a few minutes for the new instance to come up and take off where the old one stopped! Of course this set-up has more downtime when compared to the redundant database set-up, and one has to be really careful in setting everything up to minimize the time it takes to mount the volume and to ensure a successful database recovery.</p>
<p>Just as the storage volumes enable the reliable use of single-instance databases they also enable single-tenant appliances in EC2. It is now possible to host the data for a single-tenant virtual appliance on a storage volume and mount it on an instance. What&#8217;s really cool is the decoupling of the data from the instance. It means that you can start a customer on a small instance and if they outgrow it, you can migrate them almost seamlessly to a large and later an x-large instance, all using the same storage volume. Beyond an x-large a couple of interesting options are possible to increase performance further, such as striping multiple storage volumes. EC2 really brings virtual appliances to the next level!</p>
<p>The S3 snapshots enable some completely different and very intriguing usage scenarios. Suppose you&#8217;re doing some DNA matching against a Genome data set on 1000 instances. In addition to firing-up 1000 instances on a whim you can, also on a whim, clone a nicely prepared snapshot of the data set 1000-times to create 1000 volumes, one for each instance. BANG! This way they can all independently crawl over the data set. This type of massive (essentially read-only) cloning really opens-up new possibilities in running such large computations in a cost effective manner.</p>
<h3>Summing it up</h3>
<p>I&#8217;ll stop here, but clearly the cloud has just squared in size! Two years ago, when I started on EC2 there were only small instances available and the sentiment was that in order to get the horizontal scalability and pricing of the cloud you had to accept inferior features. In the meantime we&#8217;ve gotten multiple instance sizes plus recently the remappable IP addresses and availability zones. That already indicated that computing in the cloud would soon surpass computing in traditional colos or in your own datacenter not just in scale and price, but also in feature set. With the addition of the storage volumes with all the cool snapshot features it&#8217;s now a <em>fait accomplit</em>: the cloud adopters will have much more computing horsepower and flexibility at their fingertips than those who are still racking their own machines. It&#8217;s going to be like agile software development: if you want to survive as an internet/web service you will have to compute in the cloud or your competitors will leave you in the dust by being able to deploy faster, better, and cheaper.</p>
<p><strong>Update:</strong> Werner Vogels, Amazon&#8217;s CTO also <a href="http://www.allthingsdistributed.com/2008/04/persistent_storage_for_amazon.html">blogs about the storage volumes</a> in <a href="http://www.allthingsdistributed.com/">all-things-distributed</a> with a little more background perspective. The Amazon folks are getting pretty coordinated with news appearing at the same time on their blogs and the forums. Maybe I missed it, but I don&#8217;t think they even press release this stuff&#8230;</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/rightscale.wordpress.com/51/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/rightscale.wordpress.com/51/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/51/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/51/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/51/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/51/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/51/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/51/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/51/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/51/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/51/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/51/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/51/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/51/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/51/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/51/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=51&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2008/04/13/amazon-takes-ec2-to-the-next-level-with-persistent-storage-volumes/feed/</wfw:commentRss>
		<slash:comments>34</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0b359e9ce1a8fb5f3315fbec8beda697?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">TvE</media:title>
		</media:content>
	</item>
		<item>
		<title>RightAws release 1.7.0, including ActiveSDB alpha</title>
		<link>http://blog.rightscale.com/2008/04/01/rightaws-release-170-including-activesdb-alpha/</link>
		<comments>http://blog.rightscale.com/2008/04/01/rightaws-release-170-including-activesdb-alpha/#comments</comments>
		<pubDate>Tue, 01 Apr 2008 20:36:24 +0000</pubDate>
		<dc:creator>Thorsten</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[RightAws]]></category>
		<category><![CDATA[Ruby]]></category>

		<guid isPermaLink="false">http://rightscale.wordpress.com/?p=49</guid>
		<description><![CDATA[We released a new version of our Ruby library gem for accessing AWS Services, including EC2, S3, SQS, and SDB: RightAws 1.7.0 is now available off rubyforge.  This version includes enhancements of the EC2 interface to support Elastic IP addresses, &#8230; <a href="http://blog.rightscale.com/2008/04/01/rightaws-release-170-including-activesdb-alpha/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=49&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>We released a new version of our Ruby library gem for accessing AWS Services, including EC2, S3, SQS, and SDB: RightAws 1.7.0 is now available off <a href="http://rubyforge.org/projects/rightaws/">rubyforge</a>.  This version includes enhancements of the EC2 interface to support Elastic IP addresses, selectable kernels, and availability zones.  It also contains the first alpha release of ActiveSDB, which is a new ActiveResource-like interface for Amazon SDB. If you try out ActiveSDB please let us know what you think!</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/rightscale.wordpress.com/49/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/rightscale.wordpress.com/49/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/49/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/49/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/49/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/49/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/49/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/49/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/49/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/49/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/49/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/49/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/49/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/49/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/49/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/49/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=49&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2008/04/01/rightaws-release-170-including-activesdb-alpha/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0b359e9ce1a8fb5f3315fbec8beda697?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">TvE</media:title>
		</media:content>
	</item>
		<item>
		<title>Setting up a fault-tolerant site using Amazon&#8217;s Availability Zones</title>
		<link>http://blog.rightscale.com/2008/03/26/setting-up-a-fault-tolerant-site-using-amazons-availability-zones/</link>
		<comments>http://blog.rightscale.com/2008/03/26/setting-up-a-fault-tolerant-site-using-amazons-availability-zones/#comments</comments>
		<pubDate>Thu, 27 Mar 2008 00:57:33 +0000</pubDate>
		<dc:creator>Thorsten</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[EC2]]></category>

		<guid isPermaLink="false">http://rightscale.wordpress.com/?p=48</guid>
		<description><![CDATA[Amazon&#8217;s Availability Zones are a fabulous new feature that allows users to assign instances to locations that are very fault-tolerant from one another yet that have very high bandwidth between each other. I wish I could have done something like &#8230; <a href="http://blog.rightscale.com/2008/03/26/setting-up-a-fault-tolerant-site-using-amazons-availability-zones/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=48&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Amazon&#8217;s Availability Zones are a fabulous new feature that allows users to assign instances to locations that are very fault-tolerant from one another yet that have very high bandwidth between each other. I wish I could have done something like that as easily when I was responsible for operations at Citrix Online and we had 5 datacenters worldwide. As I&#8217;ll explain in this post, what Amazon actually provides us is much better than just putting servers into multiple datacenters.</p>
<p>The most confusing thing about availability zones is the name:  In the cloud, what exactly is an &#8220;availability zone&#8221;? The easiest way to think about it is that a zone equals a datacenter. If power goes out in one datacenter and the generators fail to start (naah, that never happens&#8230;) then it doesn&#8217;t affect the other datacenter. Or if there&#8217;s a fire, one datacenter may burn out or be otherwise incapacitated, but others are unaffected. In reality zones don&#8217;t necessarily correspond to datacenters. Given careful engineering, it&#8217;s possible to have multiple &#8220;rooms&#8221; in a datacenter that are highly failure isolated while technically still being part of the same datacenter (imagine football-sized fields here).</p>
<p>The point of availability zones is the following: if I launch a server in zone A and a second server in zone B, then the probability that both go down at the same time due to an external event is extremely small. This simple property allows us to construct highly reliable web services by placing servers into multiple zones such that the failure of one zone doesn&#8217;t disrupt the service or at the very least, allows us to rapidly reconstruct the service in the second zone.</p>
<p>The one caveat to consider when using multiple zones is that there is no free lunch (you knew there was a catch, didn&#8217;t you?). First of all there&#8217;s the speed of light. The zones Amazon is exposing are all on the East coast (indicated by the names, such as &#8220;us-east-1a&#8221;. I don&#8217;t have inside information about the location of their facilities, but I imagine some may be in New York and others may be in Virginia, so the distance between zones may be considerable, thus translating into some network latency. And even if the actual facilities used by EC2 today are not that far apart, they may be someday in the future.</p>
<p>The second &#8220;gotcha&#8221; is that bandwidth across zone boundaries is not free.  Amazon is charging $0.01/GB for what they call &#8220;regional&#8221; traffic. This is less than 1/10th the cost of Internet traffic, which seems perfectly reasonable to me. In the days where I was managing multiple datacenters the cost of traffic between them was essentially the same as the cost of random Internet traffic. Actually, I take that back, it cost twice as much: once to exit one datacenter and once to enter the other. (Granted, at high volume one can do interesting things to save some money, but it doesn&#8217;t become free by a long shot.)</p>
<p><b>An example </b></p>
<p>Enough talk, let&#8217;s show a diagram of how a simple redundant web site looks like with Availability Zones and Elastic IPs. At the core we&#8217;ll have two web servers (e.g. with Apache and PHP) running the web application and accessing the master database. All this occurs in one zone. We&#8217;ll allocate two Elastic IP addresses that we assign to the two web servers and then we create a round-robin DNS entry for our web site that maps the domain name to the two IP addresses (this is commonly called &#8220;round-robin DNS&#8221;).</p>
<p align="center"><img src="http://www.kahunadesign.com/RightScale/blog/availability_zone_1.png" alt="Fault Tolerance with availability zones img1" height="325" width="493" /></p>
<p>In order to ensure the survival of the data in the case of a massive failure, we start a slave database in a second availability zone and replicate the data in real-time. This is how we&#8217;ve set-up all our customers to date, except that up until now we haven&#8217;t been able to specify the placement of the slave with respect to the master. In the RightScale Dashboard the zone of each server is shown and at server launch time one can select the desired zone.</p>
<p>Now suppose the zone with the web servers and database fails due to a fire! After receiving an alert, we first promote the slave in the second zone to master using the RightScale Manager for MySQL automation. We then launch fresh web/app servers in the same zone as the slave database. Once the promotion completes and the two new servers are up, it is a simple matter of reassigning the Elastic IPs to the two new servers to redirect all the users to the new servers and we&#8217;re up and running again.</p>
<div style="text-align:center;"><img src="http://www.kahunadesign.com/RightScale/blog/availability_zone_2.png" alt="Fault Tolerance with availability zones img2" height="364" width="548" /></div>
<p>The next step is to recreate the redundancy and for this the third availability zone that each account has access to comes into play. We start a fresh database slave in the third zone again using the automation in the Manager for MySQL. Once that comes up and starts replicating we are back to having a redundant setup!</p>
<div style="text-align:center;"><img src="http://www.kahunadesign.com/RightScale/blog/availability_zones_4.png" alt="Fault Tolerance with availability zones img4" height="316" width="605" /></div>
<p>If you have never tried to set something like this up yourself starting from renting colo space, purchasing bandwidth to buying and installing servers, you really can&#8217;t appreciate the amount of capital expense, time, headache, and ongoing expense saved by EC2&#8242;s features! And best of all, using RightScale it&#8217;s just a couple of clicks away <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> .</p>
<p><b>Beyond the simple redundant setup</b></p>
<p>As an astute reader you probably noticed that the site described above would go down if there was a failure in the primary zone, which would require a manual restarting of new servers in order to bring it back up.  Some of this can be easily remedied by placing one or multiple web servers into the secondary zone and having them talk to the master DB across the zone boundary.  The performance of these servers may be slightly lower due to the inter-zone latency and there is some cost to the database access traffic. It&#8217;s somewhat application-dependent how these play out.</p>
<p>A more sophisticated setup uses load balancers to reduce the impact of the cross-site traffic. The idea is to place one load balancer instance in each zone and route the requests primarily to a set of redundant web/app servers in the primary zone, as shown in the figure below. A third app server can be running in the secondary zone and perhaps get a trickle of traffic from the load balancers just to keep it &#8220;warm.&#8221;  Keeping it warm makes it easy to monitor and ensure that it&#8217;s operating properly.</p>
<div style="text-align:center;"><img src="http://www.kahunadesign.com/RightScale/blog/availability_zones_3.png" alt="Fault Tolerance with availability zones img3" height="473" width="470" /></div>
<p>The good thing about this setup is that the traffic shipped across the zone boundary is exactly the same as comes into the second load balancer. This means that for half the total Internet traffic there is a $0.01/GB surcharge, which results in less than 5% extra cost overall. (This is not counting the DB replication traffic.) Also, the extra latency from one zone to the other is negligible when compared to the already incurred Internet latency.</p>
<p>In the case of a primary zone failure, browsers will fail over to the load balancer in the remaining zone (this is a feature built into web browsers related to the round-robin DNS set-up). The load balancer will direct all traffic to the third web/app server. At that point the secondary database needs to be promoted to master and the third app server repointed to that database and everything will be back up and running. With automation the DB promotion could be done automatically, but it&#8217;s better to be conservative: a promotion due to a false alert could cause a lot of harm.</p>
<p>This second set-up is a bit more complicated than the previous one, but it requires less machinery and no server launches in the case of a failure. It also requires one extra machine if one assumes that each load balancer can run on the same instance as a web/app server (typically not a problem). Many more variants on this basic setup are clearly possible and should be considered on a case-by-case basis.</p>
<p>Wow, it&#8217;s mind-boggling how much power Amazon is giving us in designing sophisticated distributed redundant Internet services! In combination, the availability zones, the elastic IPs and the overall programmatic control over all the resources <i>make the cloud a superior environment</i> for deploying sophisticated Internet services. At RightScale we&#8217;re extremely excited and are hard at work to incorporate the new features into our standard deployment templates such that all our customers can easily take advantage of the new features in their deployments. We&#8217;re also automating a number of the failure scenarios so that you don&#8217;t need to have an alert wake you up if there a fire at Amazon in the middle of the night!</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/rightscale.wordpress.com/48/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/rightscale.wordpress.com/48/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/48/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/48/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/48/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/48/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/48/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/48/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/48/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=48&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2008/03/26/setting-up-a-fault-tolerant-site-using-amazons-availability-zones/feed/</wfw:commentRss>
		<slash:comments>27</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0b359e9ce1a8fb5f3315fbec8beda697?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">TvE</media:title>
		</media:content>

		<media:content url="http://www.kahunadesign.com/RightScale/blog/availability_zone_1.png" medium="image">
			<media:title type="html">Fault Tolerance with availability zones img1</media:title>
		</media:content>

		<media:content url="http://www.kahunadesign.com/RightScale/blog/availability_zone_2.png" medium="image">
			<media:title type="html">Fault Tolerance with availability zones img2</media:title>
		</media:content>

		<media:content url="http://www.kahunadesign.com/RightScale/blog/availability_zones_4.png" medium="image">
			<media:title type="html">Fault Tolerance with availability zones img4</media:title>
		</media:content>

		<media:content url="http://www.kahunadesign.com/RightScale/blog/availability_zones_3.png" medium="image">
			<media:title type="html">Fault Tolerance with availability zones img3</media:title>
		</media:content>
	</item>
		<item>
		<title>Amazon&#8217;s communication is improving</title>
		<link>http://blog.rightscale.com/2008/03/24/amazons-communication-is-improving/</link>
		<comments>http://blog.rightscale.com/2008/03/24/amazons-communication-is-improving/#comments</comments>
		<pubDate>Tue, 25 Mar 2008 03:58:48 +0000</pubDate>
		<dc:creator>Thorsten</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[EC2]]></category>

		<guid isPermaLink="false">http://rightscale.wordpress.com/?p=40</guid>
		<description><![CDATA[One of the most often heard complaints about Amazon Web Services is the lack of communication about service status and issues. The community has been pretty vocal about this and they&#8217;ve certainly heard it and are committed to improving. So &#8230; <a href="http://blog.rightscale.com/2008/03/24/amazons-communication-is-improving/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=40&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>One of the most often heard complaints about Amazon Web Services is the lack of communication about service status and issues. The community has been pretty vocal about this and they&#8217;ve certainly heard it and are committed to improving. So it was nice to see the following response to a minor incident the other day posted on the <a href="http://developer.amazonwebservices.com/connect/thread.jspa?threadID=20489&amp;tstart=0&amp;start=41">forum</a>:</p>
<blockquote><p>Following up with more information about this morning&#8217;s event.  At 1:31am PST, a network engineer made a change to a pair of redundant aggregation routers fronting a portion of EC2.  This change caused both to no longer route traffic to a subset of EC2 instances.  The change should have been non-intrusive.  We are taking steps to prevent this type of failure in the future.  The failure affected a portion of EC2 instance connectivity for 2 hours and 5 minutes.  In response to the question about RSS feeds, we are planning to provide this functionality in the future.</p>
<p>Sincerely,<br />
The Amazon Web Services Team</p></blockquote>
<p>Thanks Kathrin for posting this. We all know that this stuff happens and we all want to use a provider who is committed to getting to root causes of incidents and eradicating them as much as possible!</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/rightscale.wordpress.com/40/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/rightscale.wordpress.com/40/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/40/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=40&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2008/03/24/amazons-communication-is-improving/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0b359e9ce1a8fb5f3315fbec8beda697?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">TvE</media:title>
		</media:content>
	</item>
		<item>
		<title>Top reasons Amazon EC2 instances disappear</title>
		<link>http://blog.rightscale.com/2008/02/02/top-reasons-amazon-ec2-instances-disappear/</link>
		<comments>http://blog.rightscale.com/2008/02/02/top-reasons-amazon-ec2-instances-disappear/#comments</comments>
		<pubDate>Sat, 02 Feb 2008 09:09:47 +0000</pubDate>
		<dc:creator>Thorsten</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[failure]]></category>

		<guid isPermaLink="false">http://blog.rightscale.com/2008/02/02/top-reasons-amazon-ec2-instances-disappear/</guid>
		<description><![CDATA[(Judging by the posting gap, the author of this blog almost disappeared too! Time to lift the head from the day-to-day scramble and write the next entry!) The fact that Amazon says up-front that computers fail seems to be the &#8230; <a href="http://blog.rightscale.com/2008/02/02/top-reasons-amazon-ec2-instances-disappear/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=11&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>(Judging by the posting gap, the author of this blog almost disappeared too! Time to lift the head from the day-to-day scramble and write the next entry!)</p>
<p>The fact that Amazon says up-front that computers fail seems to be the number one concern and criticism of EC2, specially from people who have not used it extensively. I don&#8217;t actually spend much time thinking about that because in our experience it&#8217;s not something to worry about. It&#8217;s essential to take into account when designing a system: whenever we set something up on a machine we immediately think &#8220;and what do we do when it fails?&#8221; That&#8217;s good thing, not a bad thing as anyone with production datacenter experience can attest.</p>
<p>Since it&#8217;s such a hot topic, I&#8217;ve been keeping a close eye on all the &#8220;my instance disappeared&#8221; threads on the EC2 forum, and it&#8217;s not easy to sort them out. I have no doubt that the vast majority has to do with operator error:</p>
<ul>
<li>trying SSH and forgetting to open port 22 in the security group (or similarly with other ports)</li>
<li>having difficulties with the SSH keys, or forgetting to set-up a key to begin with</li>
<li>using/constructing an AMI that does not have SSH properly set-up</li>
<li>using/constructing an AMI that does not boot properly (network and/or sshd issues) and failing to look into console output</li>
<li>instance reboot failing, for example disk mounts failing due to mount point changes that were not reflected into init scripts</li>
<li>sshd killed by kernel out-of-memory reaper, failing to look into console output for diagnosis</li>
<li>&#8230; and many more</li>
</ul>
<p>Some of these are beginners failing to read the getting started guide, some are more subtle and can happen even to veteran EC2 users. Then there are emails from Amazon saying &#8220;we have noticed that one or more of your instances are running on a host degraded due to hardware failure&#8221; and I wonder how many users don&#8217;t get these emails because their AWS account&#8217;s email address points into a bit bucket.</p>
<p>No doubt there are real failures as well where a host dies and takes the instances with it, or one of the disks used by an instance gives up which is the end of that instance. The question here is how frequent this is relative to the total number of instances running, and since Amazon is so secretive with their numbers it&#8217;s really difficult to make even an educated guess. I tried to go back into our year of logs to see whether I could estimate the failure rate, but I don&#8217;t have enough data to distinguish  failure from shutdown, sigh.</p>
<p>The failures that concern me the most are actually not instance failures but network failures. Anyone having set-up a large datacenter will know that network issues are the most difficult to get under control. The damn network just keeps changing, and as soon as you try to hold still your service providers change stuff. Some of the instance disappearances are really network issues that cause an instance to be unreachable, or unreachable from certain other instances. These are hard to troubleshoot and on more than one occasion I&#8217;ve had to run tcpdump on both ends to see packets departing and never arriving. If I can get to the target instance at all to run tcpdump, that is&#8230; I hope Amazon gets a better handle onto this type of failure and provides us with better troubleshooting tools. In the meantime, it&#8217;s important to flag issues to them so they can troubleshoot and eliminate the root causes.</p>
<p>The really good news is that the Amazon folks are very dedicated to figuring out what&#8217;s going wrong and fixing it. So if you have an issue, be sure to do the troubleshooting you can, then set the instance aside, launch a new one to take its place, and post all the details on the forum. Shut the instance down only after the issue is looked-at. Looking back, there were two big issues causing instance termination, one was the day where some EC2 front-end explicitly terminated a bunch of instances by error. Not good, but from what we saw it wasn&#8217;t a massive failure either. They clearly have done their best to ensure this doesn&#8217;t recur. The other was an instance reboot bug which caused many instances to die in the reboot process. We learned not to reboot ailing instances but instead to relaunch and rescue any data. This issue also seems to be fixed at this point.</p>
<ul>
<li>To summarize, if you can&#8217;t reach an instance, here is what you should do:try to SSH, check the security group</li>
<li>distinguish SSH timeout from key issues (timeout vs. permission denied type of errors)</li>
<li>use ping to test connectivity (enable ICMP if you have the bad habit of disabling it)</li>
<li>check the console output (use the convenient button on the RightScale dashboard), note that it can take a few minutes for stuff to appear</li>
<li>look at the RightScale monitoring to see whether the instance is still sending monitoring data</li>
<li>hop onto an instance in the same security group and try connecting from there (launch an instance if you don&#8217;t have any)</li>
<li>post details (instance ID, what you&#8217;ve tried, symptoms observed) on the forum and set instance aside</li>
</ul>
<p>All in all, the number one lesson is &#8220;relaunch!&#8221; There are thousands of instances waiting to be utilized so use a fresh one if you see trouble with an existing one. If you master this step you can use it in so many situations: to scale up, to scale down, to handle instance failure, to handle software failure, to enable test set-ups, etc. If you use RightScale you will notice that that&#8217;s also what we focus on: making it easier to launch fresh instances.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/rightscale.wordpress.com/11/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/rightscale.wordpress.com/11/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/11/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/11/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/11/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/11/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/11/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/11/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/11/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/11/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/11/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/11/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/11/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/11/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/11/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/11/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=11&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2008/02/02/top-reasons-amazon-ec2-instances-disappear/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0b359e9ce1a8fb5f3315fbec8beda697?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">TvE</media:title>
		</media:content>
	</item>
		<item>
		<title>Deploying many Rails sites onto Amazon EC2</title>
		<link>http://blog.rightscale.com/2007/10/01/deploying-many-rails-sites-onto-amazon-ec2/</link>
		<comments>http://blog.rightscale.com/2007/10/01/deploying-many-rails-sites-onto-amazon-ec2/#comments</comments>
		<pubDate>Mon, 01 Oct 2007 10:40:46 +0000</pubDate>
		<dc:creator>Thorsten</dc:creator>
				<category><![CDATA[AWS]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[Cluster]]></category>
		<category><![CDATA[MySQL]]></category>
		<category><![CDATA[S3]]></category>

		<guid isPermaLink="false">http://blog.rightscale.com/2007/10/01/deploying-many-rails-sites-onto-amazon-ec2/</guid>
		<description><![CDATA[One of our customers is deploying many Rails sites onto EC2, more precisely, many instances of virtually the same site. Basically they have a Rails application and they tweak it for each individual site they set-up. EC2 is a wonderful &#8230; <a href="http://blog.rightscale.com/2007/10/01/deploying-many-rails-sites-onto-amazon-ec2/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=20&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>One of our customers is deploying many Rails sites onto EC2, more precisely, many instances of virtually the same site. Basically they have a Rails application and they tweak it for each individual site they set-up. EC2 is a wonderful deployment platform for this type of business because there is very little friction in adding customers since it takes just one button press to get more servers.</p>
<p>The overall architecture concept we&#8217;re using for this customer is to build a number of app+database clusters and to load multiple sites onto each one. The number of sites per cluster can be adjusted such that the database portion of each cluster is loaded up optimally, and it&#8217;s designed such that sites can be moved around easily, for example to offload a cluster that may have become too heavily loaded as some of the sites on it have grown.</p>
<p>In the end, the architecture boils down to having two instances running a mysql master/slave set-up managed by our <a href="http://blog.rightscale.com/2007/08/20/redundant-mysql-set-up-for-amazon-ec2/">Manager for MySQL</a> plus two instances running load balancers and Rails/Mongrel as redundant app servers. This makes for a fully redundant cluster on which a number of sites can be hosted. It is also easy to add a few more EC2 instances running Rails depending on the Rails vs. MySQL workload balance.</p>
<p>Each site on a cluster has its own logical database (i.e. what MySQL calls a &#8220;database&#8221;), this makes it easy to backup and restore a site individually, and most importantly, to move a site to another cluster in order to free up resources on the original one. The sites on a cluster can also share the app servers as long as there is no HTTPS involved. The reason for this caveat is that each Amazon EC2 instance has only a single IP address and it is not possible to do &#8220;virtual hosting&#8221; with HTTPS sites. With HTTP all the www.site1.com, www.site2.com, etc. DNS entries point to the same two load balancing instances (using what&#8217;s called &#8220;round-robin DNS&#8221; for fail-over purposes) and the load balancer (or front-end Apache, if used) figures out which site the user is visiting based on the &#8220;host&#8221; header included in every HTTP 1.1 request.</p>
<p>What&#8217;s really nice about these 4-6 machine clusters is that they&#8217;re very powerful yet so simple. There&#8217;s no &#8220;infinitely scalable&#8221; magic under the hood that breaks at the worst moments. No, it&#8217;s a plain set-up that anyone with a bit of experience can fully understand. The magic is that it&#8217;s so easy to set these clusters up with Amazon EC2 plus RightScale so you can really take advantage of the same &#8220;horizontal scaling&#8221; as the big guys (Google, Yahoo!, etc.).</p>
<p>One of the interesting design decisions with all this is how to set-up DNS. For example, the app for site1 needs to locate the IP address of the database it&#8217;s supposed to talk to. We use DNS as follows:</p>
<ul>
<li> the app connects to db-master.site1.com</li>
<li>db-master.site1.com resolves to a CNAME for db-master-cluster3.company.com</li>
<li>db-master-cluster3.company.com resolves to the IP address of the instance that currently hosts the master</li>
<li>DNS for db-master-cluster3.company.com is set-up with a low TTL (we use 75 secs) and supports dynamic updates</li>
<li>if the DB master crashes or is otherwise replaced the db-master-cluster3.company.com DNS entry is automatically updated by  the RightScale MySQL manager, which switches all the sites hosted by that cluster over with one stroke</li>
<li>if site1 is moved to different cluster, then the CNAME has to be updated to point to the correct cluster DB</li>
</ul>
<p>For the web sites themselves its also nice to use CNAMEs:</p>
<ul>
<li>www.site1.com points to www-cluster3.company.com</li>
<li>www-cluster3.company.com resolves to the IP addresses of all the load balancer instances</li>
<li>if a load balancer instance is restarted, the www-cluster3.company.com entry is dynamically updated</li>
<li>if the site is moved to different cluster, the CNAME needs to be updated</li>
</ul>
<p>Wow, it&#8217;s amazing all this is actually possible and not just a dream! Amazon EC2 enables it and RightScale makes it possible to manage without an army of sysadmins running around and tweaking servers all the time.</p>
<hr />
<h3>Archived Comments</h3>
<p><a href="http://giantrobots.thoughtbot.com/">Dan Croak</a><br />
Do you have one of your pretty graphics to go along with this setup? For all us visual learners out there… <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p><b>Sushi</b><br />
A script/tutorial/template like “rails-on-ec2-standard” post will be of great help.</p>
<p><b>Thorsten</b><br />
Dan – good suggestion, I’ll pull out my brushes and paints…</p>
<p>Sushi – I fear this is all still a little too new to condense into a tutorial. It’s one thing to set everything up for one site, but another story altogether to figure out a solution that covers many different sites. What is needed is a design pattern that can be used in many slightly different situations and so it needs to be adaptable while also keeping the core that makes it tick constant. We’re not quite there yet.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/rightscale.wordpress.com/20/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/rightscale.wordpress.com/20/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/rightscale.wordpress.com/20/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/rightscale.wordpress.com/20/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/rightscale.wordpress.com/20/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/rightscale.wordpress.com/20/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/rightscale.wordpress.com/20/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/rightscale.wordpress.com/20/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/rightscale.wordpress.com/20/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/rightscale.wordpress.com/20/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/rightscale.wordpress.com/20/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/rightscale.wordpress.com/20/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/rightscale.wordpress.com/20/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/rightscale.wordpress.com/20/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/rightscale.wordpress.com/20/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/rightscale.wordpress.com/20/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.rightscale.com&amp;blog=2909729&amp;post=20&amp;subd=rightscale&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.rightscale.com/2007/10/01/deploying-many-rails-sites-onto-amazon-ec2/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0b359e9ce1a8fb5f3315fbec8beda697?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">TvE</media:title>
		</media:content>
	</item>
	</channel>
</rss>
