Building a Supercomputer in AWS: Is it even worth it ?

The fact that Cray is still around is mind boggling. You’d think that commodity hardware and network technologies have long made supercomputing affordable for anyone interested. And yet, Cray Sells One of the World’s Fastest Systems: “This, to IDC’s knowledge, is the largest supercomputer sold into the O&G sector and will be one of the biggest in any commercial market,” the report stated. “The system … Continue reading Building a Supercomputer in AWS: Is it even worth it ?

Ordered Sets and Logs in Cassandra vs SQL

I’ve written before that Cassandra’s achilles’ heel is devops: Storage, redundancy and performance are expanded by adding more nodes. This can happen during normal business hours as long as consistency parameters are met. Same applies to node replacements. As the number of servers grows be prepared to hire a devops army or look for a managed solution. Datastax offering helps but still not enough. Even … Continue reading Ordered Sets and Logs in Cassandra vs SQL

Finding Unused Elastic Load Balancers

AWS imposes limits on the number of Elastic LoadBalancers. Before asking for a limit increase, it is worthwhile to check if your load balancers are actually used and have healthy instances. Using excellent boto framework for Python, I built a simple script to find all ELBs where there is an instance in OutOfService state or where there are no instances at all: from optparse import … Continue reading Finding Unused Elastic Load Balancers

Where AWS Elastic BeanStalk Could be Better

Amazon describes their AWS Elastic BeanStalk service as follows: AWS Elastic Beanstalk is an easy-to-use service for deploying and scaling web applications and services developed with Java, .NET, PHP, Node.js, Python, Ruby, Go, and Docker on familiar servers such as Apache, Nginx, Passenger, and IIS. You can simply upload your code and Elastic Beanstalk automatically handles the deployment, from capacity provisioning, load balancing, auto-scaling to … Continue reading Where AWS Elastic BeanStalk Could be Better

Trying to Replace Cassandra with DynamoDB ? Not so fast

In November last year I pointed out how tempted I was to replace Cassandra with DynamoDB. Since then I have done some research and things are not as straightforward as they may seem at first. I’d like to revisit my post and clarify a few things. On elasticity of Cassandra I said the following: Scaling a Cassandra cluster involves adding new nodes. Each additional node … Continue reading Trying to Replace Cassandra with DynamoDB ? Not so fast

Why I am Tempted to Replace Cassandra With DynamoDB

I have written about Cassandra in the past. I have been using Cassandra actively for the past three years, and I am one of the big advocates of technology out there. However, as I have pointed in this blog and on my Twitter page – if you plan on scaling Cassandra out, be prepared to recruit an army of Java developers to do devops. Cassandra … Continue reading Why I am Tempted to Replace Cassandra With DynamoDB

Infrastructure in the cloud vs on-premise

I made a comment on twitter saying that if you are still operating an on-premise data center in the second decade of the 21st century you are wasting a ton of money. I was talking specifically about AWS vs on-premise. I got some pushback on that assertion in the form of private messages. Here is the summary of the feedback I received: AWS only makes sense … Continue reading Infrastructure in the cloud vs on-premise