Facebook Owns the Fattest Elephant (Hadoop Cluster)
It's official! Facebook has the world's largest Hadoop cluster. Facebook's Datawarehouse Hadoop cluster became the largest in May with a system that has more than 21 petabytes of storage in a single HDFS cluster, with 2000 machines. As Facebook releases more information about its Hadoop cluster, it should help advance the data analytics field as well.
PayPal Jumps on the App Engine
Another treat was given to developers this month by PayPal. They just released an open source toolkit for Google App Engine that makes it easy to integrate Java apps on GAE with PayPal's Adaptive Payments API. They say a Python toolkit is coming next. Check out these instructions and examples for using the toolkit.
Python 2.7 Has Landed
The GA release of Python 2.7 is finally here after months of development. It is the last version that will have new features for a while. The next version of Python will focus on stability and language fixes. Many of the features planned for 3.x were back-ported to this release. New features in Python 2.7 include:
- An ordered dictionary type
- New unittest features including test skipping and new assert methods
- A much faster io module
- Automatic numbering of fields in the str.format() method
- Float repr improvements backported from 3.x
- Tile support for Tkinter
- A backport of the memoryview object from 3.x
- Set literals
- Set and dictionary comprehensions
- Dictionary views
- New syntax for nested with statements
- The sysconfig module
Outsourcing Doesn't Work
Management tells you it's a necessary evil, but this blogger argues against outsourcing. Thanks for the link, Amber Shah!