Agile Cat — in the cloud

January 5, 2010

Six ScaleOut-Related Articles That Have Really Caught My Eye

Filed under: MapReduce,NoSQL — Agile Cat @ 8:42 am

There is never enough time, no matter how much you have!

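I want to read them but can never find the time, and my bookmarks keep swelling... Among them, the following six look especially interesting, and I would like to get to them somehow. That said, I do not want to let too much time pass, so I am posting the titles and URLs here.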

7 Tips for Improving MapReduce Performance
http://www.cloudera.com/blog/2009/12/17/7-tips-for-improving-mapreduce-performance/

Tip 1) Configure your cluster correctly
Tip 2) Use LZO Compression
Tip 3) Tune the number of map and reduce tasks appropriately
Tip 4) Write a Combiner
Tip 5) Use the most appropriate and compact Writable type for your data
Tip 6) Reuse Writables
Tip 7) Use “Poor Man’s Profiling” to see what your tasks are doing

11 Strategies to Rock Your Startup’s Scalability in 2010
http://highscalability.com/blog/2010/1/4/11-strategies-to-rock-your-startups-scalability-in-2010.html?utm_source=feedburner&utm_medium=feed&utm_campaign=Feed%3A+HighScalability+%28High+Scalability%29&utm_content=Google+Feedfetcher

1) Scale Out – Not Up
2) Use Databases Appropriately
3) Soar Through the Clouds
4) Goldfish not Thoroughbreds
5) Simplify, Simplify, Simplify
6) Be the Master of Your Own Destiny!
7) Learn Aggressively
8) Communicate Asynchronously As Much As Possible
9) Hire The Best People
10) D-I-D Approach for Scalability
11) Design with Fault Isolative “Swim Lanes”

Clearing up MapReduce confusion, yet again
http://www.dbms2.com/2009/12/30/clearing-up-mapreduce-confusion-yet-again/

• MapReduce was named and popularized — but not invented — by Google.
• “MapReduce” variously refers to:
• In particular, Hadoop is a MapReduce execution engine that includes or
• MapReduce and analytic DBMS

Building a distributed concurrent queue with Apache ZooKeeper
http://www.cloudera.com/blog/2009/05/28/building-a-distributed-concurrent-queue-with-apache-zookeeper/

ZooKeeper – A Reliable, Scalable Distributed Coordination System 
http://highscalability.com/blog/2008/7/15/zookeeper-a-reliable-scalable-distributed-coordination-syste.html

Cloud / VPS Apache Performance Comparison
http://chadkeck.com/2009/12/cloud-vps-apache-performance-comparison/

-----

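Watching tags such as #Hadoop, #MapReduce, and #NoSQL on Twitter, I see this kind of information flying around and feel a truly huge gap. I still have not translated the second half of the Observer and ZooKeeper piece, and I have my regular work to do as well, so I am starting the new year so busy that I would gladly borrow even a cat's paw (?) --- A.C.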

 
