Apache Hadoop

Presenting Splout Cloud

We have created Splout Cloud, a web-latency managed service in the AWS cloud. Splout Cloud converts any data files – regardless of their size – into a...

0 replies - 512 views - 04/07/13 by Pere Ferrera Bertran in Articles

Accelerating Big Data with Hadoop (HDFS, MapReduce, and HBase)

In this video, D.K. Panda from Ohio State University presents: Accelerating Big Data with Hadoop (HDFS, MapReduce and HBase) and Memcached. "The SuperMUC...

0 replies - 1882 views - 04/02/13 by Eric Genesky in Articles

Why You Should Care About the Apache Hadoop Ecosystem

In March 2013, I was at 33rd Degree - "A Conference for Java Masters." I had two talks, including a new one: "You are not Facebook or Google? Why you...

0 replies - 1294 views - 03/14/13 by Kai Wähner in Articles

Hadoop Developer - WordCount tutorial using Maven and NetBeans 7.3RC2

I have adapted the WordCount tutorial to Maven-based development, as this is probably the most popular way to develop in companies. I am not going to...

0 replies - 1199 views - 02/13/13 by Armel Nene in Articles

Hadoop Hangover: Launch a Hadoop Cluster CDH4 Using Apache Whirr

This post is about how to launch a CDH4 MRv1 or CDH4 YARN cluster on EC2 instances. It's said that you can launch a cluster with the help of Whirr and in a...

0 replies - 1367 views - 02/12/13 by Swathi Venkatachala in Articles

Testing MapReduce with MRUnit

Testing and debugging multithreaded programs is hard. Now take the same programs and massively distribute them across multiple JVMs deployed on a cluster of...

0 replies - 2198 views - 02/05/13 by Muhammad Ashraf in Articles

Starfish: A Hadoop Performance Tuning Tool

It's been a long time since I've blogged... a lapse of 3-4 months or so... :( Well, I thought of writing about an awesome tool for performance tuning in...

0 replies - 2147 views - 11/25/12 by Swathi Venkatachala in Articles

Innovation and Big Data in Corporations: A Roadmap

Big Data is all about technology and business model innovation. Why? Because a lot of next-generation business models are data-centric. Almost all...

0 replies - 2334 views - 09/10/12 by Ravi Kalakota in Articles

Berkeley Researchers Highlight Emergence of In-Memory Processing

Researchers at the University of California, Berkeley released an excellent paper recently, analyzing data from the Hadoop installation at Facebook -- one...

0 replies - 1658 views - 09/05/12 by Nikita Ivanov in Articles

Big Data: Enterprise Hype or the Future of Enterprise?

Of the myriad terms that the tech industry throws around at the moment, none is as often subverted for marketing spin as “big data”. So much so...

1 reply - 2220 views - 08/15/12 by Ben Kepes in Articles

HBase 0.92, DataStax Community Server Among NoSQL Releases

Hortonworks Data Platform (HDP) ver 2: Now leveraging Apache Hadoop 0.23, this public preview release includes: Next-generation MapReduce Architecture adds...

0 replies - 1863 views - 01/24/12 by Chris Smith in Articles

Google Claims MapReduce Patent

You may not have noticed that Google applied for a patent on its MapReduce software framework in 2004. Recently, the "Don't Be Evil" company...

2 replies - 5551 views - 01/20/10 by Mitch Pronschinske in News

Grid Engine an Early Supporter of Hadoop Apps

Sun Microsystems' Grid Engine was recently updated with plenty of new features, including an industry first, they say. Grid Engine 6.2 update 5 (SGE 6.2u5) just...

0 replies - 8170 views - 01/14/10 by Mitch Pronschinske in News

Apache Mahout Tackles A.I.

Artificial intelligence is a term frequently associated with science fiction, not software development. However, A.I. is becoming increasingly viable as a...

3 replies - 11087 views - 11/30/09 by Mitch Pronschinske in News