• submit to reddit
Gareth Rushgrove07/24/12
4189 views
0 replies

Hadoop Hive Web Interface

In theory at least, Hive provides a very nice, simple way of getting into analysing large data sets. To make it even easier to show other people what you’re up to Hive has a nascent web interface

Maarten Balliauw07/24/12
3088 views
0 replies

Hands-on Windows Azure Services for Windows

In this post, I’ll share you with some impressions, ideas, pains and speculations about Windows Azure Services for Windows Server

Anders Abel07/24/12
3090 views
0 replies

Code Generation – Blessing or Curse?

Code generation can be a blessing for getting something up and running quickly, but also a curse when maintaining code.

Luke Galea07/24/12
10906 views
0 replies

Graylog2 Optimization for High-Log Environments

Here we will highlight some of the measures that were necessary to further operationalize the system for production use. As our environment receives anywhere from 2,000 to 6,000 log messages per minute (with occasional peaks to 40,000!)

Will Soprano07/24/12
2559 views
0 replies

Riak in Production at Voxer

Voxer relies on Riak as their primary database for various services. They have more than 50 machines dedicated to Riak to support their huge growth and user base, and this has made for an adventure in scaling.

Mark Needham07/24/12
2603 views
0 replies

Thoughts on Software Development Neo4j: Shortest Path With and Without Cypher

I was looking back at some code I wrote a few months ago to query a neo4j database to find the shortest path between two people via the colleagues relationships that exist.

John Esposito07/24/12
11376 views
0 replies

OCR Helps Data Management More Than You Think

IDC estimates that data will grow 50-fold by 2020, while the number of information containers will multiply by 75. Unfortunately, a lot of that data is still locked away in physical documents; and as a developer in the age of big data, you'll be the one expected to make that data useful. Here's how to make it work.

Mitch Pronschinske07/24/12
3690 views
0 replies

Integrating Oracle RIB to BizTalk Server

This blog entry will discuss how to integrate Oracle RIB to BizTalk Server using the JMS adapter, but first it might be worthwhile to understand the particular problems with integrating stand-alone JMS clients.

Dhananjay Nene07/24/12
3233 views
0 replies

Even After AWS Went Down, the Cloud Got Stronger

So some parts of the AWS EC2 specifically related to EBS were non responsive or down yesterday.

Leigh Shevchik07/23/12
11861 views
0 replies

The Tools we Use (and Love) at New Relic

These are the tools in New Relic's stack (mostly open source) that have specifically allowed us to scale to serve 16,000 active accounts with over 38 Billion metrics being reported to us each day!

Lijin Joseji07/23/12
5883 views
0 replies

The Second Level Cache in Hibernate: Settings and Configurations

A Hibernate Session is a transaction-level cache of persistent data. We can configure a cluster or JVM-level (SessionFactory-level) cache on a class-by-class and collection-by-collection basis. We can also plug in a clustered cache into Hibernate.

Stoimen Popov07/23/12
13722 views
0 replies

Algorithm of the Week: Balancing a Binary Search Tree

The binary search tree is a very useful data structure, where searching can be significantly faster than searching into a linked list. However in some cases searching into a binary tree can be as slow as searching into a linked list and this mainly depends on the input sequence.

Mitch Pronschinske07/23/12
5184 views
0 replies

Spin Up New VMs in Their Own Virtual Network - A New FOSS Tool

Terraform is a flexible tool made available under the Apache 2.0 license that makes it easy to define, instantiate and manage environments. Terraform integrates with existing cloud providers. Today, Amazon EC2 and VMWare vSphere are supported.

Joe Stein07/23/12
2944 views
0 replies

Faster Datanodes With Less Wait IO in Hadoop

I have noticed often that the check Hadoop uses to calculate usage for the data nodes causes a fair amount of wait io on them driving up load.

Jim O' Neil07/23/12
1825 views
0 replies

How to Configure Admin Access to the Couchbase Cluster on Azure

Getting Couchbase working on Azure inspired Jim O'Neil to write a series on the topic - here's how to configure admin access.