• submit to reddit
Marko Rodriguez01/14/13
4005 views
0 replies

On Graph Computing

To the adept, graph computing is not only a set of technologies, but a way of thinking about the world in terms of graphs and the processes therein in terms of traversals.

Eric Genesky01/14/13
1424 views
0 replies

All-SQL, No-SQL and SomeSQL

This speaker discusses the various database options his team evaluated or operated in order to meet their needs, which included storing tens of billions of points as time-series.

Jason Whaley01/14/13
2106 views
0 replies

Let's Put an End to Unix Editor Snobbery

"Whoever defined nano as the default crontab editor for Ubuntu deserves a whipping with a rusty chain."

Arthur Charpentier01/14/13
2350 views
0 replies

R for Actuarial Science

As mentioned in the Appendix of Modern Actuarial Risk Theory, “R (and S) is the ‘lingua franca’ of data analysis and statistical computing, used in academia, climate research, computer science, bioinformatics, pharmaceutical industry, customer analytics, data mining, finance and by some insurers.

Scott Leberknight01/14/13
2077 views
0 replies

Limiting Joins in Apache Hive

Working with large datasets in Hadoop / Hive works is difficult when you have an "imbalanced" join.

Todd Merritt01/14/13
1677 views
0 replies

Pirates, Treasure Chests and Architectural Mapping

Pirates are known for drawing treasure maps to their most prized possession. These documents detail the decisions pirates made in order to hide and find their chests of gold. As software engineers, programmers, and architects we need to treat software implementations much like our treasure chest.

Istvan Szegedi01/14/13
1852 views
0 replies

Cloudera Impala – Fast, Interactive Queries with Hadoop

As discussed in the previous post about Twitter’s Storm, Hadoop is a batch oriented solution that has a lack of support for ad-hoc, real-time queries.

Bill Bejeck01/14/13
2042 views
0 replies

Calculating a Co-Occurrence Matrix with Hadoop

A co-occurrence matrix could be described as the tracking of an event, and given a certain window of time or space, what other events seem to occur.

Mitch Pronschinske01/14/13
2057 views
0 replies

RabbitMQ: Turn it Up to 11

Pushing RabbitMQ's scaling limits at myYearbook.com has provided good insight in how to scale RabbitMQ clusters while avoiding RabbitMQ's common pitfalls.

Sasha Goldshtein01/13/13
1448 views
0 replies

Lap Around Roslyn CTP: Syntax Rewriting with Symbol Information

Last time around, we were replacing the 42 numeric literal with 43. This time let’s pretend to do something more useful.

Allen Coin01/13/13
4254 views
0 replies

Decomposing applications for scalability and deployability

In this talk we describe the limitations of a monolithic architecture. You will learn how to use the scale cube to decompose your application into a set of narrowly focused, independently deployable back-end services and an HTML 5 client.

Gaurav Mantri01/13/13
2882 views
0 replies

Azure Cloud Building Best Practices

This blog post will be focused on building Stateless PaaS Cloud Services utilizing Windows Azure Storage (Blobs/Queues/Tables) and Windows Azure SQL Databases (SQL Azure).

Eric Genesky01/13/13
1942 views
0 replies

A Security Flaw in Ruby on Rails

The official Ruby on Rails blog recently announced the release of two "extremely critical security fixes" for a major security flaw.

Simon Jackson01/13/13
2697 views
0 replies

XNA and Beyond, the notable mentions

Just to round up the overview part of this series where I’ve covered frameworks doing their level best to take the dream of XNA onward and available to more platforms, plus looking further into the future to...

Chris Travers01/13/13
1328 views
0 replies

An Awesome One-Liner for PostgreSQL

Recently I was looking at options for exploring CIDR blocks in PostgreSQL. In particular, I was wondering about checking a CIDR block for unallocated IP addresses in another table.