SQL compatibility in Hadoop and Hive
The Stinger initiative is a collection of development threads in the Hive community that will deliver 100X performance improvements as well as SQL compatibility.
|Fast Interactive Query|
An immediate aim of 100x performance increase for Hive is more ambitious than any other effort.
Based on industry standard SQL, the Stinger Initiative improves HiveQL to deliver SQL compatibility.
Apache Hive is the de facto standard for SQL-in-Hadoop today with more enterprises relying on this open source project than any alternative. As Hadoop gains in popularity, enterprise requirements for Hive to become more real time or interactive have evolved… and the Hive community has responded.
He spoke in detail about the Stinger initiative, who is contributing to it, why they decided to improve upon Hive and not create a new system and more.
He talked about how Microsoft is contributing in the open source community to improve upon Hive.ORC File and Vectorization – Hadoop Summit 2013 from Owen O’Malley
Hadoop is so much more than just SQL, one of the wonderful things about Big Data is the power it brings for users to bring different processing models such as realtime streaming with Storm, Graph processing with Giraph and ETL with Pig and all different things to-do beyond just this SQL compatibility.
(Note: Opinions expressed in this article and its replies are the opinions of their respective authors and not those of DZone, Inc.)