ApacheCon North America 2014 • April 7-9 • Denver, CO
Tuesday, April 8 • 1:30pm - 2:20pm
Data cubes in Apache Hive


This talk presents a system developed at InMobi to support data cubes on top of the Hive metastore and the Hive Query Language. In its current state, the Hive metastore lets users represent structured data as simple tables, but it cannot express relationships or richer data warehouse (DWH) concepts such as facts and dimensions. With Hive data cubes, users can query data stored in HDFS, S3, Redshift and other stores with a single query language and schema. Underlying execution engines such as Hive, Impala and Shark can be plugged in and used at run time, and the choice of engine is transparent to the user. The system gives users a unified logical schema consisting of cubes, facts and dimensions. Users can issue queries at a conceptual level without knowing about roll-up intervals, partitions, data types, underlying storage or table relationships; the system resolves these automatically.

Speakers

Jaideep Dhok

Software Engineer, InMobi
Jaideep Dhok is a Software Engineer on the Platform team at InMobi, where he builds systems that support analytics and works on Apache Hive. Before joining InMobi he worked as a contractor for Credit Suisse in Singapore, where he worked on the APAC regulatory...

Amareshwari Sriramadasu

Architect, InMobi
Amareshwari is an Architect on the data team at InMobi, where she works on Hadoop and related projects for data collection and analytics. She is a member of the ASF, the Apache Incubator PMC, the Apache Hadoop PMC, the Apache Lens PMC and the Apache Falcon PMC, and is an Apache Hive committer...


Tuesday April 8, 2014 1:30pm - 2:20pm PDT
Confluence C
