Tuesday, April 8 • 10:30am - 11:20am
Apache Streams - Simplifying Real-Time data integration

Interest in analyzing the real-time web has reached a fever pitch among academics and corporate executives. Researchers and professionals tasked with capturing and analyzing high volumes of real-time social data have a plethora of open-source databases and machine learning libraries to choose from, but they often spend a large fraction of their time writing code for (or manually performing) ingestion, cleansing, normalization, and data management.

Apache Streams seeks to break these problems down into self-contained modules based on simple interfaces and to foster a community-based approach to connecting and harmonizing data sources and services. Implementers can compose a data workflow from Streams components and run it in real-time or batch mode, using a variety of storage services (Kafka, HDFS, Cassandra, etc.) and execution engines (Tomcat, Storm, Amazon Kinesis, etc.).
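
As a rough illustration of the "self-contained modules based on simple interfaces" idea, the sketch below chains ingestion, cleansing, and normalization stages into one workflow. It is written in plain Python and does not use the Apache Streams API; every name in it (provider, cleanse, normalize, run_pipeline) is hypothetical and chosen only for this example.

```python
from typing import Callable, Dict, Iterable, Iterator, List

# All names here are hypothetical; this is NOT the Apache Streams API.
Record = Dict[str, str]
Stage = Callable[[Iterable[Record]], Iterator[Record]]


def provider(raw_posts: Iterable[str]) -> Iterator[Record]:
    """Ingestion: wrap each raw post in a record."""
    for text in raw_posts:
        yield {"text": text}


def cleanse(records: Iterable[Record]) -> Iterator[Record]:
    """Cleansing: strip whitespace and drop records left empty."""
    for record in records:
        text = record["text"].strip()
        if text:
            yield {**record, "text": text}


def normalize(records: Iterable[Record]) -> Iterator[Record]:
    """Normalization: lowercase the text field."""
    for record in records:
        yield {**record, "text": record["text"].lower()}


def run_pipeline(source: Iterable[Record],
                 stages: List[Stage],
                 sink: List[Record]) -> List[Record]:
    """Chain the stages over the source and append results to the sink."""
    stream: Iterable[Record] = source
    for stage in stages:
        stream = stage(stream)
    for record in stream:
        sink.append(record)
    return sink


stored = run_pipeline(
    provider(["  Hello World ", "", "Apache STREAMS"]),
    [cleanse, normalize],
    [],
)
print(stored)
```

Because each stage only consumes and produces an iterable of records, the same chain can be fed a finite list (batch) or an open-ended generator (streaming) without changing the stages themselves.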

Speakers
Steve Blackmon

VP Technology, People Pattern, Inc.
VP Technology at People Pattern, previously Director of Data Science at W2O Group, co-founder of Ravel, with stints at Boeing, Lockheed Martin, and Accenture. Committer and PMC member for Apache Streams (incubating). Experienced user of Spark, Storm, Hadoop, Pig, Hive, Nutch, Cassandra, TinkerPop...


Tuesday April 8, 2014 10:30am - 11:20am PDT
Lawrence A
