Cloud (IaaS) & Big Data

HaaS Provider Qubole Now Runs on Google Compute Engine (GCE)

Posted on December 29, 2013

I’m starting to see applications ported from AWS to GCE, but I’m not sure what justifies running production systems on GCE. Maybe price?

source:

http://www.infoq.com/news/2013/12/qubole-on-gce

Posted in AWS, GCE, HaaS

Tagged infoq.com

Example use case for Hadoop and Machine to Machine (M2M) data

Posted on December 25, 2013

Telecom OEM WebNMS discusses its use of Hadoop. In one trial, they stored latency data from 7 million cable modems. Using a 20-node Hadoop cluster, they observed a tenfold increase in performance compared to a relational database. In addition, the cost to deploy was a small fraction of that of a traditional infrastructure.

Source:

http://www.billingworld.com/blogs/baker/2013/12/hadoop-m2m-meet-device-and-network-management-sys.aspx

Posted in IoT, M2M, performance, Relational DB, Use Case

Tagged billingworld.com

Fast Search and Analytics on Hortonworks with Elasticsearch

Posted on December 24, 2013

Elasticsearch enables real-time search and analytics. YARN is supported. Integration extends into Hive and Pig.

Source:

Posted in Analytics, Hive, HortonWorks, Pig, Realtime, Yarn

Tagged elasticsearch.com, hortonworks.com

Kiji Project enables development of real-time analytics on Hadoop

Posted on December 24, 2013

An open source framework for collecting and analyzing data for real-time applications such as energy usage and fraud monitoring.

Source:

http://www.kiji.org

Posted in Analytics, Frameworks, Kiji, Realtime

Tagged kiji.org

WibiEnterprise bridges between Hadoop and the application layer

Posted on December 8, 2013

The core features of WibiEnterprise 3.0 are frameworks that enable:

defining schemas in realtime
layer on top of MapReduce
model lifecycle (machine learning, batch training, development, scoring)
ad hoc queries
RESTful interfaces

Source:

Posted in Frameworks, MapReduce, REST

Tagged hispanicbusiness.com, wibidata.com

Interesting use case about migrating away from SQL to Hadoop and NoSQL

Posted on December 7, 2013

Paytronix analyzes data from 8,000 restaurants, which adds up to a few tens of terabytes. That is not especially demanding in terms of volume, but there are a lot of data fields and potential reports. They migrated from MS SQL Server and constantly evolving ETL jobs to Hadoop and MongoDB with a lot of success.

source:

http://www.informationweek.com/software/information-management/making-the-case-for-hadoop-variety-not-volume/d/d-id/1112894

Posted in database, mongodb, NoSQL, Relational DB, SQL, Use Case

Don’t run Hadoop on a SAN

Posted on December 7, 2013

By definition, a SAN is about consolidating data and Hadoop is about distributing data. Can they co-exist? Not according to this article.

If you take data out of a Hadoop node and put it on a SAN, you’re reducing performance. You want data to transfer to the CPU at bus speed, not network speed. A heavy Hadoop load could also saturate your network.
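
To put rough numbers on the bus-speed versus network-speed point, here is a minimal back-of-envelope sketch. Every figure in it (dataset size, node count, disks per node, per-disk throughput, SAN link speed) is an assumption chosen for illustration and does not come from the article; the function names are hypothetical as well.

```python
# Back-of-envelope comparison: scanning a dataset from node-local disks
# versus pulling it through one shared SAN link.
# Every number below is an illustrative assumption, not a measurement.


def scan_seconds(dataset_gb: float, throughput_gb_per_s: float) -> float:
    """Time to read dataset_gb at a sustained aggregate throughput."""
    if throughput_gb_per_s <= 0:
        raise ValueError("throughput must be positive")
    return dataset_gb / throughput_gb_per_s


def local_aggregate_throughput(nodes: int, disks_per_node: int,
                               disk_gb_per_s: float) -> float:
    """Each node reads its own blocks from its own disks, so throughput adds up."""
    return nodes * disks_per_node * disk_gb_per_s


def shared_link_throughput(link_gbit_per_s: float) -> float:
    """All nodes share one link; convert gigabits/s to gigabytes/s."""
    return link_gbit_per_s / 8.0


DATASET_GB = 10_000          # assumed: 10 TB of data to scan
NODES = 20                   # assumed cluster size
DISKS_PER_NODE = 4           # assumed
DISK_GB_PER_S = 0.1          # assumed: about 100 MB/s per disk
SAN_LINK_GBIT_PER_S = 10.0   # assumed: a single shared 10 Gbit/s link

local = scan_seconds(
    DATASET_GB,
    local_aggregate_throughput(NODES, DISKS_PER_NODE, DISK_GB_PER_S),
)
san = scan_seconds(DATASET_GB, shared_link_throughput(SAN_LINK_GBIT_PER_S))

print(f"local disks: {local:.0f} s, shared SAN link: {san:.0f} s")
```

Under these assumed figures the shared link, not the disks, becomes the bottleneck. Real SANs often have multiple paths and caching, so treat this only as a way to reason about the argument, not as a benchmark.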

source:

http://www.infoworld.com/d/application-development/never-ever-do-hadoop-232090

Posted in hardware, Network, performance

Tagged infoworld.com

Big Data as a Service provider has free developer account

Posted on December 7, 2013

The founders of Qubole built some of the big data technology at Facebook (scaled to 25 petabytes). Their new company offers a hosted Hadoop infrastructure. Interestingly, the small and free accounts take the IT configuration out of learning Hadoop.

Source:

http://www.qubole.com/features/

Posted in cloud, Facebook, hadoop, tutorial

Tagged qubole.com

Two-part article on Hello World for Hadoop

Posted on December 4, 2013

Source:

Posted in tutorial

Tagged packetpushers.net

Summary of Teradata’s big data approach

Posted on December 4, 2013

Teradata Aster 6 platform
Includes graph analysis engine (visualization), in addition to traditional rows/columns.
Enables execution of SQL across multiple NoSQL repositories
Integrates with multiple third parties for solutions such as analytical workflow (Alteryx) and advanced analytics algorithms (Fuzzy Logix).
Cloud services at comparable cost to on-premises

Source

http://www.information-management.com/blogs/teradata-establishes-trust-in-big-data-technology-10025111-1.html

Posted in cloud, NoSQL, SQL, Teradata, visualization

Tagged information-management.com
