- Think big: Oregon Health & Science University is using big data to speed up analysis of human genome profiles (approximately 1 TB per patient). What if sequencing became commonplace instead of rare, and had to be done 5,000 times per day?
- Find relevant data for the business
- Be flexible: Iterate; don’t try to build the final system in the first release. Hadoop does not require a schema to be defined before data is stored, which supports this model. Pecan Street Inc., in Austin, Texas, is on the third iteration of a system that collects smart grid energy data, partly because energy meters have become more advanced and provide additional data points.
- Connect the dots: Intel uses manufacturing data to change its design process so that future manufacturing processes will become more efficient.
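
The "Be flexible" point can be made concrete with a minimal sketch of reading schema-free data, where the list of fields is chosen at read time rather than when the data is stored. This is plain Python rather than Hadoop code, and the record layout, field names (`meter_id`, `kwh`, `voltage`, `power_factor`), and the `read_readings` helper are all hypothetical, chosen only to mimic meters that report more data points over time.

```python
import json

# Hypothetical raw meter readings, stored as-is (one JSON object per line).
# Field names are illustrative assumptions, not any real meter format.
RAW_LINES = [
    '{"meter_id": "m1", "kwh": 1.2}',
    '{"meter_id": "m2", "kwh": 0.8, "voltage": 239.5}',
    '{"meter_id": "m3", "kwh": 1.5, "voltage": 241.0, "power_factor": 0.97}',
]


def read_readings(lines, fields):
    """Apply a field list at read time; fields a record lacks become None."""
    for line in lines:
        record = json.loads(line)
        yield {name: record.get(name) for name in fields}


# A first iteration only looks at energy use.
v1 = list(read_readings(RAW_LINES, ["meter_id", "kwh"]))

# A later iteration asks for newer data points over the same stored data.
v3 = list(read_readings(RAW_LINES, ["meter_id", "kwh", "voltage", "power_factor"]))
```

Older records simply yield `None` for fields they never carried, so each iteration can widen the field list without rewriting what was already collected.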