Adventures in Data Profiling (Part 1)
In my popular post Getting Your Data Freq On, I explained that understanding your data is essential to using it…
VectorWise
I was fortunate enough to speak with Marcin Zukowski earlier about VectorWise. If you missed it, VectorWise came out of…
Zero Latency: The Next Arms Race
In the near future, your company may be competing with a computer. In fact, companies with the fastest computers, most…
Good Data Warehouse DBAs are Hard to Find
As a consultant, I’m often asked how roles and responsibilities should be delegated or identified within the IT organization…
Reality Mining – Too Much Personalization?
What does your mobile phone usage say about you? Probably a lot more than you think. Mobile phone operators are…
It’s data, Jim, but not as we know it – Part 1: What the echo of the Big Bang tells us about the nature of information
Possibly I am just turning into a grumpy old man in my middle age, but there are two words that when…
#18: Here’s a thought…
An occasional series in which a review of recent posts on SmartData Collective reveals the following nuggets: Less is more. We live…
HadoopDB discussion with Daniel Abadi
I spoke to Daniel Abadi a few days ago about his HadoopDB announcement that came out recently. I am sure…
Predicting the next Viral Tweet
It is time to use Twitter data for another reason: Can Predictive Analytics be used to identify which tweets have…
Getting Your Data Freq On
One of the most basic features of a data profiling tool is the ability to generate statistical summaries and frequency…