Data Quality Magic
In previous posts I explained that, at least with regard to data quality, there are no magic beans, tooth fairies,…
Auto-correlation for time series analysis
Recently, I was reading the EPFL magazine and was surprised to see an article where they interviewed my master thesis…
Truly Distributed Analytics
The growth and success of Hadoop is very interesting. It is emerging as a highly significant technology for the data…
Data Quality is not an Act, it is a Habit
The Second Law of Data Quality states that it is not a one-time project, but a sustained program. Or to…
Some NoSQL Myths
I have been busy travelling recently but thought I would jot down a couple of NoSQL myths that are fresh…
How to analyze unfamiliar data: circle, dive, and riff
When you come face to face with unfamiliar data, how do you proceed? How do you avoid sending yourself and…
An app ecosystem for health
Facebook has some 550,000 apps, according to the Wall Street Journal. Among the most popular are Farmville, a virtual farm…
Trust is not a checklist
This is my seventh blog post tagged Karma since I promised to discuss it directly and indirectly on my blog…
Visualization Methods
I thought this was worth sharing: the Periodic Table of Visualization Methods. It shows some good examples, and some not so good…