Getting Your Data Freq On
One of the most basic features of a data profiling tool is the ability to generate statistical summaries and frequency…
Slides from OSCON
OSCON 2009 has been a blast so far, and I've really been enjoying the presentations and meeting people from different…
Scraping data from the Web with R
Sometimes the data we need isn't packaged up nicely into a simple comma-separated file or database. It's out there, but…
Accuracy
As might be inferred from my last post, certain sporting matters have been on my mind of late. However, as…
A Few Words on Behavioral Targeting
One can usually divide Online Advertising (OA) in different types. Here are some examples of OA types: Contextual targeting (Google…
Information Waves
Many years ago when I was a researcher in a European think-tank, I wanted to develop an automatic way to…
The Darker Side Of Analytics
I recently read an article in one of the major Dutch newspapers (warning - it's in Dutch!) about their Government's moves…
A long look at Stephen Few’s “Now You See It”
Stephen Few gave a snappy name to his new book, Now You See It, and a cover that signals a…
SIGIR: Meet the Who’s Who of Search and Information Retrieval
Matt Cutts. danah boyd. Bruce Croft. Marti Hearst. What do these people have in common? If you’re thinking that they…
Repurposing Your Data Warehouse Platform—Not!
I’ve noticed lately that data warehouse vendors are dusting off the arguments and pitches of days gone by. Don’t buy…