Are Duplicate Tweets Spam?
The Twitterverse is all a-twitter with a new controversy: Twitter has rolled out a new feature that blocks duplicate tweets.…
Data Modeling with Generalizations – The Tool Issue
A bunch of factors have converged lately on the topic of generalized versus specific data modeling approaches. I’m working through…
Marketing Lessons Learned From Micro-Finance In India
Besides Wall Street bankers, the poor of the world need access to financial liquidity, too. But loaning money to individuals…
Aggregating Tags
One of the funnest parts of any web analytics role is instrumentation: the tagging of the various parts of the…
Who knows what happiness lurks in the hearts of men? Facebook knows.
Have you heard about the Facebook Gross National Happiness Index? On Monday, October 12, the Times ran an article (by…
Oracle OpenWorld Update #2 – Oracle’s use of social media
It's a sign of the times I guess. I mean, social media is everywhere these days and OpenWorld isn't an…
Go Shopping, Be Social
If you’re into search startups, then today’s a great day to check out what a couple of them are up…
Open standards for data mining and the need for training material
PMML awareness is growing. Many companies have recently joined the DMG (Data Mining Group) and others are already in the…
The evolving nature of IT partnerships
I have recently re-joined the corporate world, after taking a few years to pursue a sea change, which is a…
Training students on mega-scale data
In a New York Times article (sub. req.) published on the weekend, IBM and Google expressed doubts that the students…

