Cookies help us display personalized product recommendations and ensure you have great shopping experience.

By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
SmartData CollectiveSmartData Collective
  • Analytics
    AnalyticsShow More
    big data and customer service outsourcing
    How Data Analytics Improves Customer Service Outsourcing
    18 Min Read
    How a Specialized Marketing VA Improves Campaign Analytics
    How a Specialized Marketing VA Improves Campaign Analytics
    11 Min Read
    New Data Analytics Breakthroughs Give eCommerce Startups a Fighting Chance
    New Data Analytics Breakthroughs Give eCommerce Startups a Fighting Chance
    6 Min Read
    How Data Analytics Is Reshaping Patient Financing Decisions
    How Data Analytics Is Reshaping Patient Financing Decisions
    13 Min Read
    business using business intelligence
    How to Use a Competitive Intelligence Dashboard to Turn Market Data Into Smarter Marketing Decisions 
    9 Min Read
  • Big Data
  • BI
  • Exclusive
  • IT
  • Marketing
  • Software
Search
© 2008-25 SmartData Collective. All Rights Reserved.
Reading: Clustering the thoughts of Twitter Users
Share
Notification
Font ResizerAa
SmartData CollectiveSmartData Collective
Font ResizerAa
Search
  • About
  • Help
  • Privacy
Follow US
© 2008-23 SmartData Collective. All Rights Reserved.
SmartData Collective > Uncategorized > Clustering the thoughts of Twitter Users
Uncategorized

Clustering the thoughts of Twitter Users

ThemosKalafatis
ThemosKalafatis
5 Min Read
SHARE

During the last two posts i presented the reasons and some problems on analyzing the thoughts of users on the web and particularly Twitter. (For more see Part1 and Part2 ).

As an example, we are going to be looking at a specific kind of thought that Twitter users make : What they don’t want. So let us start : By using the Twitter API i managed to extract all tweets having the phrase “i don’t want to”. The following text file shows the results :

The next step is to remove all phrases that do not give us any information about what users do not want :


Finally we remove the phrase “i don’t want to”. However, consider the following example:

“I must go to Chicago. I don’t want to do that”

The steps discussed above will discard the first sentence which is actually what the user does not want to do and leave only the phrase “i don’t want to do that” which is not particularly informative. At this point we must quantify the problem -let’s assume it involves the 8.5% of our records- and recall what the pareto principle is all about.

After some additional pre-processing steps which are not discussed here, i feed the data to K-Means to see the clusters the algorithm comes up with. For a better pres…

During the last two posts i presented the reasons and some problems on analyzing the thoughts of users on the web and particularly Twitter. (For more see Part1 and Part2 ).

More Read

The Long Term Value of Community Relations
Pull Your Data Together to Sell More Furniture
saP or saS or sasR or saaS
Aspirational Marketing
The Smart Data Collective Is Looking For a Community Manager!
As an example, we are going to be looking at a specific kind of thought that Twitter users make : What they don’t want. So let us start : By using the Twitter API i managed to extract all tweets having the phrase “i don’t want to”. The following text file shows the results :

The next step is to remove all phrases that do not give us any information about what users do not want :


Finally we remove the phrase “i don’t want to”. However, consider the following example:

“I must go to Chicago. I don’t want to do that”

The steps discussed above will discard the first sentence which is actually what the user does not want to do and leave only the phrase “i don’t want to do that” which is not particularly informative. At this point we must quantify the problem -let’s assume it involves the 8.5% of our records- and recall what the pareto principle is all about.

After some additional pre-processing steps which are not discussed here, i feed the data to K-Means to see the clusters the algorithm comes up with. For a better presentation of the results, here is a screen capture from IBM’s UI Modeler :


We immediately see -in descending order- what Tweeter users do not want :

1) They don’t want to go to work
2) They don’t want to go to school
3) They don’t want to hear about various issues
4) They don’t want to stay home

Notice also the top two categories named Miscellaneous and None. These categories contain thoughts that have a very small frequency to form a cluster. These two categories consist the 69.56% of our records and at this point we should think again about the pareto principle.

Please note that not all necessary work is discussed here and i had to omit several actions that have to take place. In trying to understand what people actually think i am using an approach which uses Ontologies, Information Extraction, Clustering and Classification analysis with the ultimate goal to minimize the percentage of thoughts (69.56% in this example) that cannot form a cluster and to increase the accuracy of the analysis.

It is also an interesting fact that we could move further down the sentence branch (see this post) for even better insight. Here i presented a clustering analysis about what users do not want. As an example we could apply clustering on user thoughts for “I don’t want to feel”.

Link to original post

Share This Article
Facebook Pinterest LinkedIn
Share

Follow us on Facebook

Latest News

big data and customer service outsourcing
How Data Analytics Improves Customer Service Outsourcing
Analytics Exclusive
The End of Unstructured Marketing: Forcing Generative AI into Strict HTML Schemas
The End of Unstructured Marketing: Forcing Generative AI into Strict HTML Schemas
Artificial Intelligence Exclusive
How a Specialized Marketing VA Improves Campaign Analytics
How a Specialized Marketing VA Improves Campaign Analytics
Analytics Exclusive
ai marketing tools
The 9 AI Tools Marketers Use to Create Images and Video in 2026
Artificial Intelligence Exclusive

Stay Connected

1.2KFollowersLike
33.7KFollowersFollow
222FollowersPin

You Might also Like

The Second Coming… Can Time Inc. Save the Magazine Business and Make Magazines Measurable?

5 Min Read

MDM Streamlines the Supply Chain

6 Min Read

IBM Will Take Varicent for Sales Performance Management

9 Min Read

How to disrupt Wall Street

7 Min Read

SmartData Collective is one of the largest & trusted community covering technical content about Big Data, BI, Cloud, Analytics, Artificial Intelligence, IoT & more.

data-driven web design
5 Great Tips for Using Data Analytics for Website UX
Big Data
ai chatbot
The Art of Conversation: Enhancing Chatbots with Advanced AI Prompts
Chatbots

Quick Link

  • About
  • Contact
  • Privacy
Follow US
© 2008-26 SmartData Collective. All Rights Reserved.
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?