World Series Analytics

IBM and the University of Southern California (USC) Annenberg Innovation Lab (AIL) have been collaborating on a project to apply analytics to public Twitter sentiment regarding the baseball playoffs, including the World Series, which begins tonight.

But the exercise is relevant well beyond post-season baseball: its aim is to improve the accuracy of sentiment analytics on a much broader scale.

“But understanding data is not a game. By taking deep dives into the Twitterverse (concentrating on the short 140-character tweets and polling large data sets over a number of days), students are able to determine positive or negative sentiments in a matter of minutes. The index is also being applied to other industries including movies and fashion retail so that students and organizations can see how Watson-inspired technologies, like sophisticated semantic and linguistic analysis software, can crunch Big Data to quickly gain temperature checks on timely issues.”

*********************************

LOS ANGELES, 19 October 2011: IBM (NYSE: IBM) and the University of Southern California (USC) Annenberg Innovation Lab (AIL) (www.annenberglab.org) today announced a new social media analysis project focused on Major League Baseball during the World Series. The USC Annenberg Social Sentiment Index is being compiled by students and relies on IBM Social Analytics technology to analyze millions of tweets in order to assess public social media engagement and opinion on topics ranging from sports and film to retail and fashion.

USC students have done an initial analysis of the National League Championship Series (NLCS), and will now broaden the index to follow the World Series games beginning tomorrow to determine “social media MVPs.”  The goal is to uncover hidden insights from Twitter followers that could help better understand player and team sentiment, and illustrate how advanced analytics technologies can help identify important trends.  

The students have used the technology for an initial test of more than 1.5 million public baseball-related tweets during the National League Championship Series, gauging positive and negative nuances and establishing overall sentiment rankings among a sampling of NLCS players.

Initial index findings show:

  • The Cardinals’ Chris Carpenter garnered the highest number of tweets indicating sentiment at 1,573, of which 61.4 percent were positive and 21.6 percent negative.
  • However, fellow Cardinal David Freese, a fan favorite and official NLCS MVP of the pennant race, garnered 768 tweets, 89.3 percent of them positive and only 15.4 percent negative, securing one of the highest ‘T’ scores (winner of the most uniformly positive tweets).
  • The Texas Rangers are winning the Twitter buzz battle: the American League’s social media champion was the focus of more than 56,600 tweets, five times more than the St. Louis Cardinals, with 79 percent of the tweets being positive. While the Cardinals are behind in the number of postings, they have matched the Rangers’ level of enthusiasm in their tweets. St. Louis garnered 11,500 tweets, 80 percent of which were upbeat.
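
To make the percentages above concrete, here is a minimal sketch of how positive and negative shares could be tallied from a set of tweets. It is an illustration only and not the IBM Social Analytics method, which the announcement does not describe: the word lists, the `label` and `sentiment_shares` functions, and the sample tweets are all hypothetical, and a simple word-count rule stands in for real semantic and linguistic analysis.

```python
from collections import Counter

# Hypothetical word lists, for illustration only (not IBM's lexicon).
POSITIVE = {"great", "clutch", "love", "mvp", "amazing"}
NEGATIVE = {"awful", "choke", "hate", "terrible", "bad"}
LABELS = ("positive", "negative", "neutral")


def label(tweet: str) -> str:
    """Label a tweet by comparing counts of positive and negative words."""
    words = [w.strip(".,!?#@").lower() for w in tweet.split()]
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def sentiment_shares(tweets: list[str]) -> dict[str, float]:
    """Return the percentage of tweets with each label, to one decimal place."""
    total = len(tweets)
    if total == 0:
        return {name: 0.0 for name in LABELS}
    counts = Counter(label(t) for t in tweets)
    return {name: round(100 * counts[name] / total, 1) for name in LABELS}


# Made-up sample tweets; real input would be the collected public tweets.
sample = [
    "Freese is so clutch, love this team!",
    "What a great game",
    "Terrible pitching tonight",
    "Game starts at 8",
]
shares = sentiment_shares(sample)
print(shares)
```

In this sketch every tweet receives exactly one label, so the three shares always sum to 100 percent. A production system would need to handle sarcasm, negation, and tweets that mention several players at once.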

For baseball fans everywhere, social media is now as integral a part of the game experience as keeping score or enjoying hot dogs and peanuts. In fact, during the post season, a banner behind home plate has been encouraging spectators to connect using the hashtag #postseason, giving fans an opportunity to both share and learn from others instantly, and providing researchers with an unfiltered voice of the fan that is ripe for analysis.

USC and IBM are collaborating to broaden student skills in analytics and demonstrate how Watson-inspired technologies, such as sophisticated semantic and linguistic analysis software, can provide new insights into public opinion by crunching complex data in real-time.

“We’ve known for some time how important statistics are in baseball, and today they’re an even bigger part of understanding not only player performance but also the views of loyal fans,” said Professor Jonathan Taplin, Director of the USC Annenberg Innovation Lab.  “The fans understand that the highest paid player is not necessarily the MVP. The Social Sentiment Index enables our students a unique opportunity to gain valuable knowledge in the use of advanced analytics technologies and apply it to real world settings to understand how this new information can benefit a variety of industries.”

The USC Annenberg Social Sentiment Index enables students to define areas of research and use IBM Social Analytics technology to explore how it can be used by organizations, from news outlets and journalists to movie studios and film marketers, to better understand, respond to, and predict public sentiment. To date, the Index has been applied to film forecasting in order to accurately predict movie blockbuster success rates, and most recently was used by students to identify top trends for retailers from the New York Fashion Week shows.

“Analyzing data is not a game — it’s an important way to understand different constituencies and gain competitive advantage,” said Rod Smith, Vice President of Emerging Technology, IBM.  “Whether it’s analyzing fan sentiment during a sports event, hospital patient data for personalized treatment programs, or the latest fashion trends for more targeted marketing campaigns, organizations are realizing the value of analytics to better respond to customer needs.”

The ability to glean insights into viewpoints from Big Data (structured and unstructured information) carries value across all aspects of baseball, from the media outlets covering reactions to the game and players, to businesses marketing to the fans, and most importantly, to the players and coaches themselves. In fact, analyzing data to generate actionable insights in the baseball world has already afforded major league teams better decision-making to create productive ball clubs year after year. For example, Oakland Athletics general manager Billy Beane has become famous for assembling competitive teams on a limited payroll through the use of analytics, an approach made famous by ‘Moneyball,’ the best-selling book and motion picture.

IBM’s collaboration with the USC Annenberg Innovation Lab is part of its continued efforts to advance student skills in analytics across academia. IBM is working with more than 6,000 universities around the world to develop curricula and provide training, resources and support for business analytics.

The USC Annenberg Social Sentiment Index on baseball is being conducted as part of the 2011 IBM Information on Demand and Business Analytics Forum taking place in Las Vegas, October 23-27.

The index on baseball will be updated during the World Series on asmarterplanet.com to illustrate the ongoing shifts in fan sentiment throughout the series.

For more information about IBM and analytics, visit www.ibm.com/analytics.

About the University of Southern California Annenberg Innovation Lab
The University of Southern California Annenberg Innovation Lab (AIL) is part of the Annenberg School for Communication & Journalism. The lab was established in 2010 to develop social and technological innovations with real-world application. AIL brings together scholars, students and the business community to develop digital initiatives and emerging technologies. The lab has increasingly focused on the evolution of social networking as a platform for commerce, entertainment, education and journalism.