Sign up | Login with →

R Programming Language

The programming language

High-Performing Predictive Analytics with R and Hadoop

July 17, 2013 by David Smith

Predictive analytics slideshare

Mario Inchosa gave a standing-room-only talk on high-performance predictive analytics in R and Hadoop at last month's Hadoop Summit. In the talk, he described some of the progress we've made integrating the ScaleR parallel external-memory algorithms into the Hadoop platform.[read more]

A Comprehensive Guide to Time Series Plotting in R

June 28, 2013 by David Smith

Time series plotting in R / shutterstock

As R has evolved, its capabilities have improved in every area. The visual display of time series is no exception: as the folks from Timely Portfolio note that "Through both quiet iteration and significant revolutions, the volunteers of R have made analyzing and charting time series pleasant."[read more]

How Big Data and Statistical Modeling Are Changing Video Games [WEBINAR REPLAY]

June 14, 2013 by David Smith

Big Data & video games

Bill Grosso presented a fascinating webinar about the video gaming industry yesterday: "Knowing How People Are Playing Your Game Gives You the Winning Hand." Here's the replay — don't miss the Q&A section at the end where Bill revealed some of his favorite R packages and books.[read more]

Social Data: The Arteries of the World, in Tweets

June 3, 2013 by David Smith

Social Big Data: Twitter data visualization

What happens when you plot billions of geotagged Tweets on a map? You can see the arteries of the world. Check out this cool data visualization of Tweets in Europe. According to creator Miguel Rios, the dots on this chart represent every geotagged Tweet since 2009.[read more]


7 Big Data Trends That Will Impact Your Business

May 24, 2013 by Bob Zurek

Where's Big Data leading your business?

The topic of big data continues to pulsate with vigor in the market, as demonstrated by the wide variety of data innovations emerging daily and the talented professionals successfully pursuing the creation and use of big data solutions. So what trends might we see emerge in the Big Data ecosystem?[read more]

Lots of Data Does Not Equal "Big Data"

March 29, 2013 by David Smith

Lots of data does not necessarily equate to “Big Data." To my way of thinking, the single most important capability to implement in any large scale data platform that is going to support sophisticated analytics is the ability to quickly construct, high quality random samples.[read more]

NCAA Data Visualizer for March Madness Face-Offs

March 24, 2013 by David Smith

data visualizer for NCAA

If you're laying down a friendly bet on the March Madness games or just tweaking your fantasy roster, this NCAA Data Visualizer by Rodrigo Zamith will be a boon. Just choose two teams to compare head-to-head, and choose an attribute to compare them on.[read more]

Open Data App for the Paris Métro

March 22, 2013 by David Smith

Back when my friends and I lived in different parts of Paris, it was tricky to find a mutually agreeable place to meet, so that we'd all be taking an approximately equally long Métro ride. If only we'd had Jean-Robert's Metro Meeting Point app, the decision would have been an easy one.[read more]

Data Science Education Gets Personal

March 15, 2013 by David Smith

This year with both Udacity and Harvard and MIT-backed edX offering interesting and challenging courses, the growth of MOOC enrollment must be astounding indeed. Then again, while MOOC courses are “free,” for a working professional they not without opportunity costs.[read more]

R Script Creates a Map of Worldwide Email Traffic

March 14, 2013 by David Smith

r script email traffic

The Washington Post reports that by analyzing more than 10 million emails sent through the Yahoo! Mail service in 2012, a team of researchers used the R language to create a map of countries whose citizens email each other most frequently.[read more]

R Script Tracks Bookies' Favorites for the Next Pope

March 5, 2013 by David Smith

Tired of manually running a python script to scrape the latest bookmaker odds on the next Pope, R user AJ (an analytical research manager at a large healthcare company) instead created an R script to track the odds on the Papal successor.[read more]

Revolution Analytics CEO: Big Data Is a New Management Discipline

March 1, 2013 by Gil Press

Dave Rich has an interesting theory explaining the rapidly growing interest in predictive analytics. “When the 2008 recession hit,” he told me, “the question was how come we weren’t better prepared with all the money we’ve spent in the last decade on information systems?"[read more]

Resampling Data in Hadoop with RHadoop

February 28, 2013 by David Smith

Uri Laserson has created an excellent guide to resampling from a large data set in Hadoop. Resampling is an important step in fitting ensemble models (including random forests and other bagging techniques), and Uri provides a step-by-step guide to resampling with RHadoop.[read more]

Political Revolutions on Twitter, Visualized with R

December 13, 2012 by David Smith

Esteban Moro Egido, a mathematics professor at Universidad Carlos III in Madrid has produced a video depicting Twitter activity around Spain's general strike in March this year. He used the R language for analyzing all of the tweets, retweets and mentions related to the strike.[read more]

Big Data Trees with Hadoop HDFS

December 4, 2012 by David Smith

Last month's release of Revolution R Enterprise 6.1 added the capability to fit decision and regresson trees on large data sets (using a new parallel external memory algorithm included in the RevoScaleR package).[read more]