Quantcast
Viewing all articles
Browse latest Browse all 1015

Tutorial: Data Science with SQL Server R Services

By David Smith

You may have heard that R and the big-data RevoScaleR package have been integrated with with SQL Server 2016 as SQL Server R Services. If you’ve been wanting to try out R with SQL Server but haven’t been sure where to start, a new MSDN tutorial will take you through all the steps of creating a predictive model: from obtaining data for analysis, to building a statistical model, to creating a stored prodedure to make predictions from the model. To work through the tutorial, you’ll need a suitable Windows server on which to install the SQL Server 2016 Community Technology Preview, and make sure you have SQL Server R Services installed. You’ll also need a separate Windows machine (say a desktop or laptop) where you’ll install Revolution R Open and Revolution R Enterprise. Most of the computations will be happening in SQL Server, though, so this “data science client machine” doesn’t need to be as powerful.

The tutorial is made up of five lessons, which together should take you about 90 minutes to run though. If you run into problems, each lesson includes troubleshooting tips at the end.

Lesson 1 begins with downloading the New York City taxi data set (which was also used to create these beautiful data visualizations) and loading it into SQL Server. You’ll also set up R to include some useful packages such as ggmap and RODBC.

Lesson 2 starts by having you verify the data using SQL queries. Don’t miss the “Next Steps” links near the end, where you’ll summarize the data using the RevoScaleR package on the data science client machine, and then visualize the data as a map with the ggmaps package (as shown below).

Lesson 3 focuses on using R to augment the data with new features, such as calculating the distance between pickup and …read more

Source:: r-bloggers.com


Viewing all articles
Browse latest Browse all 1015

Trending Articles