AirbnB uses R to scale data science
By David Smith the property-rental marketplace that helps you find a place to stay when you’re travelling, uses R to scale data science. Airbnb is a famously data-driven company, and has recently gone...
View ArticleAirbnB uses R to scale data science
By David Smith Airbnb, the property-rental marketplace that helps you find a place to stay when you’re travelling, uses R to scale data science. Airbnb is a famously data-driven company, and has...
View ArticleTravis CI: “You Have Too Many Tests LOLZ!”
By Randy Zwitch No output has been received in the last 10m0s, this potentially indicates a stalled build or something wrong with the build itself. As part of getting RSiteCatalyst 1.4.8 ready for...
View ArticleAggregation with dplyr: summarise and summarise_each
By Andrea Spanò We use summarise() with aggregate functions, which take a vector of values and return a single number. Function summarise_each() offers an alternative approach to summarise() with...
View ArticlePerform co-operations with the coop package
By wrathematics About The coop package does co-operations: covariance, correlation, and cosine, and it does them quickly. The package is available on CRAN and GitHub, and has two vignettes:...
View ArticleRcppAPT 0.0.2
By Thinking inside the box A new version of RcppAPT — our interface from R to the C++ library behind the awesome apt, apt-get, apt-cache, … commands and their cache — is now on CRAN. It adds three new...
View ArticleR pkg Easter Eggs — Revenge of Pacman!
By Steve Simpson In my last post, I praised the virtues of pacman. I also wanted to indulge of one of the main benefits of working with one of pacman’s devs: Tyler Rinker. Tyler is a geek’s geek, in...
View ArticleComputational Actuarial Science, with R, in Barcelona
By arthur charpentier This Wednesday, I will give a graduate crash course on computational actuarial science, with R, which will be the second part of the lecture of Tuesday. Slides are now available,...
View ArticleThe Pirate Plot (2.0) – The RDI plotting choice of R pirates
By ndphillips Plain vanilla barplots are as uninformative (and ugly) as they are popular. And boy, are they popular. From the floors of congress, to our latest scientific articles, barplots surround...
View ArticleVisualising F1 Stint Strategies
By Tony Hirst With the new F1 season upon us, I’ve started tinkering with bits of code from the Wrangling F1 Data With R book and looking at the data in some new ways. For example, I started wondering...
View ArticleLearn How to Clean Your Data Using R
By DataCamp Blog Data scientists often remark that 80% of their time is spent on cleaning data and only 20% on the actual analysis. Data cleaning is a critical part of the data science process, yet is...
View ArticleAn Analysis of Traffic Violation Data with SQL Server and R
By Joseph Rickert By Srini Kumar, Director of Data Science at Microsoft Who does not hate being stopped and given a traffic ticket? Invariably, we think that something is not fair that we got it and...
View Article3D plotting exercises
By Matteo Renzi In this set of exercises we will introduce the concept of 3D plotting. Specifically, we will use these commands:image(), contour() and persp(). For these exercises, you need to have a...
View ArticleI Went to ROpenSci Unconference and All I Got Were These Lousy Hex Stickers
By Julia Silge Just kidding; it was amazing. Last week, I traveled to San Francisco to participate in an unconference/hackathon organized and hosted by ROpenSci. This was my first R conference or...
View ArticleA workflow for publishing RStudio notebooks on Blogger
By Bart Rogiers The past few years, I have been searching regularly for ways of formatting R code on +Blogger. Although different possibilities were available, almost always I ended up using the online...
View ArticleHow long could it take to run a regression
By arthur charpentier This afternoon, while I was discussing with Montserrat (aka @mguillen_estany) we were wondering how long it might take to run a regression model. More specifically, how long it...
View ArticleA quick introduction to machine learning in R with caret
By Sharp Sight Labs If you’ve been using R for a while, and you’ve been working with basic data visualization and data exploration techniques, the next logical step is to start learning some machine...
View Articlegeom_lollipop() by the Chartettes
By hrbrmstr df read.csv(text="category,pct Other,0.09 South Asian/South Asian Americans,0.12 Interngenerational/Generational,0.21 S Asian/Asian Americans,0.25 Muslim Observance,0.29 Africa/Pan...
View ArticleRcppArmadillo 0.6.700.3.0
By Thinking inside the box A new Armadillo release 6.700.3 is out, and we uploaded RcppArmadillo 0.6.700.3.0 to CRAN and Debian. This followed the usual thorough reverse-dependecy checking of by now...
View ArticleThe new and improved pirateplot()! Now with themes!
By ndphillips Hello fellow R pirates! For those of you who didn’t see it, I recently wrote a post demonstrating the pirateplot() function in the yarrr package. The pirateplot() function replaces the...
View Article