Bringing the powers of SQL into R
By Lionel Hertzog One of the big flaw of R is that data loaded into it are stored in the memory (on the RAM) and not on the disk. As you are working in an analysis with large (big) data the processing...
View ArticleThe ‘lost boarding pass’ puzzle: efficient simulation in R
By David Robinson A family member recently sent me a puzzle: One hundred people are lined up with their boarding passes showing their seats on the 100-seat Plane. The first guy in line drops his pass...
View ArticleR 3.2.3 released
By David Smith Yesterday, the R Core Team released a new update to R (version 3.2.3, codenamed “Wooden Christmas Tree”), and the source distribution is now available for download on CRAN. Binary...
View ArticleImportance of exports and economic growth, cross-country time series
By Peter’s stats stuff – R Exports and economic growth I was looking to show a more substantive piece of analysis using the World Development Indicators data, and at the same time show how to get...
View ArticleModeling and Solving Linear Programming with R – Free book
By Tal Galili Modeling and Solving Linear Programming with R (pdf – free download link) is a book about solving linear programming problems/exercises with R. This book provides a brief introduction to...
View ArticleWin-Vector news
By John Mount Just an update of what we have been up to lately at Win-Vector LLC, and a reminder of some of our current offerings. It has been busy lately (and that is good). Our current professional...
View ArticleCalculate Leave-One-Out Prediction for GLM
By statcompute In the model development, the “leave-one-out” prediction is a way of cross-validation, calculated as below:1. First of all, after a model is developed, each observation used in the model...
View ArticleR 3.2.3 is released (with improvements for Windows users, and general bug fixes)
By Tal Galili R 3.2.2 (codename “Wooden Christmas Tree”) was released several days ago. You can get the latest binaries version from here. (or the .tar.gz source code from here). The full list of new...
View ArticleR 3.2.3 is released (with improvements for Windows users, and general bug fixes)
By Tal Galili R 3.2.2 (codename “Wooden Christmas Tree”) was released several days ago. You can get the latest binaries version from here. (or the .tar.gz source code from here). The full list of new...
View Article“Why do people contribute to the R?” – concolusions from a new PNAS article
By Tal Galili tl;dr: People contribute to R for various reasons, which evolves with time. The main reasons appear to be: “fun coding”, personal commitment to the community, interaction with like-minded...
View Article“Why do people contribute to the R?” – concolusions from a new PNAS article
By Tal Galili tl;dr: People contribute to R for various reasons, which evolves with time. The main reasons appear to be: “fun coding”, personal commitment to the community, interaction with like-minded...
View ArticleFear of WaPo Using Bad Pie Charts Has Increased Since Last Year
By hrbrmstr library(tidyr) library(ggplot2) library(ggthemes) library(scales) library(dplyr) # Easiest way to transcribe the PDF table # The slope calculation will enable us to color the lines/points...
View ArticlePlotting Scopus article level citation data in R
By Daniel Lakens The Royal Society has decided to publish journal citations distributions. This makes sense. The journal impact factor is a single number trying to summarize a distribution, but it’s...
View ArticleMaintaining a database of price files in R
By The R Trader Doing quantitative research implies a lot of data crunching and one needs clean and reliable data to achieve this. What is really needed is clean data that is easily accessible (even...
View ArticleIn case you missed it: November 2015 roundup
By David Smith In case you missed them, here are some articles from November of particular interest to R users. You can use emojis as plotting symbols in ggplot2 charts with the emoGG package. A review...
View ArticleData Science Radar – Technologist Profile
By Mango Blogger by Mark Sellors, Mango Solutions @sellorm Mark Sellors from Mango took the Data Science Radar Challenge and his dominant skill was a Technologist, so we asked him a few questions. 1....
View ArticlePhyllotaxis By Shiny
By aschinchon Antonio, you don’t know what empathy is! (Cecilia, my beautiful wife) Spirals are nice. In the wake of my previous post I have done a Shiny app to explore patterns generated by changing...
View ArticleR and Python: Theory of Linear Least Squares
By Al-Ahmadgaid Asaad In my previous article, we talked about implementations of linear regression models in R, Python and SAS. On the theoretical sides, however, I briefly mentioned the estimation...
View ArticleIn case you missed it: November 2015 roundup
By David Smith In case you missed them, here are some articles from November of particular interest to R users. You can use emojis as plotting symbols in ggplot2 charts with the emoGG package. A review...
View ArticleMaking Sense of Logarithmic Loss
By Andrew Collier Logarithmic Loss, or simply Log Loss, is a classification loss function often used as an evaluation metric in kaggle competitions. Since success in these competitions hinges on...
View Article