Updated dplyrXdf package brings data munging with pipes to Xdf files

by Hong Ooi, Sr. Data Scientist, Microsoft

I’m pleased to announce the release of version 0.62 of the dplyrXdf package, a backend to dplyr that allows the use of pipeline syntax with Microsoft R Server’s Xdf files. This update adds a new verb (persist), fills some holes in support for dplyr verbs, and fixes various bugs.

The `persist` verb

A side-effect of dplyrXdf handling file management is that passing the output from one pipeline into subsequent pipelines can have unexpected results. Consider the following example:

# pipeline 1
output1 <- flightsXdf %>%
    mutate(delay=(arr_delay + dep_delay)/2)

# use the output from pipeline 1
output2 <- output1 %>%
    group_by(carrier) %>%
    summarise(delay=mean(delay))

# reuse the output from pipeline 1 -- WRONG
output3 <- output1 %>%
    group_by(dest) %>%
    summarise(delay=mean(delay))

The problem with this code is that the second pipeline will overwrite or delete its input, so the third pipeline will fail. This is consistent with dplyrXdf’s philosophy of only saving the most recent output of a pipeline, where a pipeline is defined as all operations starting from a raw xdf file. However, in this case it isn’t what’s desired.

Similarly, dplyrXdf stores its output files in R’s temporary directory, so when you close your R session, these files will be deleted. This saves you having to manually delete files that are no longer in use, but it means that you must copy the output of your pipeline to a permanent location if you want to keep it around.

The new persist verb is meant to address these issues. It saves a pipeline’s output to a permanent location and also resets the status of the pipeline, so that subsequent operations will know not to overwrite the data.

# pipeline 1 -- use persist to save the data to the working directory

output1  ...read more
Source:: http://revolutionanalytics.com

Updated dplyrXdf package brings data munging with pipes to Xdf files

The `persist` verb

Trending Articles

Moondru Mudichu 07-06-2016 – Polimer tv Serial

Greg Gutfeld

SUPREME COURT RULES AGAINST O’NEILL GOVERNMENT

Ummet Ozcan – Ocean’s Voice – Single [iTunes Plus M4A]

My Sisters Plan For Me To Smell Her Feet (Fiction): Part 1,2,3 and 4!!!

Black Angus Grilled Artichokes

Who died from the T.V. Show pawn stars ?? #pawnstars

Barry loses dog bite case

Xamarin Forms Android App Connect/Communicate via USB to PC

DeDRM Tools 6.8.1 Released

Practice Sheet of Right form of verbs for HSC Students

Fight Path: Michael Imperato continues climb from controversy over denied UFC...

Telangana Ration Shop Key Register Download

Is DongFang Bubai better than the Greats in Condor heroes trilogy?

Download: Enalia – Malumbo

HOLGER KAMIN Arrested by Miami-Dade County Corrections on Feb 14, 2017

Xiaomi YI YHS-113 HD Smart WiFi IP Camera new firmware upgrade

मुख मैथुन से उठाएं सेक्स का भरपूर मज़ा, जानें क्या है इसका सही तरीकामुख मैथुन...

Stock globe youwin m022(led),m022t firmwares

Best Suvichar in Hindi |बेस्ट सुविचार |शुभ विचार हिंदी में

The persist verb

Trending Articles

The `persist` verb