R can handle fairly big data on a single machine: 2 billion (2e9) rows and a few columns take roughly 100 GB of memory (for example, six 8-byte numeric columns would need about 2e9 × 6 × 8 bytes ≈ 96 GB). That is already enough data to care about performance.
In this post I am going to discuss the scalability of filter queries.
Indexes were introduced in data.table 1.9.4; they are also known as secondary keys. Unlike the key, a single data.table can have multiple indexes.
An index basically stores an additional vector of row order as a data.table attribute.
That sounds really simple, and it gets even better: the user does not have to use indexes in any special way, because data.table handles them automatically.
And the performance gains are big enough to write a post about.
What you should know about data.table indexes (as of 2015-11-23):
- an index will be used when subsetting a dataset with == or %in% on a single variable
- by default, if an index for the filtered variable is not yet present, it is automatically created and then used (see the sketch after this list)
- indexes are lost if you change the order of data
- you can check whether an index is being used with options(datatable.verbose=TRUE)
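A minimal sketch of that behaviour (the column name x and the sampled values are made up for illustration; the exact verbose messages are not reproduced here):

library(data.table)
options(datatable.verbose=TRUE, datatable.auto.index=TRUE)
DT = data.table(x = sample(letters, 1e6, replace=TRUE))
DT[x == "q"]                          # first filter on x: an index is created automatically, then used
DT[x == "q"]                          # second filter: the existing index is reused
names(attributes(attr(DT, "index")))  # lists the stored indexes, e.g. "__x"
DT = DT[sample(nrow(DT))]             # one way to reorder the rows: the result is a new table
attr(DT, "index")                     # expected to be NULL, the index is not carried over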
The features above are likely to be improved in future versions.
It is also important to mention that there is an open FR to automatically utilize indexes when doing an unkeyed join (a new feature in 1.9.6) using the new on argument. So in a future version users will be able to leverage the mighty performance of indexes when joining datasets.
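For context, this is what an unkeyed join with the on argument looks like in 1.9.6 (a minimal sketch with made-up tables; today such a join does not use indexes, which is exactly what the feature request asks for):

library(data.table)
d1 = data.table(id = c("a","b","c"), v1 = 1:3)
d2 = data.table(id = c("b","c","d"), v2 = 4:6)
d1[d2, on = "id"]   # join on id without setting a key on either table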
Brief look at the structure:
library(data.table)
op = options(datatable.verbose=TRUE,
             datatable.auto.index=TRUE)
dt = data.table(a=letters[c(3L,1L,2L)])
set2keyv(dt, "a")
## forder took 0 sec
attr(dt, "index")
## integer(0)
## attr(,"__a")
## [1] 2 3 1
dt[a=="b"]
## Using existing index 'a'
## Starting bmerge ...done in 0 secs
## a
## 1: b
dt[a %in% c("b","c")]
## Using existing index 'a'
## Starting bmerge ...
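To illustrate that a single data.table can hold several indexes, a minimal sketch (assuming an extra column b; each set2keyv call is expected to add its own index attribute next to the existing ones):

d3 = data.table(a = c("c","a","b"), b = c(2L, 3L, 1L))
set2keyv(d3, "a")
set2keyv(d3, "b")
names(attributes(attr(d3, "index")))   # expected to list both "__a" and "__b"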
Source: r-bloggers.com