Do basic R operations much faster in bash [Slightly off-topic]

R is great, and you can do a LOT OF stuff with it.

However, sometimes you want to do really basic stuff with huge or a lot of files. At work, I have to do that a lot because I am mostly dealing with language data that often needs some pre-processing.

Most of these operations are done much, much faster on the level of the operating system (preferably in Bash on Linux or Unix, i.e. Mac OS). And since R tries to load everything into working memory, these functions might also help you to do stuff with files that are too big for your RAM.

This blog post is some kind of cheat sheet for me to remember some of the bash functions that prove very useful to me. (Most of the functions are quite basic for an advanced user of Linux or Unix, I guess).

Disclaimer: Most of these calls were adapted from different StackExchange questions. There are really lots of very helpful posts. Thanks to the community!

Superfast subset of a tabulated text file (it might also be gzipped!):
[z]grep -E >
could include your separators. If is tab-separated, use -P for Perl-like regular expressions (only works with grep, not with zgrep?).

Superfast extraction of the first column from a tab-separated file:
cut -f1 >
Just replace with * if you want to extract the first column from each file and write them all into the same .

Write unique rows of a file into a new file:
sort | uniq >
Yes, there is no “e” after uniq! You have to sort first.

Get list of files from a directory really fast – this has to be inserted into an R script to get a list of files:
files
I …read more

Source:: r-bloggers.com

Do basic R operations much faster in bash [Slightly off-topic]

Trending Articles

Practice Sheet of Right form of verbs for HSC Students

ОЧІ В ОЧІ – Синоніми – Single [iTunes Plus M4A]

Conman who lived a life of luxury is jailed

Moondru Mudichu 21-07-2016 – Polimer tv Serial

Download – The Last Ship 1ª Temporada RMVB Dublado – MEGA

VIDEO2BRAIN - GETTING STARTED WITH ILLUSTRATOR CS6

Nalgonda District Police Office Mobile Numbers List in Telangana State

Group Policy Update Monitor False alerts

Pass through scenario in SAP PI with no mapping for File to IDoc and Idoc to...

QUIZ: Are You Smart Enough To Be A US Marine?

SAHARA FLASH LIVE IN WERAGOLLA 2018-04-20

NCERT Solutions for Class 9th Sanskrit Chapter 3 पाथेयम्

The 10 Tennessee Cities With The Largest Black Population For 2021

ZARIA CUMMINGS

Cheltenham man avoids prison after glassing girlfriend

Storage DRS Fault won't clear

99 God Status for Whatsapp, Facebook

Shatta Wale – You Shock Me (Prod. by Willis Beatz)

Top 10 FBB OnlyFans & Muscle Girl OnlyFans in 2023

NOTES ZA GENERAL CHEMISTRY ZA NGAIZA