Parallel version of `[[`

Although the solution below isn't faster in parallel on my personal computer, it's at least an example on how to read in parallel:

``` r
library(foreach)
library(doParallel)
library(Coldbir)
library(data.table)

# Prepare data
N <- 10000000L

x <- data.table(
  a = as.Date(sample(c(1000+1:100), N, replace = T), origin = "1960-01-01"),
  b = sample(1:100, N, replace = T),
  c = sample(1:10000, N, replace = T),
  d = sample(LETTERS, N, replace = T)
)

a <- cdb()
a[] <- x

cols <- names(x)

# Read non-parallel
system.time({res <- a[]})

# Read parallel (by column)
cl <- makeCluster(4)
registerDoParallel(cl)

system.time({
  res <- as.data.table(foreach(i = cols) %dopar% {
    require(Coldbir)
    a[i][[1]]
  })
  setnames(res, cols)
})

stopCluster(cl)
```

Perhaps we can figure out a better use case.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parallel version of `[[` #103

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Parallel version of [[ #103

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions

Parallel version of `[[` #103