Class: Spark::PipelinedRDD
Overview
Pipelined Resilient Distributed Dataset: operations are pipelined and sent to the worker as one stage.
RDD
`-- map
`-- map
`-- map
Code is executed from top to bottom
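The idea can be sketched in plain Ruby: instead of materializing an intermediate collection after every map, the chained functions are composed and applied once per element. TinyPipelinedRDD below is a hypothetical illustration of that principle, not part of the gem:

```ruby
# Minimal sketch of map pipelining (hypothetical class, not ruby-spark's API).
class TinyPipelinedRDD
  def initialize(data, funcs = [])
    @data = data
    @funcs = funcs
  end

  # Each #map only records the function; nothing is computed yet.
  def map(&block)
    TinyPipelinedRDD.new(@data, @funcs + [block])
  end

  # A single pass applies the recorded functions top to bottom.
  def collect
    @data.map { |x| @funcs.reduce(x) { |acc, f| f.call(acc) } }
  end
end

rdd = TinyPipelinedRDD.new([1, 2, 3]).map { |x| x + 1 }.map { |x| x * 10 }
p rdd.collect # => [20, 30, 40]
```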
Instance Attribute Summary collapse
- #command ⇒ Object (readonly): Returns the value of attribute command.
- #prev_jrdd ⇒ Object (readonly): Returns the value of attribute prev_jrdd.
Attributes inherited from RDD
Instance Method Summary collapse
- #initialize(prev, command) ⇒ PipelinedRDD (constructor): A new instance of PipelinedRDD.
- #jrdd ⇒ Object: Serializes the necessary parts and sends them to RubyRDD (the Scala extension).
- #pipelinable? ⇒ Boolean
Methods inherited from RDD
#+, #add_command, #add_library, #aggregate, #aggregate_by_key, #bind, #cache, #cached?, #cartesian, #checkpointed?, #coalesce, #cogroup, #collect, #collect_as_hash, #collect_from_file, #combine_by_key, #compact, #config, #count, #default_reduce_partitions, #distinct, #filter, #first, #flat_map, #flat_map_values, #fold, #fold_by_key, #foreach, #foreach_partition, #glom, #group_by, #group_by_key, #group_with, #histogram, #id, #inspect, #intersection, #key_by, #keys, #map, #map_partitions, #map_partitions_with_index, #map_values, #max, #mean, #min, #name, #new_rdd_from_command, #partition_by, #partitions_size, #persist, #pipe, #reduce, #reduce_by_key, #reserialize, #sample, #sample_stdev, #sample_variance, #set_name, #shuffle, #sort_by, #sort_by_key, #sort_by_value, #stats, #stdev, #subtract, #subtract_by_key, #sum, #take, #take_sample, #to_java, #union, #unpersist, #values, #variance
Methods included from Helper::Statistic
#bisect_right, #compute_fraction, #determine_bounds, #upper_binomial_bound, #upper_poisson_bound
Methods included from Helper::Parser
Methods included from Helper::Logger
Constructor Details
#initialize(prev, command) ⇒ PipelinedRDD
Returns a new instance of PipelinedRDD.
# File 'lib/spark/rdd.rb', line 1338

def initialize(prev, command)
  if prev.is_a?(PipelinedRDD) && prev.pipelinable?
    # Second, ... stages
    @prev_jrdd = prev.prev_jrdd
  else
    # First stage
    @prev_jrdd = prev.jrdd
  end

  @cached = false
  @checkpointed = false
  @context = prev.context
  @command = command
end
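The branch above means a chain of pipelinable stages keeps pointing at the same underlying Java RDD; only the Ruby command grows. A rough illustration with stand-in stubs (these are hypothetical classes, not the real ruby-spark ones):

```ruby
# Stand-in stubs illustrating the constructor's branch: later pipelinable
# stages reuse the first stage's underlying Java RDD.
JavaRDDStub = Struct.new(:name)
BaseRDDStub = Struct.new(:jrdd)

class DemoPipelinedRDD
  attr_reader :prev_jrdd, :command

  def initialize(prev, command)
    if prev.is_a?(DemoPipelinedRDD) && prev.pipelinable?
      @prev_jrdd = prev.prev_jrdd # second and later stages: keep the original source
    else
      @prev_jrdd = prev.jrdd      # first stage: start from the plain RDD
    end
    @cached = false
    @command = command
  end

  def pipelinable?
    !@cached
  end
end

base   = BaseRDDStub.new(JavaRDDStub.new('source'))
first  = DemoPipelinedRDD.new(base, :map_a)
second = DemoPipelinedRDD.new(first, :map_b)
p first.prev_jrdd.equal?(second.prev_jrdd) # => true
```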
Instance Attribute Details
#command ⇒ Object (readonly)
Returns the value of attribute command.
# File 'lib/spark/rdd.rb', line 1336

def command
  @command
end
#prev_jrdd ⇒ Object (readonly)
Returns the value of attribute prev_jrdd.
# File 'lib/spark/rdd.rb', line 1336

def prev_jrdd
  @prev_jrdd
end
Instance Method Details
#jrdd ⇒ Object
Serializes the necessary parts and sends them to RubyRDD (the Scala extension).
# File 'lib/spark/rdd.rb', line 1360

def jrdd
  @jrdd ||= _jrdd
end
#pipelinable? ⇒ Boolean
# File 'lib/spark/rdd.rb', line 1355

def pipelinable?
  !(cached? || checkpointed?)
end
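Why caching blocks pipelining: a stage whose result is cached or checkpointed must be materialized, so a later map cannot be fused into it. A hypothetical stage object demonstrating the same predicate:

```ruby
# Hypothetical stage object (not ruby-spark's class) showing when fusing
# is safe: a cached or checkpointed stage must be materialized.
class StageSketch
  def initialize(cached: false, checkpointed: false)
    @cached = cached
    @checkpointed = checkpointed
  end

  def cached?
    @cached
  end

  def checkpointed?
    @checkpointed
  end

  # Same predicate as PipelinedRDD#pipelinable?
  def pipelinable?
    !(cached? || checkpointed?)
  end
end

p StageSketch.new.pipelinable?               # => true
p StageSketch.new(cached: true).pipelinable? # => false
```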