Class: Spacy::Span
Overview
See also spaCy Python API document for Span
.
Instance Attribute Summary collapse
-
#doc ⇒ Doc
readonly
The document to which the span belongs.
-
#py_span ⇒ Object
readonly
A Python
Span
instance accessible viaPyCall
.
Instance Method Summary collapse
-
#[](range) ⇒ Object
Returns a span if a range object is given or a token if an integer representing the position of the doc is given.
-
#as_doc ⇒ Doc
Creates a document instance from the span.
-
#conjuncts ⇒ Array<Token>
Returns tokens conjugated to the root of the span.
-
#each ⇒ Object
Iterates over the elements in the span yielding a token instance each time.
-
#ents ⇒ Array<Span>
Returns an array of spans that represents named entities.
-
#initialize(doc, py_span: nil, start_index: nil, end_index: nil, options: {}) ⇒ Span
constructor
It is recommended to use Doc#span method to create a span.
-
#label ⇒ String
Returns the label.
-
#lefts ⇒ Array<Token>
Returns tokens that are to the left of the span, whose heads are within the span.
-
#method_missing(name, *args) ⇒ Object
Methods defined in Python but not wrapped in ruby-spacy can be called by this dynamic method handling mechanism.
-
#noun_chunks ⇒ Array<Span>
Returns an array of spans of noun chunks.
- #respond_to_missing?(sym) ⇒ Boolean
-
#rights ⇒ Array<Token>
Returns Tokens that are to the right of the span, whose heads are within the span.
-
#root ⇒ Token
Returns the head token.
-
#sent ⇒ Span
Returns a span that represents the sentence that the given span is part of.
-
#sents ⇒ Array<Span>
Returns an array of spans that represents sentences.
-
#similarity(other) ⇒ Float
Returns a semantic similarity estimate.
-
#subtree ⇒ Array<Token>
Returns Tokens that are within the span and tokens that descend from them.
-
#tokens ⇒ Array<Token>
Returns an array of tokens contained in the span.
Constructor Details
#initialize(doc, py_span: nil, start_index: nil, end_index: nil, options: {}) ⇒ Span
It is recommended to use Doc#span method to create a span. If you need to
create one using #initialize, there are two method signatures:
Span.new(doc, py_span: Object)
or Span.new(doc, start_index: Integer, end_index: Integer, options: Hash)
.
384 385 386 387 |
# File 'lib/ruby-spacy.rb', line 384 def initialize(doc, py_span: nil, start_index: nil, end_index: nil, options: {}) @doc = doc @py_span = py_span || @py_span = PySpan.call(@doc.py_doc, start_index, end_index + 1, ) end |
Dynamic Method Handling
This class handles dynamic methods through the method_missing method
#method_missing(name, *args) ⇒ Object
Methods defined in Python but not wrapped in ruby-spacy can be called by this dynamic method handling mechanism.
522 523 524 |
# File 'lib/ruby-spacy.rb', line 522 def method_missing(name, *args) @py_span.send(name, *args) end |
Instance Attribute Details
#doc ⇒ Doc (readonly)
Returns the document to which the span belongs.
369 370 371 |
# File 'lib/ruby-spacy.rb', line 369 def doc @doc end |
#py_span ⇒ Object (readonly)
Returns a Python Span
instance accessible via PyCall
.
366 367 368 |
# File 'lib/ruby-spacy.rb', line 366 def py_span @py_span end |
Instance Method Details
#[](range) ⇒ Object
Returns a span if a range object is given or a token if an integer representing the position of the doc is given.
453 454 455 456 457 458 459 460 |
# File 'lib/ruby-spacy.rb', line 453 def [](range) if range.is_a?(Range) py_span = @py_span[range] Span.new(@doc, start_index: py_span.start, end_index: py_span.end - 1) else Token.new(@py_span[range]) end end |
#as_doc ⇒ Doc
Creates a document instance from the span
471 472 473 |
# File 'lib/ruby-spacy.rb', line 471 def as_doc Doc.new(@doc.py_nlp, text: text) end |
#conjuncts ⇒ Array<Token>
Returns tokens conjugated to the root of the span.
477 478 479 480 481 482 483 |
# File 'lib/ruby-spacy.rb', line 477 def conjuncts conjunct_array = [] PyCall::List.call(@py_span.conjuncts).each do |py_conjunct| conjunct_array << Token.new(py_conjunct) end conjunct_array end |
#each ⇒ Object
Iterates over the elements in the span yielding a token instance each time.
400 401 402 403 404 |
# File 'lib/ruby-spacy.rb', line 400 def each PyCall::List.call(@py_span).each do |py_token| yield Token.new(py_token) end end |
#ents ⇒ Array<Span>
Returns an array of spans that represents named entities.
436 437 438 439 440 441 442 |
# File 'lib/ruby-spacy.rb', line 436 def ents ent_array = [] PyCall::List.call(@py_span.ents).each do |py_span| ent_array << Span.new(@doc, py_span: py_span) end ent_array end |
#label ⇒ String
Returns the label
517 518 519 |
# File 'lib/ruby-spacy.rb', line 517 def label @py_span.label_ end |
#lefts ⇒ Array<Token>
Returns tokens that are to the left of the span, whose heads are within the span.
487 488 489 490 491 492 493 |
# File 'lib/ruby-spacy.rb', line 487 def lefts left_array = [] PyCall::List.call(@py_span.lefts).each do |py_left| left_array << Token.new(py_left) end left_array end |
#noun_chunks ⇒ Array<Span>
Returns an array of spans of noun chunks.
408 409 410 411 412 413 414 415 |
# File 'lib/ruby-spacy.rb', line 408 def noun_chunks chunk_array = [] py_chunks = PyCall::List.call(@py_span.noun_chunks) py_chunks.each do |py_span| chunk_array << Span.new(@doc, py_span: py_span) end chunk_array end |
#respond_to_missing?(sym) ⇒ Boolean
526 527 528 |
# File 'lib/ruby-spacy.rb', line 526 def respond_to_missing?(sym) sym ? true : super end |
#rights ⇒ Array<Token>
Returns Tokens that are to the right of the span, whose heads are within the span.
497 498 499 500 501 502 503 |
# File 'lib/ruby-spacy.rb', line 497 def rights right_array = [] PyCall::List.call(@py_span.rights).each do |py_right| right_array << Token.new(py_right) end right_array end |
#root ⇒ Token
Returns the head token
419 420 421 |
# File 'lib/ruby-spacy.rb', line 419 def root Token.new(@py_span.root) end |
#sent ⇒ Span
Returns a span that represents the sentence that the given span is part of.
446 447 448 449 |
# File 'lib/ruby-spacy.rb', line 446 def sent py_span = @py_span.sent Span.new(@doc, py_span: py_span) end |
#sents ⇒ Array<Span>
Returns an array of spans that represents sentences.
425 426 427 428 429 430 431 432 |
# File 'lib/ruby-spacy.rb', line 425 def sents sentence_array = [] py_sentences = PyCall::List.call(@py_span.sents) py_sentences.each do |py_span| sentence_array << Span.new(@doc, py_span: py_span) end sentence_array end |
#similarity(other) ⇒ Float
Returns a semantic similarity estimate.
465 466 467 |
# File 'lib/ruby-spacy.rb', line 465 def similarity(other) py_span.similarity(other.py_span) end |
#subtree ⇒ Array<Token>
Returns Tokens that are within the span and tokens that descend from them.
507 508 509 510 511 512 513 |
# File 'lib/ruby-spacy.rb', line 507 def subtree subtree_array = [] PyCall::List.call(@py_span.subtree).each do |py_subtree| subtree_array << Token.new(py_subtree) end subtree_array end |