Class: Spacy::Token
- Inherits:
-
Object
- Object
- Spacy::Token
- Defined in:
- lib/ruby-spacy.rb
Overview
See also spaCy Python API document for Token
.
Instance Attribute Summary collapse
-
#py_token ⇒ Object
readonly
A Python
Token
instance accessible viaPyCall
. -
#text ⇒ String
readonly
A string representing the token.
Instance Method Summary collapse
-
#ancestors ⇒ Array<Token>
Returns the token's ancestors.
-
#children ⇒ Array<Token>
Returns a sequence of the token's immediate syntactic children.
-
#dep ⇒ String
Returns the dependency relation by calling
dep_' of
@py_token` object. -
#ent_type ⇒ String
Returns the named entity type by calling
ent_type_' of
@py_token` object. -
#head ⇒ Token
Returns the head token.
-
#initialize(py_token) ⇒ Token
constructor
It is recommended to use Doc#tokens or Span#tokens methods to create tokens.
-
#lang ⇒ String
Returns the language by calling
lang_' of
@py_token` object. -
#lefts ⇒ Array<Token>
The leftward immediate children of the word in the syntactic dependency parse.
-
#lemma ⇒ String
Returns the lemma by calling
lemma_' of
@py_token` object. -
#lexeme ⇒ Lexeme
Returns a lexeme object.
-
#lower ⇒ String
Returns the lowercase form by calling
lower_' of
@py_token` object. -
#method_missing(name, *args) ⇒ Object
Methods defined in Python but not wrapped in ruby-spacy can be called by this dynamic method handling mechanism.
-
#morphology(hash: true) ⇒ Hash, String
Returns a hash or string of morphological information.
-
#pos ⇒ String
Returns the pos by calling
pos_' of
@py_token` object. - #respond_to_missing?(sym) ⇒ Boolean
-
#rights ⇒ Array<Token>
The rightward immediate children of the word in the syntactic dependency parse.
-
#shape ⇒ String
Returns the shape (e.g. "Xxxxx") by calling
shape_' of
@py_token` object. -
#subtree ⇒ Array<Token>
Returns the token in question and the tokens that descend from it.
-
#tag ⇒ String
Returns the fine-grained pos by calling
tag_' of
@py_token` object. -
#to_s ⇒ String
String representation of the token.
-
#whitespace ⇒ String
Returns the trailing space character if present by calling
whitespace_' of
@py_token` object.
Constructor Details
#initialize(py_token) ⇒ Token
It is recommended to use Doc#tokens or Span#tokens methods to create tokens.
There is no way to generate a token from scratch but relying on a pre-exising Python Token
object.
542 543 544 545 |
# File 'lib/ruby-spacy.rb', line 542 def initialize(py_token) @py_token = py_token @text = @py_token.text end |
Dynamic Method Handling
This class handles dynamic methods through the method_missing method
#method_missing(name, *args) ⇒ Object
Methods defined in Python but not wrapped in ruby-spacy can be called by this dynamic method handling mechanism.
688 689 690 |
# File 'lib/ruby-spacy.rb', line 688 def method_missing(name, *args) @py_token.send(name, *args) end |
Instance Attribute Details
#py_token ⇒ Object (readonly)
Returns a Python Token
instance accessible via PyCall
.
534 535 536 |
# File 'lib/ruby-spacy.rb', line 534 def py_token @py_token end |
#text ⇒ String (readonly)
Returns a string representing the token.
537 538 539 |
# File 'lib/ruby-spacy.rb', line 537 def text @text end |
Instance Method Details
#ancestors ⇒ Array<Token>
Returns the token's ancestors.
565 566 567 568 569 570 571 |
# File 'lib/ruby-spacy.rb', line 565 def ancestors ancestor_array = [] PyCall::List.call(@py_token.ancestors).each do |ancestor| ancestor_array << Token.new(ancestor) end ancestor_array end |
#children ⇒ Array<Token>
Returns a sequence of the token's immediate syntactic children.
575 576 577 578 579 580 581 |
# File 'lib/ruby-spacy.rb', line 575 def children child_array = [] PyCall::List.call(@py_token.children).each do |child| child_array << Token.new(child) end child_array end |
#dep ⇒ String
Returns the dependency relation by calling dep_' of
@py_token` object
659 660 661 |
# File 'lib/ruby-spacy.rb', line 659 def dep @py_token.dep_ end |
#ent_type ⇒ String
Returns the named entity type by calling ent_type_' of
@py_token` object
677 678 679 |
# File 'lib/ruby-spacy.rb', line 677 def ent_type @py_token.ent_type_ end |
#head ⇒ Token
Returns the head token
549 550 551 |
# File 'lib/ruby-spacy.rb', line 549 def head Token.new(@py_token.head) end |
#lang ⇒ String
Returns the language by calling lang_' of
@py_token` object
665 666 667 |
# File 'lib/ruby-spacy.rb', line 665 def lang @py_token.lang_ end |
#lefts ⇒ Array<Token>
The leftward immediate children of the word in the syntactic dependency parse.
585 586 587 588 589 590 591 |
# File 'lib/ruby-spacy.rb', line 585 def lefts token_array = [] PyCall::List.call(@py_token.lefts).each do |token| token_array << Token.new(token) end token_array end |
#lemma ⇒ String
Returns the lemma by calling lemma_' of
@py_token` object
629 630 631 |
# File 'lib/ruby-spacy.rb', line 629 def lemma @py_token.lemma_ end |
#lexeme ⇒ Lexeme
Returns a lexeme object
683 684 685 |
# File 'lib/ruby-spacy.rb', line 683 def lexeme Lexeme.new(@py_token.lex) end |
#lower ⇒ String
Returns the lowercase form by calling lower_' of
@py_token` object
635 636 637 |
# File 'lib/ruby-spacy.rb', line 635 def lower @py_token.lower_ end |
#morphology(hash: true) ⇒ Hash, String
Returns a hash or string of morphological information
612 613 614 615 616 617 618 619 620 621 622 623 624 625 |
# File 'lib/ruby-spacy.rb', line 612 def morphology(hash: true) if @py_token.has_morph morph_analysis = @py_token.morph if hash morph_analysis.to_dict else morph_analysis.to_s end elsif hash {} else "" end end |
#pos ⇒ String
Returns the pos by calling pos_' of
@py_token` object
647 648 649 |
# File 'lib/ruby-spacy.rb', line 647 def pos @py_token.pos_ end |
#respond_to_missing?(sym) ⇒ Boolean
692 693 694 |
# File 'lib/ruby-spacy.rb', line 692 def respond_to_missing?(sym) sym ? true : super end |
#rights ⇒ Array<Token>
The rightward immediate children of the word in the syntactic dependency parse.
595 596 597 598 599 600 601 |
# File 'lib/ruby-spacy.rb', line 595 def rights token_array = [] PyCall::List.call(@py_token.rights).each do |token| token_array << Token.new(token) end token_array end |
#shape ⇒ String
Returns the shape (e.g. "Xxxxx") by calling shape_' of
@py_token` object
641 642 643 |
# File 'lib/ruby-spacy.rb', line 641 def shape @py_token.shape_ end |
#subtree ⇒ Array<Token>
Returns the token in question and the tokens that descend from it.
555 556 557 558 559 560 561 |
# File 'lib/ruby-spacy.rb', line 555 def subtree descendant_array = [] PyCall::List.call(@py_token.subtree).each do |descendant| descendant_array << Token.new(descendant) end descendant_array end |
#tag ⇒ String
Returns the fine-grained pos by calling tag_' of
@py_token` object
653 654 655 |
# File 'lib/ruby-spacy.rb', line 653 def tag @py_token.tag_ end |
#to_s ⇒ String
String representation of the token.
605 606 607 |
# File 'lib/ruby-spacy.rb', line 605 def to_s @text end |
#whitespace ⇒ String
Returns the trailing space character if present by calling whitespace_' of
@py_token` object
671 672 673 |
# File 'lib/ruby-spacy.rb', line 671 def whitespace @py_token.whitespace_ end |