Class: Transformers::XlmRoberta::XLMRobertaPreTrainedModel
- Inherits: PreTrainedModel
  - Object
  - Torch::NN::Module
  - PreTrainedModel
  - Transformers::XlmRoberta::XLMRobertaPreTrainedModel
- Defined in: lib/transformers/models/xlm_roberta/modeling_xlm_roberta.rb
Direct Known Subclasses
XLMRobertaForCausalLM, XLMRobertaForMaskedLM, XLMRobertaForMultipleChoice, XLMRobertaForQuestionAnswering, XLMRobertaForSequenceClassification, XLMRobertaForTokenClassification, XLMRobertaModel
Instance Attribute Summary
Attributes inherited from PreTrainedModel
Instance Method Summary
- #_init_weights(module_) ⇒ Object
  self.supports_gradient_checkpointing = true; self._no_split_modules = ["XLMRobertaEmbeddings", "XLMRobertaSelfAttention", "XLMRobertaSdpaSelfAttention"]; self._supports_sdpa = true.
Methods inherited from PreTrainedModel
#_backward_compatibility_gradient_checkpointing, #_initialize_weights, #base_model, #can_generate, #dequantize, #dummy_inputs, #framework, from_pretrained, #get_input_embeddings, #get_output_embeddings, #init_weights, #initialize, #post_init, #prune_heads, #set_input_embeddings, #tie_weights, #warn_if_padding_and_no_attention_mask
Methods included from ClassAttribute
Methods included from ModuleUtilsMixin
#device, #get_extended_attention_mask, #get_head_mask
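All of the direct subclasses listed above inherit from_pretrained from PreTrainedModel. A minimal usage sketch; the checkpoint name "xlm-roberta-base" and the require path are assumptions, not taken from this page:

require "transformers"  # gem "transformers-rb"; require path assumed

# Load the base encoder; the task-specific subclasses listed above
# (e.g. XLMRobertaForSequenceClassification) are loaded the same way
# from a compatible checkpoint.
model = Transformers::XlmRoberta::XLMRobertaModel.from_pretrained("xlm-roberta-base")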
Constructor Details
This class inherits a constructor from Transformers::PreTrainedModel
Instance Method Details
#_init_weights(module_) ⇒ Object
self.supports_gradient_checkpointing = true; self._no_split_modules = ["XLMRobertaEmbeddings", "XLMRobertaSelfAttention", "XLMRobertaSdpaSelfAttention"]; self._supports_sdpa = true
# File 'lib/transformers/models/xlm_roberta/modeling_xlm_roberta.rb', line 586

def _init_weights(module_)
  if module_.is_a?(Torch::NN::Linear)
    # Slightly different from the TF version which uses truncated_normal for initialization
    # cf https://github.com/pytorch/pytorch/pull/5617
    module_.weight.data.normal!(mean: 0.0, std: @config.initializer_range)
    if !module_.bias.nil?
      module_.bias.data.zero!
    end
  elsif module_.is_a?(Torch::NN::Embedding)
    module_.weight.data.normal!(mean: 0.0, std: @config.initializer_range)
    if !module_.padding_idx.nil?
      module_.weight.data.fetch(module_.padding_idx).zero!
    end
  elsif module_.is_a?(Torch::NN::LayerNorm)
    module_.bias.data.zero!
    module_.weight.data.fill!(1.0)
  end
end
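The inherited #init_weights / #_initialize_weights helpers (listed in the method summary) invoke this hook once per submodule. A rough illustrative sketch of the Torch::NN::Linear branch applied by hand to a standalone layer; the std value 0.02 is an assumed stand-in for a config's initializer_range:

require "torch"

# Build a throwaway layer and initialize it the same way _init_weights would:
# normally distributed weights, zeroed bias.
layer = Torch::NN::Linear.new(768, 768)
layer.weight.data.normal!(mean: 0.0, std: 0.02)  # 0.02 stands in for @config.initializer_range
layer.bias.data.zero! unless layer.bias.nil?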