Class: Google::Cloud::Dataplex::V1::DataDiscoverySpec

Inherits:

Object

Object
Google::Cloud::Dataplex::V1::DataDiscoverySpec

Extended by: Google::Protobuf::MessageExts::ClassMethods

Includes: Google::Protobuf::MessageExts

Defined in: proto_docs/google/cloud/dataplex/v1/data_discovery.rb

Overview

Spec for a data discovery scan.

Defined Under Namespace

Classes: BigQueryPublishingConfig, StorageConfig

Instance Attribute Summary

#bigquery_publishing_config ⇒ ::Google::Cloud::Dataplex::V1::DataDiscoverySpec::BigQueryPublishingConfig
Optional. Configuration for metadata publishing.
#storage_config ⇒ ::Google::Cloud::Dataplex::V1::DataDiscoverySpec::StorageConfig
Cloud Storage related configurations.

Instance Attribute Details

#bigquery_publishing_config ⇒ `::Google::Cloud::Dataplex::V1::DataDiscoverySpec::BigQueryPublishingConfig`

Returns the configuration for metadata publishing. Optional.

Returns:

(::Google::Cloud::Dataplex::V1::DataDiscoverySpec::BigQueryPublishingConfig) —
Optional. Configuration for metadata publishing.

# File 'proto_docs/google/cloud/dataplex/v1/data_discovery.rb', line 31

class DataDiscoverySpec
  include ::Google::Protobuf::MessageExts
  extend ::Google::Protobuf::MessageExts::ClassMethods

  # Describes BigQuery publishing configurations.
  # @!attribute [rw] table_type
  #   @return [::Google::Cloud::Dataplex::V1::DataDiscoverySpec::BigQueryPublishingConfig::TableType]
  #     Optional. Determines whether to publish discovered tables as BigLake
  #     external tables or non-BigLake external tables.
  # @!attribute [rw] connection
  #   @return [::String]
  #     Optional. The BigQuery connection used to create BigLake tables.
  #     Must be in the form
  #     `projects/{project_id}/locations/{location_id}/connections/{connection_id}`
  # @!attribute [rw] location
  #   @return [::String]
  #     Optional. The location of the BigQuery dataset to publish BigLake
  #     external or non-BigLake external tables to.
  #     1. If the Cloud Storage bucket is in a multi-region, then the BigQuery
  #     dataset can be in the same multi-region or in any single region that
  #     the multi-region includes. The datascan can be created in any single
  #     region that the multi-region includes.
  #     2. If the Cloud Storage bucket is in a dual-region, then the BigQuery
  #     dataset can be in regions that the dual-region includes, or in a
  #     multi-region that includes the dual-region. The datascan can be
  #     created in any single region that the dual-region includes.
  #     3. If the Cloud Storage bucket is in a single region, then the BigQuery
  #     dataset can be in the same single region or in any multi-region that
  #     includes that region. The datascan is created in the same single
  #     region as the bucket.
  #     4. If the BigQuery dataset is in a single region, it must be in the same
  #     single region as the datascan.
  #
  #     For supported values, refer to
  #     https://cloud.google.com/bigquery/docs/locations#supported_locations.
  class BigQueryPublishingConfig
    include ::Google::Protobuf::MessageExts
    extend ::Google::Protobuf::MessageExts::ClassMethods

    # Determines how discovered tables are published.
    module TableType
      # Table type unspecified.
      TABLE_TYPE_UNSPECIFIED = 0

      # Default. Discovered tables are published as BigQuery external tables
      # whose data is accessed using the credentials of the user querying the
      # table.
      EXTERNAL = 1

      # Discovered tables are published as BigLake external tables whose data
      # is accessed using the credentials of the associated BigQuery
      # connection.
      BIGLAKE = 2
    end
  end

  # Configurations related to Cloud Storage as the data source.
  # @!attribute [rw] include_patterns
  #   @return [::Array<::String>]
  #     Optional. Defines the data to include during discovery when only a subset
  #     of the data should be considered. Provide a list of patterns that
  #     identify the data to include. For Cloud Storage bucket assets, these
  #     patterns are interpreted as glob patterns used to match object names. For
  #     BigQuery dataset assets, these patterns are interpreted as patterns to
  #     match table names.
  # @!attribute [rw] exclude_patterns
  #   @return [::Array<::String>]
  #     Optional. Defines the data to exclude during discovery. Provide a list of
  #     patterns that identify the data to exclude. For Cloud Storage bucket
  #     assets, these patterns are interpreted as glob patterns used to match
  #     object names. For BigQuery dataset assets, these patterns are interpreted
  #     as patterns to match table names.
  # @!attribute [rw] csv_options
  #   @return [::Google::Cloud::Dataplex::V1::DataDiscoverySpec::StorageConfig::CsvOptions]
  #     Optional. Configuration for CSV data.
  # @!attribute [rw] json_options
  #   @return [::Google::Cloud::Dataplex::V1::DataDiscoverySpec::StorageConfig::JsonOptions]
  #     Optional. Configuration for JSON data.
  class StorageConfig
    include ::Google::Protobuf::MessageExts
    extend ::Google::Protobuf::MessageExts::ClassMethods

    # Describes CSV and similar semi-structured data formats.
    # @!attribute [rw] header_rows
    #   @return [::Integer]
    #     Optional. The number of rows to interpret as header rows that should be
    #     skipped when reading data rows.
    # @!attribute [rw] delimiter
    #   @return [::String]
    #     Optional. The delimiter that is used to separate values. The default is
    #     `,` (comma).
    # @!attribute [rw] encoding
    #   @return [::String]
    #     Optional. The character encoding of the data. The default is UTF-8.
    # @!attribute [rw] type_inference_disabled
    #   @return [::Boolean]
    #     Optional. Whether to disable the inference of data types for CSV data.
    #     If true, all columns are registered as strings.
    # @!attribute [rw] quote
    #   @return [::String]
    #     Optional. The character used to quote column values. Accepts `"`
    #     (double quotation mark) or `'` (single quotation mark). If unspecified,
    #     defaults to `"` (double quotation mark).
    class CsvOptions
      include ::Google::Protobuf::MessageExts
      extend ::Google::Protobuf::MessageExts::ClassMethods
    end

    # Describes JSON data format.
    # @!attribute [rw] encoding
    #   @return [::String]
    #     Optional. The character encoding of the data. The default is UTF-8.
    # @!attribute [rw] type_inference_disabled
    #   @return [::Boolean]
    #     Optional. Whether to disable the inference of data types for JSON data.
    #     If true, all columns are registered as their primitive types
    #     (string, number, or boolean).
    class JsonOptions
      include ::Google::Protobuf::MessageExts
      extend ::Google::Protobuf::MessageExts::ClassMethods
    end
  end
end
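
The `connection` field documented above must follow the form `projects/{project_id}/locations/{location_id}/connections/{connection_id}`. The following is a minimal sketch of how a caller might assemble that name and collect the documented `BigQueryPublishingConfig` fields before building the message. The helper names `connection_name` and `bigquery_publishing_config` and the sample IDs are hypothetical and are not part of the gem. The sketch uses plain Ruby values so that it runs without the gem installed.

```ruby
# Hypothetical helper (not part of the gem): builds a connection resource name
# in the documented form and rejects IDs that would break that form.
def connection_name(project_id, location_id, connection_id)
  name = "projects/#{project_id}/locations/#{location_id}/connections/#{connection_id}"
  pattern = %r{\Aprojects/[^/]+/locations/[^/]+/connections/[^/]+\z}
  raise ArgumentError, "malformed connection name: #{name}" unless name.match?(pattern)

  name
end

# Hypothetical helper: returns a plain Hash whose keys mirror the documented
# BigQueryPublishingConfig attributes (table_type, connection, location).
def bigquery_publishing_config(connection:, location:, table_type: :BIGLAKE)
  { table_type: table_type, connection: connection, location: location }
end

config = bigquery_publishing_config(
  connection: connection_name("my-project", "us-central1", "my-connection"),
  location: "us-central1"
)
puts config[:connection]
```

Assuming the google-cloud-dataplex-v1 gem is loaded, a Hash like this could be passed as keyword arguments to `BigQueryPublishingConfig.new`, since generated protobuf messages accept field values at construction and enum fields accept symbols such as `:BIGLAKE`.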

#storage_config ⇒ `::Google::Cloud::Dataplex::V1::DataDiscoverySpec::StorageConfig`

Returns Cloud Storage related configurations.

Returns:

(::Google::Cloud::Dataplex::V1::DataDiscoverySpec::StorageConfig) —
Cloud Storage related configurations.
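
As a rough illustration of how `include_patterns` and `exclude_patterns` interact, the sketch below builds a Hash shaped like the documented `StorageConfig` and applies the patterns locally to a few object names. It is only an approximation: it uses Ruby's `File.fnmatch?` for glob matching, and it assumes that an empty include list selects everything and that exclusion wins over inclusion. The service's actual matching rules may differ. The helper names `storage_config` and `selected?` and the sample object names are hypothetical.

```ruby
# Hypothetical helper: returns a plain Hash whose keys mirror the documented
# StorageConfig attributes, with CsvOptions set to the documented defaults.
def storage_config(include_patterns: [], exclude_patterns: [], header_rows: 0)
  {
    include_patterns: include_patterns,
    exclude_patterns: exclude_patterns,
    csv_options: { header_rows: header_rows, delimiter: ",", encoding: "UTF-8", quote: "\"" }
  }
end

# Local approximation only: an object is selected when it matches at least one
# include pattern (or no include patterns are given) and no exclude pattern.
def selected?(object_name, config)
  includes = config[:include_patterns]
  excludes = config[:exclude_patterns]
  included = includes.empty? || includes.any? { |pattern| File.fnmatch?(pattern, object_name) }
  excluded = excludes.any? { |pattern| File.fnmatch?(pattern, object_name) }
  included && !excluded
end

config = storage_config(
  include_patterns: ["data/*.csv"],
  exclude_patterns: ["data/tmp/*"],
  header_rows: 1
)

%w[data/a.csv data/tmp/b.csv logs/x.json].each do |name|
  puts "#{name}: #{selected?(name, config)}"
end
```

Previewing patterns this way can help catch an overly broad exclude before a scan is created, but the result should be checked against an actual discovery run.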
