oracle.oci.oci_data_labeling_service_dataset_actions – Perform actions on a Dataset resource in Oracle Cloud Infrastructure

Note

This plugin is part of the oracle.oci collection (version 4.14.0).

You might already have this collection installed if you are using the ansible package. It is not included in ansible-core. To check whether it is installed, run ansible-galaxy collection list.

To install it, use: ansible-galaxy collection install oracle.oci.

To use it in a playbook, specify: oracle.oci.oci_data_labeling_service_dataset_actions.

New in version 2.9.0: of oracle.oci

Synopsis

  • Perform actions on a Dataset resource in Oracle Cloud Infrastructure

  • For action=add_dataset_labels, add Labels to the Dataset LabelSet until the maximum number of Labels has been reached.

  • For action=change_compartment, moves a Dataset resource from one compartment identifier to another. When provided, If-Match is checked against ETag values of the resource.

  • For action=generate_dataset_records, generates Record resources from the Dataset’s data source

  • For action=remove_dataset_labels, removes the labels from the Dataset Labelset. Labels can only be removed if there are no Annotations associated with the Dataset that reference the Label names.

  • For action=rename_dataset_labels, renames the labels from the Dataset Labelset. Labels that are renamed will be reflected in Annotations associated with the Dataset that reference the Label names.

  • For action=snapshot, writes the dataset records and annotations in a consolidated format out to an object storage reference for consumption. While the snapshot takes place, there may be a time while records and annotations cannot be created to ensure the snapshot is a point in time.

Requirements

The below requirements are needed on the host that executes this module.

Parameters

Parameter Choices/Defaults Comments
action
string / required
    Choices:
  • add_dataset_labels
  • change_compartment
  • generate_dataset_records
  • remove_dataset_labels
  • rename_dataset_labels
  • snapshot
The action to perform on the Dataset.
api_user
string
The OCID of the user, on whose behalf, OCI APIs are invoked. If not set, then the value of the OCI_USER_ID environment variable, if any, is used. This option is required if the user is not specified through a configuration file (See config_file_location). To get the user's OCID, please refer https://docs.us-phoenix-1.oraclecloud.com/Content/API/Concepts/apisigningkey.htm.
api_user_fingerprint
string
Fingerprint for the key pair being used. If not set, then the value of the OCI_USER_FINGERPRINT environment variable, if any, is used. This option is required if the key fingerprint is not specified through a configuration file (See config_file_location). To get the key pair's fingerprint value please refer https://docs.us-phoenix-1.oraclecloud.com/Content/API/Concepts/apisigningkey.htm.
api_user_key_file
string
Full path and filename of the private key (in PEM format). If not set, then the value of the OCI_USER_KEY_FILE variable, if any, is used. This option is required if the private key is not specified through a configuration file (See config_file_location). If the key is encrypted with a pass-phrase, the api_user_key_pass_phrase option must also be provided.
api_user_key_pass_phrase
string
Passphrase used by the key referenced in api_user_key_file, if it is encrypted. If not set, then the value of the OCI_USER_KEY_PASS_PHRASE variable, if any, is used. This option is required if the key passphrase is not specified through a configuration file (See config_file_location).
are_annotations_included
boolean
    Choices:
  • no
  • yes
Whether annotations are to be included in the export dataset digest.
Required for action=snapshot.
are_unannotated_records_included
boolean
    Choices:
  • no
  • yes
Whether to include records that have yet to be annotated in the export dataset digest.
Required for action=snapshot.
auth_purpose
string
    Choices:
  • service_principal
The auth purpose which can be used in conjunction with 'auth_type=instance_principal'. The default auth_purpose for instance_principal is None.
auth_type
string
    Choices:
  • api_key ←
  • instance_principal
  • instance_obo_user
  • resource_principal
The type of authentication to use for making API requests. By default auth_type="api_key" based authentication is performed and the API key (see api_user_key_file) in your config file will be used. If this 'auth_type' module option is not specified, the value of the OCI_ANSIBLE_AUTH_TYPE, if any, is used. Use auth_type="instance_principal" to use instance principal based authentication when running ansible playbooks within an OCI compute instance.
cert_bundle
string
The full path to a CA certificate bundle to be used for SSL verification. This will override the default CA certificate bundle. If not set, then the value of the OCI_ANSIBLE_CERT_BUNDLE variable, if any, is used.
compartment_id
string
The OCID of the compartment where the resource should be moved.
Required for action=change_compartment.
config_file_location
string
Path to configuration file. If not set then the value of the OCI_CONFIG_FILE environment variable, if any, is used. Otherwise, defaults to ~/.oci/config.
config_profile_name
string
The profile to load from the config file referenced by config_file_location. If not set, then the value of the OCI_CONFIG_PROFILE environment variable, if any, is used. Otherwise, defaults to the "DEFAULT" profile in config_file_location.
dataset_id
string / required
Unique Dataset OCID

aliases: id
export_details
dictionary
Required for action=snapshot.
bucket
string / required
Bucket name
export_type
string / required
    Choices:
  • OBJECT_STORAGE
The target destination for the snapshot. Using OBJECT_STORAGE means the snapshot will be written to Object Storage.
namespace
string / required
Bucket namespace name
prefix
string
Object path prefix to put snapshot file(s)
export_format
dictionary
Applicable only for action=snapshot.
name
string
    Choices:
  • JSONL
  • JSONL_CONSOLIDATED
  • CONLL
  • SPACY
  • COCO
  • YOLO
  • PASCAL_VOC
  • JSONL_COMPACT_PLUS_CONTENT
Name of export format.
version
string
    Choices:
  • V2003
  • V5
Version of export format.
label_set
dictionary
Applicable only for action=add_dataset_labelsaction=remove_dataset_labels.
items
list / elements=dictionary
An ordered collection of labels that are unique by name.
name
string
An unique name for a label within its dataset.
limit
float
the maximum number of records to generate.
Applicable only for action=generate_dataset_records.
region
string
The Oracle Cloud Infrastructure region to use for all OCI API requests. If not set, then the value of the OCI_REGION variable, if any, is used. This option is required if the region is not specified through a configuration file (See config_file_location). Please refer to https://docs.us-phoenix-1.oraclecloud.com/Content/General/Concepts/regions.htm for more information on OCI regions.
source_label_set
dictionary
Applicable only for action=rename_dataset_labels.
items
list / elements=dictionary
An ordered collection of labels that are unique by name.
name
string
An unique name for a label within its dataset.
target_label_set
dictionary
Applicable only for action=rename_dataset_labels.
items
list / elements=dictionary
An ordered collection of labels that are unique by name.
name
string
An unique name for a label within its dataset.
tenancy
string
OCID of your tenancy. If not set, then the value of the OCI_TENANCY variable, if any, is used. This option is required if the tenancy OCID is not specified through a configuration file (See config_file_location). To get the tenancy OCID, please refer https://docs.us-phoenix-1.oraclecloud.com/Content/API/Concepts/apisigningkey.htm
wait
boolean
    Choices:
  • no
  • yes ←
Whether to wait for create or delete operation to complete.
wait_timeout
integer
Time, in seconds, to wait when wait=yes. Defaults to 1200 for most of the services but some services might have a longer wait timeout.

Examples

- name: Perform action add_dataset_labels on dataset
  oci_data_labeling_service_dataset_actions:
    # required
    dataset_id: "ocid1.dataset.oc1..xxxxxxEXAMPLExxxxxx"
    action: add_dataset_labels

    # optional
    label_set:
      # optional
      items:
      - # optional
        name: name_example

- name: Perform action change_compartment on dataset
  oci_data_labeling_service_dataset_actions:
    # required
    compartment_id: "ocid1.compartment.oc1..xxxxxxEXAMPLExxxxxx"
    dataset_id: "ocid1.dataset.oc1..xxxxxxEXAMPLExxxxxx"
    action: change_compartment

- name: Perform action generate_dataset_records on dataset
  oci_data_labeling_service_dataset_actions:
    # required
    dataset_id: "ocid1.dataset.oc1..xxxxxxEXAMPLExxxxxx"
    action: generate_dataset_records

    # optional
    limit: 3.4

- name: Perform action remove_dataset_labels on dataset
  oci_data_labeling_service_dataset_actions:
    # required
    dataset_id: "ocid1.dataset.oc1..xxxxxxEXAMPLExxxxxx"
    action: remove_dataset_labels

    # optional
    label_set:
      # optional
      items:
      - # optional
        name: name_example

- name: Perform action rename_dataset_labels on dataset
  oci_data_labeling_service_dataset_actions:
    # required
    dataset_id: "ocid1.dataset.oc1..xxxxxxEXAMPLExxxxxx"
    action: rename_dataset_labels

    # optional
    source_label_set:
      # optional
      items:
      - # optional
        name: name_example
    target_label_set:
      # optional
      items:
      - # optional
        name: name_example

- name: Perform action snapshot on dataset
  oci_data_labeling_service_dataset_actions:
    # required
    dataset_id: "ocid1.dataset.oc1..xxxxxxEXAMPLExxxxxx"
    are_annotations_included: true
    are_unannotated_records_included: true
    export_details:
      # required
      export_type: OBJECT_STORAGE
      namespace: namespace_example
      bucket: bucket_example

      # optional
      prefix: prefix_example
    action: snapshot

    # optional
    export_format:
      # optional
      name: JSONL
      version: V2003

Return Values

Common return values are documented here, the following are the fields unique to this module:

Key Returned Description
dataset
complex
on success
Details of the Dataset resource acted upon by the current operation

Sample:
{'annotation_format': 'annotation_format_example', 'compartment_id': 'ocid1.compartment.oc1..xxxxxxEXAMPLExxxxxx', 'dataset_format_details': {'format_type': 'DOCUMENT', 'text_file_type_metadata': {'column_delimiter': 'column_delimiter_example', 'column_index': 56, 'column_name': 'column_name_example', 'escape_character': 'escape_character_example', 'format_type': 'DELIMITED', 'line_delimiter': 'line_delimiter_example'}}, 'dataset_source_details': {'bucket': 'bucket_example', 'namespace': 'namespace_example', 'prefix': 'prefix_example', 'source_type': 'OBJECT_STORAGE'}, 'defined_tags': {'Operations': {'CostCenter': 'US'}}, 'description': 'description_example', 'display_name': 'display_name_example', 'freeform_tags': {'Department': 'Finance'}, 'id': 'ocid1.resource.oc1..xxxxxxEXAMPLExxxxxx', 'initial_record_generation_configuration': {'limit': 10}, 'label_set': {'items': [{'name': 'name_example'}]}, 'labeling_instructions': 'labeling_instructions_example', 'lifecycle_details': 'lifecycle_details_example', 'lifecycle_state': 'CREATING', 'system_tags': {}, 'time_created': '2013-10-20T19:20:30+01:00', 'time_updated': '2013-10-20T19:20:30+01:00'}
 
annotation_format
string
on success
The annotation format name required for labeling records.

Sample:
annotation_format_example
 
compartment_id
string
on success
The OCID of the compartment of the resource.

Sample:
ocid1.compartment.oc1..xxxxxxEXAMPLExxxxxx
 
dataset_format_details
complex
on success

   
format_type
string
on success
The format type. DOCUMENT format is for record contents that are PDFs or TIFFs. IMAGE format is for record contents that are JPEGs or PNGs. TEXT format is for record contents that are TXT files.

Sample:
DOCUMENT
   
text_file_type_metadata
complex
on success

     
column_delimiter
string
on success
A column delimiter

Sample:
column_delimiter_example
     
column_index
integer
on success
The index of a selected column. This is a zero-based index.

Sample:
56
     
column_name
string
on success
The name of a selected column.

Sample:
column_name_example
     
escape_character
string
on success
An escape character.

Sample:
escape_character_example
     
format_type
string
on success
It defines the format type of text files.

Sample:
DELIMITED
     
line_delimiter
string
on success
A line delimiter.

Sample:
line_delimiter_example
 
dataset_source_details
complex
on success

   
bucket
string
on success
The object storage bucket that contains the dataset data source.

Sample:
bucket_example
   
namespace
string
on success
The namespace of the bucket that contains the dataset data source.

Sample:
namespace_example
   
prefix
string
on success
A common path prefix shared by the objects that make up the dataset. Except for the CSV file type, records are not generated for the objects whose names exactly match with the prefix.

Sample:
prefix_example
   
source_type
string
on success
The source type. OBJECT_STORAGE allows the user to describe where in object storage the dataset is.

Sample:
OBJECT_STORAGE
 
defined_tags
dictionary
on success
The defined tags for this resource. Each key is predefined and scoped to a namespace. For example: `{"foo-namespace": {"bar-key": "value"}}`

Sample:
{'Operations': {'CostCenter': 'US'}}
 
description
string
on success
A user provided description of the dataset

Sample:
description_example
 
display_name
string
on success
A user-friendly display name for the resource.

Sample:
display_name_example
 
freeform_tags
dictionary
on success
A simple key-value pair that is applied without any predefined name, type, or scope. It exists for cross-compatibility only. For example: `{"bar-key": "value"}`

Sample:
{'Department': 'Finance'}
 
id
string
on success
The OCID of the Dataset.

Sample:
ocid1.resource.oc1..xxxxxxEXAMPLExxxxxx
 
initial_record_generation_configuration
complex
on success

   
limit
float
on success
The maximum number of records to generate.

Sample:
10
 
label_set
complex
on success

   
items
complex
on success
An ordered collection of labels that are unique by name.

     
name
string
on success
An unique name for a label within its dataset.

Sample:
name_example
 
labeling_instructions
string
on success
The labeling instructions for human labelers in rich text format

Sample:
labeling_instructions_example
 
lifecycle_details
string
on success
A message describing the current state in more detail. For example, it can be used to provide actionable information for a resource in FAILED or NEEDS_ATTENTION state.

Sample:
lifecycle_details_example
 
lifecycle_state
string
on success
The state of a dataset. CREATING - The dataset is being created. It will transition to ACTIVE when it is ready for labeling. ACTIVE - The dataset is ready for labeling. UPDATING - The dataset is being updated. It and its related resources may be unavailable for other updates until it returns to ACTIVE. NEEDS_ATTENTION - A dataset updation operation has failed due to validation or other errors and needs attention. DELETING - The dataset and its related resources are being deleted. DELETED - The dataset has been deleted and is no longer available. FAILED - The dataset has failed due to validation or other errors.

Sample:
CREATING
 
system_tags
dictionary
on success
The usage of system tag keys. These predefined keys are scoped to namespaces. For example: `{"orcl-cloud": {"free-tier-retained": "true"}}`

 
time_created
string
on success
The date and time the resource was created, in the timestamp format defined by RFC3339.

Sample:
2013-10-20T19:20:30+01:00
 
time_updated
string
on success
The date and time the resource was last updated, in the timestamp format defined by RFC3339.

Sample:
2013-10-20T19:20:30+01:00


Authors

  • Oracle (@oracle)