AWSComprehendAugmentedManifestsListItem

Objective-C

@interface AWSComprehendAugmentedManifestsListItem

Swift

class AWSComprehendAugmentedManifestsListItem

An augmented manifest file that provides training data for your custom model. An augmented manifest file is a labeled dataset that is produced by Amazon SageMaker Ground Truth.

Required parameters: [S3Uri, AttributeNames]

  • The S3 prefix to the annotation files that are referred in the augmented manifest file.

    Declaration

    Objective-C

    @property (nonatomic, strong) NSString *_Nullable annotationDataS3Uri;

    Swift

    var annotationDataS3Uri: String? { get set }
  • The JSON attribute that contains the annotations for your training documents. The number of attribute names that you specify depends on whether your augmented manifest file is the output of a single labeling job or a chained labeling job.

    If your file is the output of a single labeling job, specify the LabelAttributeName key that was used when the job was created in Ground Truth.

    If your file is the output of a chained labeling job, specify the LabelAttributeName key for one or more jobs in the chain. Each LabelAttributeName key provides the annotations from an individual job.

    Declaration

    Objective-C

    @property (nonatomic, strong) NSArray<NSString *> *_Nullable attributeNames;

    Swift

    var attributeNames: [String]? { get set }
  • The type of augmented manifest. PlainTextDocument or SemiStructuredDocument. If you don’t specify, the default is PlainTextDocument.

    • PLAIN_TEXT_DOCUMENT A document type that represents any unicode text that is encoded in UTF-8.

    • SEMI_STRUCTURED_DOCUMENT A document type with positional and structural context, like a PDF. For training with Amazon Comprehend, only PDFs are supported. For inference, Amazon Comprehend support PDFs, DOCX and TXT.

    Declaration

    Objective-C

    @property (nonatomic) AWSComprehendAugmentedManifestsDocumentTypeFormat documentType;

    Swift

    var documentType: AWSComprehendAugmentedManifestsDocumentTypeFormat { get set }
  • The Amazon S3 location of the augmented manifest file.

    Declaration

    Objective-C

    @property (nonatomic, strong) NSString *_Nullable s3Uri;

    Swift

    var s3Uri: String? { get set }
  • The S3 prefix to the source files (PDFs) that are referred to in the augmented manifest file.

    Declaration

    Objective-C

    @property (nonatomic, strong) NSString *_Nullable sourceDocumentsS3Uri;

    Swift

    var sourceDocumentsS3Uri: String? { get set }
  • The purpose of the data you’ve provided in the augmented manifest. You can either train or test this data. If you don’t specify, the default is train.

    TRAIN - all of the documents in the manifest will be used for training. If no test documents are provided, Amazon Comprehend will automatically reserve a portion of the training documents for testing.

    TEST - all of the documents in the manifest will be used for testing.

    Declaration

    Objective-C

    @property (nonatomic) AWSComprehendSplit split;

    Swift

    var split: AWSComprehendSplit { get set }