AWSComprehendDocumentClassifierInputDataConfig

@interface AWSComprehendDocumentClassifierInputDataConfig

The input properties for training a document classifier.

For more information on how the input file is formatted, see how-document-classification-training-data.

Required parameters: [S3Uri]

  • The delimiter that separates labels when training a multi-label classifier. The default delimiter is a pipe (|). You can use a different character as the delimiter, provided it is an allowed character, by specifying it under Delimiter for labels. If the training documents use a delimiter other than the default or the one you specify, the labels on that line are combined into a single unique label, such as LABELLABELLABEL.

    Declaration

    Objective-C

    @property (readwrite, strong, nonatomic) NSString *_Nullable labelDelimiter;

    Swift

    var labelDelimiter: String? { get set }
  • The Amazon S3 URI for the input data. The S3 bucket must be in the same region as the API endpoint that you are calling. The URI can point to a single input file or it can provide the prefix for a collection of input files.

    For example, suppose you use the URI S3://bucketName/prefix. If the prefix is a single file, Amazon Comprehend uses that file as input. If more than one file begins with the prefix, Amazon Comprehend uses all of them as input.

    Declaration

    Objective-C

    @property (readwrite, strong, nonatomic) NSString *_Nullable s3Uri;

    Swift

    var s3Uri: String? { get set }
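
    Example

    The following is a minimal, self-contained Swift sketch of the two behaviors described above: how a label field is split by the delimiter, and how an S3 URI prefix selects one or several input files. It only models the documented behavior locally and does not call Amazon Comprehend. The helper names splitLabels and keysMatching, the sample labels, and the sample object keys are hypothetical and are not part of the SDK.

```swift
import Foundation

// Hypothetical helper (not part of the SDK): splits the label field of one
// training row using the given delimiter. The default mirrors the documented
// default delimiter, a pipe (|).
func splitLabels(_ field: String, delimiter: String = "|") -> [String] {
    return field.components(separatedBy: delimiter)
}

// Hypothetical helper (not part of the SDK): returns the object keys that
// begin with the given prefix, modeling how a prefix URI can match a single
// file or a collection of files.
func keysMatching(prefix: String, in keys: [String]) -> [String] {
    return keys.filter { $0.hasPrefix(prefix) }
}

// Labels separated by the default delimiter are split as expected.
let piped = splitLabels("COMEDY|DRAMA|ACTION")
print(piped.count) // 3

// A custom delimiter works when it matches the one used in the data.
let comma = splitLabels("COMEDY,DRAMA", delimiter: ",")
print(comma.count) // 2

// A mismatch between the data and the configured delimiter leaves the whole
// field as one label, which is the failure mode the documentation warns about.
let mismatched = splitLabels("COMEDY,DRAMA")
print(mismatched.count) // 1

// Sample object keys, assumed for illustration only.
let keys = ["train/part-1.csv", "train/part-2.csv", "test/part-1.csv"]

// A prefix shared by several objects selects all of them.
let many = keysMatching(prefix: "train/part", in: keys)
print(many.count) // 2

// A prefix that is a full object key selects just that file.
let single = keysMatching(prefix: "train/part-1.csv", in: keys)
print(single.count) // 1
```

    In practice, the delimiter you confirm against your training data is the value to assign to labelDelimiter, and the bucket and prefix you check are what go into s3Uri.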