public class EntityRecognizerInputDataConfig
extends java.lang.Object
implements java.io.Serializable
Specifies the format and location of the input data.
Constructor and Description |
---|
EntityRecognizerInputDataConfig() |
Modifier and Type | Method and Description |
---|---|
boolean | equals(java.lang.Object obj) |
EntityRecognizerAnnotations | getAnnotations() - The S3 location of the CSV file that annotates your training documents. |
java.util.List<AugmentedManifestsListItem> | getAugmentedManifests() - A list of augmented manifest files that provide training data for your custom model. |
java.lang.String | getDataFormat() - The format of your training data. |
EntityRecognizerDocuments | getDocuments() - The S3 location of the folder that contains the training documents for your custom entity recognizer. |
EntityRecognizerEntityList | getEntityList() - The S3 location of the CSV file that has the entity list for your custom entity recognizer. |
java.util.List<EntityTypesListItem> | getEntityTypes() - The entity types in the labeled training data that Amazon Comprehend uses to train the custom entity recognizer. |
int | hashCode() |
void | setAnnotations(EntityRecognizerAnnotations annotations) - The S3 location of the CSV file that annotates your training documents. |
void | setAugmentedManifests(java.util.Collection<AugmentedManifestsListItem> augmentedManifests) - A list of augmented manifest files that provide training data for your custom model. |
void | setDataFormat(EntityRecognizerDataFormat dataFormat) - The format of your training data. |
void | setDataFormat(java.lang.String dataFormat) - The format of your training data. |
void | setDocuments(EntityRecognizerDocuments documents) - The S3 location of the folder that contains the training documents for your custom entity recognizer. |
void | setEntityList(EntityRecognizerEntityList entityList) - The S3 location of the CSV file that has the entity list for your custom entity recognizer. |
void | setEntityTypes(java.util.Collection<EntityTypesListItem> entityTypes) - The entity types in the labeled training data that Amazon Comprehend uses to train the custom entity recognizer. |
java.lang.String | toString() - Returns a string representation of this object; useful for testing and debugging. |
EntityRecognizerInputDataConfig | withAnnotations(EntityRecognizerAnnotations annotations) - The S3 location of the CSV file that annotates your training documents. |
EntityRecognizerInputDataConfig | withAugmentedManifests(AugmentedManifestsListItem... augmentedManifests) - A list of augmented manifest files that provide training data for your custom model. |
EntityRecognizerInputDataConfig | withAugmentedManifests(java.util.Collection<AugmentedManifestsListItem> augmentedManifests) - A list of augmented manifest files that provide training data for your custom model. |
EntityRecognizerInputDataConfig | withDataFormat(EntityRecognizerDataFormat dataFormat) - The format of your training data. |
EntityRecognizerInputDataConfig | withDataFormat(java.lang.String dataFormat) - The format of your training data. |
EntityRecognizerInputDataConfig | withDocuments(EntityRecognizerDocuments documents) - The S3 location of the folder that contains the training documents for your custom entity recognizer. |
EntityRecognizerInputDataConfig | withEntityList(EntityRecognizerEntityList entityList) - The S3 location of the CSV file that has the entity list for your custom entity recognizer. |
EntityRecognizerInputDataConfig | withEntityTypes(java.util.Collection<EntityTypesListItem> entityTypes) - The entity types in the labeled training data that Amazon Comprehend uses to train the custom entity recognizer. |
EntityRecognizerInputDataConfig | withEntityTypes(EntityTypesListItem... entityTypes) - The entity types in the labeled training data that Amazon Comprehend uses to train the custom entity recognizer. |
public java.lang.String getDataFormat()
The format of your training data:
COMPREHEND_CSV: A CSV file that supplements your training documents. The CSV file contains information about the custom entities that your trained model will detect. The required format of the file depends on whether you are providing annotations or an entity list.
If you use this value, you must provide your CSV file by using either the Annotations or EntityList parameters. You must provide your training documents by using the Documents parameter.
AUGMENTED_MANIFEST: A labeled dataset that is produced by Amazon SageMaker Ground Truth. This file is in JSON lines format. Each line is a complete JSON object that contains a training document and its labels. Each label annotates a named entity in the training document.
If you use this value, you must provide the AugmentedManifests parameter in your request.
If you don't specify a value, Amazon Comprehend uses COMPREHEND_CSV as the default.
Constraints:
Allowed Values: COMPREHEND_CSV, AUGMENTED_MANIFEST
See Also: EntityRecognizerDataFormat
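The pairing between each DataFormat value and the parameters it requires can be sketched as a small pre-flight check. This is a minimal illustration, not SDK code: the `DataFormatRequirements` class, its `Format` enum, and the boolean flags are hypothetical stand-ins, and it assumes that "either the Annotations or EntityList parameters" means at least one of the two must be present.

```java
import java.util.ArrayList;
import java.util.List;

public class DataFormatRequirements {
    // Local stand-in for the two allowed DataFormat values; not the SDK enum.
    public enum Format { COMPREHEND_CSV, AUGMENTED_MANIFEST }

    /**
     * Returns the names of parameters that the description marks as required
     * for the given format but that the caller has not supplied.
     * A null format is treated as COMPREHEND_CSV, the documented default.
     */
    public static List<String> missingParameters(Format format,
            boolean hasDocuments, boolean hasAnnotations,
            boolean hasEntityList, boolean hasAugmentedManifests) {
        Format effective = (format == null) ? Format.COMPREHEND_CSV : format;
        List<String> missing = new ArrayList<>();
        if (effective == Format.COMPREHEND_CSV) {
            if (!hasDocuments) {
                missing.add("Documents");
            }
            // Assumption: at least one of the two CSV parameters is needed.
            if (!hasAnnotations && !hasEntityList) {
                missing.add("Annotations or EntityList");
            }
        } else {
            if (!hasAugmentedManifests) {
                missing.add("AugmentedManifests");
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Default format, documents supplied, no annotations or entity list.
        System.out.println(missingParameters(null, true, false, false, false));
        // prints [Annotations or EntityList]

        // Augmented manifest format with manifests supplied.
        System.out.println(missingParameters(Format.AUGMENTED_MANIFEST, false, false, false, true));
        // prints []
    }
}
```

A check like this only mirrors the rules stated in the description; the service itself remains the authority on what a request must contain.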
public void setDataFormat(java.lang.String dataFormat)
The format of your training data:
COMPREHEND_CSV: A CSV file that supplements your training documents. The CSV file contains information about the custom entities that your trained model will detect. The required format of the file depends on whether you are providing annotations or an entity list.
If you use this value, you must provide your CSV file by using either the Annotations or EntityList parameters. You must provide your training documents by using the Documents parameter.
AUGMENTED_MANIFEST: A labeled dataset that is produced by Amazon SageMaker Ground Truth. This file is in JSON lines format. Each line is a complete JSON object that contains a training document and its labels. Each label annotates a named entity in the training document.
If you use this value, you must provide the AugmentedManifests parameter in your request.
If you don't specify a value, Amazon Comprehend uses COMPREHEND_CSV as the default.
Constraints:
Allowed Values: COMPREHEND_CSV, AUGMENTED_MANIFEST
dataFormat - The format of your training data: COMPREHEND_CSV or AUGMENTED_MANIFEST. If you don't specify a value, Amazon Comprehend uses COMPREHEND_CSV as the default.
See Also: EntityRecognizerDataFormat
public EntityRecognizerInputDataConfig withDataFormat(java.lang.String dataFormat)
The format of your training data:
COMPREHEND_CSV: A CSV file that supplements your training documents. The CSV file contains information about the custom entities that your trained model will detect. The required format of the file depends on whether you are providing annotations or an entity list.
If you use this value, you must provide your CSV file by using either the Annotations or EntityList parameters. You must provide your training documents by using the Documents parameter.
AUGMENTED_MANIFEST: A labeled dataset that is produced by Amazon SageMaker Ground Truth. This file is in JSON lines format. Each line is a complete JSON object that contains a training document and its labels. Each label annotates a named entity in the training document.
If you use this value, you must provide the AugmentedManifests parameter in your request.
If you don't specify a value, Amazon Comprehend uses COMPREHEND_CSV as the default.
Returns a reference to this object so that method calls can be chained together.
Constraints:
Allowed Values: COMPREHEND_CSV, AUGMENTED_MANIFEST
dataFormat - The format of your training data: COMPREHEND_CSV or AUGMENTED_MANIFEST. If you don't specify a value, Amazon Comprehend uses COMPREHEND_CSV as the default.
See Also: EntityRecognizerDataFormat
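The difference between the `set` and `with` variants is only the return value: `with` methods return the object itself so that calls can be chained. The following is a minimal sketch of that pattern using a hypothetical `InputConfig` stand-in, with field types simplified to `String` and a placeholder S3 URI; it is not the SDK class.

```java
public class FluentSetterSketch {
    // Hypothetical stand-in that mirrors the set/with naming of this class.
    public static class InputConfig {
        private String dataFormat;
        private String documents;

        public void setDataFormat(String dataFormat) { this.dataFormat = dataFormat; }
        public String getDataFormat() { return dataFormat; }

        public void setDocuments(String documents) { this.documents = documents; }
        public String getDocuments() { return documents; }

        // Delegates to the setter, then returns this so calls can be chained.
        public InputConfig withDataFormat(String dataFormat) {
            setDataFormat(dataFormat);
            return this;
        }

        public InputConfig withDocuments(String documents) {
            setDocuments(documents);
            return this;
        }
    }

    public static void main(String[] args) {
        // One expression configures the object; the URI is a placeholder.
        InputConfig config = new InputConfig()
                .withDataFormat("COMPREHEND_CSV")
                .withDocuments("s3://example-bucket/training-docs/");
        System.out.println(config.getDataFormat() + " " + config.getDocuments());
    }
}
```

The `set` methods return `void` and suit bean-style frameworks, while the `with` methods suit inline construction of a request.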
public void setDataFormat(EntityRecognizerDataFormat dataFormat)
The format of your training data:
COMPREHEND_CSV: A CSV file that supplements your training documents. The CSV file contains information about the custom entities that your trained model will detect. The required format of the file depends on whether you are providing annotations or an entity list.
If you use this value, you must provide your CSV file by using either the Annotations or EntityList parameters. You must provide your training documents by using the Documents parameter.
AUGMENTED_MANIFEST: A labeled dataset that is produced by Amazon SageMaker Ground Truth. This file is in JSON lines format. Each line is a complete JSON object that contains a training document and its labels. Each label annotates a named entity in the training document.
If you use this value, you must provide the AugmentedManifests parameter in your request.
If you don't specify a value, Amazon Comprehend uses COMPREHEND_CSV as the default.
Constraints:
Allowed Values: COMPREHEND_CSV, AUGMENTED_MANIFEST
dataFormat - The format of your training data: COMPREHEND_CSV or AUGMENTED_MANIFEST. If you don't specify a value, Amazon Comprehend uses COMPREHEND_CSV as the default.
See Also: EntityRecognizerDataFormat
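The getter returns a `java.lang.String`, while the setters accept either a string or an `EntityRecognizerDataFormat`. One plausible way for the two overloads to relate is sketched below. It assumes the enum overload stores the constant's string form, which this page does not state, and the nested `DataFormat` enum with its `fromValue` helper is a local stand-in rather than the SDK type.

```java
public class DataFormatOverloadSketch {
    // Local stand-in for EntityRecognizerDataFormat; not the SDK enum.
    public enum DataFormat {
        COMPREHEND_CSV, AUGMENTED_MANIFEST;

        // Maps a raw string onto a constant, rejecting values outside the allowed set.
        public static DataFormat fromValue(String value) {
            for (DataFormat format : values()) {
                if (format.name().equals(value)) {
                    return format;
                }
            }
            throw new IllegalArgumentException("Not an allowed value: " + value);
        }
    }

    private String dataFormat;

    // String overload: stores the value as given, with no validation.
    public void setDataFormat(String dataFormat) {
        this.dataFormat = dataFormat;
    }

    // Enum overload (assumed behavior): stores the constant's string form.
    public void setDataFormat(DataFormat dataFormat) {
        this.dataFormat = dataFormat.name();
    }

    public String getDataFormat() {
        return dataFormat;
    }

    public static void main(String[] args) {
        DataFormatOverloadSketch viaEnum = new DataFormatOverloadSketch();
        viaEnum.setDataFormat(DataFormat.AUGMENTED_MANIFEST);

        DataFormatOverloadSketch viaString = new DataFormatOverloadSketch();
        viaString.setDataFormat("AUGMENTED_MANIFEST");

        // Both routes end with the same stored string.
        System.out.println(viaEnum.getDataFormat().equals(viaString.getDataFormat())); // prints true
    }
}
```

Under that assumption, the enum overload is the safer choice because a misspelled value fails at compile time instead of being sent to the service.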
public EntityRecognizerInputDataConfig withDataFormat(EntityRecognizerDataFormat dataFormat)
The format of your training data:
COMPREHEND_CSV: A CSV file that supplements your training documents. The CSV file contains information about the custom entities that your trained model will detect. The required format of the file depends on whether you are providing annotations or an entity list.
If you use this value, you must provide your CSV file by using either the Annotations or EntityList parameters. You must provide your training documents by using the Documents parameter.
AUGMENTED_MANIFEST: A labeled dataset that is produced by Amazon SageMaker Ground Truth. This file is in JSON lines format. Each line is a complete JSON object that contains a training document and its labels. Each label annotates a named entity in the training document.
If you use this value, you must provide the AugmentedManifests parameter in your request.
If you don't specify a value, Amazon Comprehend uses COMPREHEND_CSV as the default.
Returns a reference to this object so that method calls can be chained together.
Constraints:
Allowed Values: COMPREHEND_CSV, AUGMENTED_MANIFEST
dataFormat - The format of your training data: COMPREHEND_CSV or AUGMENTED_MANIFEST. If you don't specify a value, Amazon Comprehend uses COMPREHEND_CSV as the default.
See Also: EntityRecognizerDataFormat
public java.util.List<EntityTypesListItem> getEntityTypes()
The entity types in the labeled training data that Amazon Comprehend uses to train the custom entity recognizer. Any entity types that you don't specify are ignored.
A maximum of 25 entity types can be used at one time to train an entity recognizer. Entity types must not contain the following invalid characters: \n (line break), \\n (escaped line break), \r (carriage return), \\r (escaped carriage return), \t (tab), \\t (escaped tab), space, and , (comma).
public void setEntityTypes(java.util.Collection<EntityTypesListItem> entityTypes)
The entity types in the labeled training data that Amazon Comprehend uses to train the custom entity recognizer. Any entity types that you don't specify are ignored.
A maximum of 25 entity types can be used at one time to train an entity recognizer. Entity types must not contain the following invalid characters: \n (line break), \\n (escaped line break), \r (carriage return), \\r (escaped carriage return), \t (tab), \\t (escaped tab), space, and , (comma).
entityTypes - The entity types in the labeled training data that Amazon Comprehend uses to train the custom entity recognizer. Any entity types that you don't specify are ignored.
A maximum of 25 entity types can be used at one time to train an entity recognizer. Entity types must not contain the following invalid characters: \n (line break), \\n (escaped line break), \r (carriage return), \\r (escaped carriage return), \t (tab), \\t (escaped tab), space, and , (comma).
public EntityRecognizerInputDataConfig withEntityTypes(EntityTypesListItem... entityTypes)
The entity types in the labeled training data that Amazon Comprehend uses to train the custom entity recognizer. Any entity types that you don't specify are ignored.
A maximum of 25 entity types can be used at one time to train an entity recognizer. Entity types must not contain the following invalid characters: \n (line break), \\n (escaped line break), \r (carriage return), \\r (escaped carriage return), \t (tab), \\t (escaped tab), space, and , (comma).
Returns a reference to this object so that method calls can be chained together.
entityTypes - The entity types in the labeled training data that Amazon Comprehend uses to train the custom entity recognizer. Any entity types that you don't specify are ignored.
A maximum of 25 entity types can be used at one time to train an entity recognizer. Entity types must not contain the following invalid characters: \n (line break), \\n (escaped line break), \r (carriage return), \\r (escaped carriage return), \t (tab), \\t (escaped tab), space, and , (comma).
public EntityRecognizerInputDataConfig withEntityTypes(java.util.Collection<EntityTypesListItem> entityTypes)
The entity types in the labeled training data that Amazon Comprehend uses to train the custom entity recognizer. Any entity types that you don't specify are ignored.
A maximum of 25 entity types can be used at one time to train an entity recognizer. Entity types must not contain the following invalid characters: \n (line break), \\n (escaped line break), \r (carriage return), \\r (escaped carriage return), \t (tab), \\t (escaped tab), space, and , (comma).
Returns a reference to this object so that method calls can be chained together.
entityTypes - The entity types in the labeled training data that Amazon Comprehend uses to train the custom entity recognizer. Any entity types that you don't specify are ignored.
A maximum of 25 entity types can be used at one time to train an entity recognizer. Entity types must not contain the following invalid characters: \n (line break), \\n (escaped line break), \r (carriage return), \\r (escaped carriage return), \t (tab), \\t (escaped tab), space, and , (comma).
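The limits on entity types (at most 25, none containing the listed characters) can be checked locally before building a request. The sketch below is one way to do that, assuming entity type names are available as plain strings; the `EntityTypeNameCheck` class and its `problems` method are hypothetical helpers, not part of the SDK.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class EntityTypeNameCheck {
    public static final int MAX_ENTITY_TYPES = 25;

    // Forbidden substrings: the control characters themselves, their
    // two-character escaped spellings (backslash plus letter), space, and comma.
    private static final String[] INVALID = {
        "\n", "\\n", "\r", "\\r", "\t", "\\t", " ", ","
    };

    /** Returns one message per violated rule; an empty list means no problem was found. */
    public static List<String> problems(List<String> typeNames) {
        List<String> problems = new ArrayList<>();
        if (typeNames.size() > MAX_ENTITY_TYPES) {
            problems.add("too many entity types: " + typeNames.size());
        }
        for (int i = 0; i < typeNames.size(); i++) {
            for (String bad : INVALID) {
                if (typeNames.get(i).contains(bad)) {
                    // Report the position rather than the name, which may hold control characters.
                    problems.add("invalid character in entry " + i);
                    break;
                }
            }
        }
        return problems;
    }

    public static void main(String[] args) {
        System.out.println(problems(Arrays.asList("PERSON", "JOB TITLE", "A,B")));
        // prints [invalid character in entry 1, invalid character in entry 2]
    }
}
```

Replacing spaces with underscores (for example `JOB_TITLE`) is a common way to keep names within these rules.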
public EntityRecognizerDocuments getDocuments()
The S3 location of the folder that contains the training documents for your custom entity recognizer.
This parameter is required if you set DataFormat to COMPREHEND_CSV.
public void setDocuments(EntityRecognizerDocuments documents)
The S3 location of the folder that contains the training documents for your custom entity recognizer.
This parameter is required if you set DataFormat to COMPREHEND_CSV.
documents - The S3 location of the folder that contains the training documents for your custom entity recognizer.
This parameter is required if you set DataFormat to COMPREHEND_CSV.
public EntityRecognizerInputDataConfig withDocuments(EntityRecognizerDocuments documents)
The S3 location of the folder that contains the training documents for your custom entity recognizer.
This parameter is required if you set DataFormat to COMPREHEND_CSV.
Returns a reference to this object so that method calls can be chained together.
documents - The S3 location of the folder that contains the training documents for your custom entity recognizer.
This parameter is required if you set DataFormat to COMPREHEND_CSV.
public EntityRecognizerAnnotations getAnnotations()
The S3 location of the CSV file that annotates your training documents.
public void setAnnotations(EntityRecognizerAnnotations annotations)
The S3 location of the CSV file that annotates your training documents.
annotations - The S3 location of the CSV file that annotates your training documents.
public EntityRecognizerInputDataConfig withAnnotations(EntityRecognizerAnnotations annotations)
The S3 location of the CSV file that annotates your training documents.
Returns a reference to this object so that method calls can be chained together.
annotations - The S3 location of the CSV file that annotates your training documents.
public EntityRecognizerEntityList getEntityList()
The S3 location of the CSV file that has the entity list for your custom entity recognizer.
public void setEntityList(EntityRecognizerEntityList entityList)
The S3 location of the CSV file that has the entity list for your custom entity recognizer.
entityList - The S3 location of the CSV file that has the entity list for your custom entity recognizer.
public EntityRecognizerInputDataConfig withEntityList(EntityRecognizerEntityList entityList)
The S3 location of the CSV file that has the entity list for your custom entity recognizer.
Returns a reference to this object so that method calls can be chained together.
entityList - The S3 location of the CSV file that has the entity list for your custom entity recognizer.
public java.util.List<AugmentedManifestsListItem> getAugmentedManifests()
A list of augmented manifest files that provide training data for your custom model. An augmented manifest file is a labeled dataset that is produced by Amazon SageMaker Ground Truth.
This parameter is required if you set DataFormat to AUGMENTED_MANIFEST.
public void setAugmentedManifests(java.util.Collection<AugmentedManifestsListItem> augmentedManifests)
A list of augmented manifest files that provide training data for your custom model. An augmented manifest file is a labeled dataset that is produced by Amazon SageMaker Ground Truth.
This parameter is required if you set DataFormat to AUGMENTED_MANIFEST.
augmentedManifests - A list of augmented manifest files that provide training data for your custom model. An augmented manifest file is a labeled dataset that is produced by Amazon SageMaker Ground Truth.
This parameter is required if you set DataFormat to AUGMENTED_MANIFEST.
public EntityRecognizerInputDataConfig withAugmentedManifests(AugmentedManifestsListItem... augmentedManifests)
A list of augmented manifest files that provide training data for your custom model. An augmented manifest file is a labeled dataset that is produced by Amazon SageMaker Ground Truth.
This parameter is required if you set DataFormat to AUGMENTED_MANIFEST.
Returns a reference to this object so that method calls can be chained together.
augmentedManifests - A list of augmented manifest files that provide training data for your custom model. An augmented manifest file is a labeled dataset that is produced by Amazon SageMaker Ground Truth.
This parameter is required if you set DataFormat to AUGMENTED_MANIFEST.
public EntityRecognizerInputDataConfig withAugmentedManifests(java.util.Collection<AugmentedManifestsListItem> augmentedManifests)
A list of augmented manifest files that provide training data for your custom model. An augmented manifest file is a labeled dataset that is produced by Amazon SageMaker Ground Truth.
This parameter is required if you set DataFormat to AUGMENTED_MANIFEST.
Returns a reference to this object so that method calls can be chained together.
augmentedManifests - A list of augmented manifest files that provide training data for your custom model. An augmented manifest file is a labeled dataset that is produced by Amazon SageMaker Ground Truth.
This parameter is required if you set DataFormat to AUGMENTED_MANIFEST.
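Because an augmented manifest is in JSON lines format, with one complete JSON object per line, a file can be split into training records by line. The sketch below shows only that line-per-record structure. The `ManifestLinesSketch` class is hypothetical, the `text` and `labels` keys in the sample are placeholders rather than the Ground Truth schema, and the shape check is deliberately shallow and is not a JSON parser.

```java
import java.util.ArrayList;
import java.util.List;

public class ManifestLinesSketch {
    // Splits manifest text into records, one per non-blank line.
    public static List<String> records(String manifestText) {
        List<String> out = new ArrayList<>();
        for (String line : manifestText.split("\\R")) {
            String trimmed = line.trim();
            if (!trimmed.isEmpty()) {
                out.add(trimmed);
            }
        }
        return out;
    }

    // Shallow shape check only; a real validator would parse the JSON.
    public static boolean looksLikeJsonObject(String record) {
        return record.startsWith("{") && record.endsWith("}");
    }

    public static void main(String[] args) {
        // Placeholder content: the keys are illustrative, not the real schema.
        String manifest =
              "{\"text\": \"Jane works in Seattle.\", \"labels\": []}\n"
            + "{\"text\": \"The order shipped on Monday.\", \"labels\": []}\n";

        List<String> lines = records(manifest);
        System.out.println(lines.size()); // prints 2
        for (String record : lines) {
            System.out.println(looksLikeJsonObject(record)); // prints true for each record
        }
    }
}
```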
public java.lang.String toString()
Overrides: toString in class java.lang.Object
See Also: Object.toString()
public int hashCode()
Overrides: hashCode in class java.lang.Object
public boolean equals(java.lang.Object obj)
Overrides: equals in class java.lang.Object
Copyright © 2018 Amazon Web Services, Inc. All Rights Reserved.