public class AmazonTextractClient extends AmazonWebServiceClient implements AmazonTextract
Amazon Textract detects and analyzes text in documents and converts it into machine-readable text. This is the API reference documentation for Amazon Textract.
LOGGING_AWS_REQUEST_METRIC
Constructor and Description |
---|
AmazonTextractClient()
Deprecated.
|
AmazonTextractClient(AWSCredentials awsCredentials)
Constructs a new client to invoke service methods on AmazonTextract using
the specified AWS account credentials.
|
AmazonTextractClient(AWSCredentials awsCredentials,
ClientConfiguration clientConfiguration)
Constructs a new client to invoke service methods on AmazonTextract using
the specified AWS account credentials and client configuration options.
|
AmazonTextractClient(AWSCredentialsProvider awsCredentialsProvider)
Constructs a new client to invoke service methods on AmazonTextract using
the specified AWS account credentials provider.
|
AmazonTextractClient(AWSCredentialsProvider awsCredentialsProvider,
ClientConfiguration clientConfiguration)
Constructs a new client to invoke service methods on AmazonTextract using
the specified AWS account credentials provider and client configuration
options.
|
AmazonTextractClient(AWSCredentialsProvider awsCredentialsProvider,
ClientConfiguration clientConfiguration,
HttpClient httpClient)
Constructs a new client to invoke service methods on AmazonTextract using
the specified AWS account credentials provider, client configuration
options and request metric collector.
|
AmazonTextractClient(AWSCredentialsProvider awsCredentialsProvider,
ClientConfiguration clientConfiguration,
com.amazonaws.metrics.RequestMetricCollector requestMetricCollector)
Deprecated.
|
AmazonTextractClient(ClientConfiguration clientConfiguration)
Deprecated.
|
Modifier and Type | Method and Description |
---|---|
AnalyzeDocumentResult |
analyzeDocument(AnalyzeDocumentRequest analyzeDocumentRequest)
Analyzes an input document for relationships between detected items.
|
DetectDocumentTextResult |
detectDocumentText(DetectDocumentTextRequest detectDocumentTextRequest)
Detects text in the input document.
|
ResponseMetadata |
getCachedResponseMetadata(AmazonWebServiceRequest request)
Deprecated.
ResponseMetadata cache can hold up to 50 requests and
responses in memory and will cause memory issue. This method
now always returns null.
|
GetDocumentAnalysisResult |
getDocumentAnalysis(GetDocumentAnalysisRequest getDocumentAnalysisRequest)
Gets the results for an Amazon Textract asynchronous operation that
analyzes text in a document.
|
GetDocumentTextDetectionResult |
getDocumentTextDetection(GetDocumentTextDetectionRequest getDocumentTextDetectionRequest)
Gets the results for an Amazon Textract asynchronous operation that
detects text in a document.
|
StartDocumentAnalysisResult |
startDocumentAnalysis(StartDocumentAnalysisRequest startDocumentAnalysisRequest)
Starts the asynchronous analysis of an input document for relationships
between detected items such as key-value pairs, tables, and selection
elements.
|
StartDocumentTextDetectionResult |
startDocumentTextDetection(StartDocumentTextDetectionRequest startDocumentTextDetectionRequest)
Starts the asynchronous detection of text in a document.
|
addRequestHandler, addRequestHandler, getEndpoint, getEndpointPrefix, getRegions, getRequestMetricsCollector, getServiceName, getSignerByURI, getSignerRegionOverride, getTimeOffset, removeRequestHandler, removeRequestHandler, setConfiguration, setEndpoint, setEndpoint, setRegion, setServiceNameIntern, setSignerRegionOverride, setTimeOffset, shutdown, withTimeOffset
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
setEndpoint, setRegion, shutdown
@Deprecated public AmazonTextractClient()
All service calls made using this new client object are blocking, and will not return until the service call completes.
DefaultAWSCredentialsProviderChain
@Deprecated public AmazonTextractClient(ClientConfiguration clientConfiguration)
All service calls made using this new client object are blocking, and will not return until the service call completes.
clientConfiguration
- The client configuration options controlling
how this client connects to AmazonTextract (ex: proxy
settings, retry counts, etc.).DefaultAWSCredentialsProviderChain
public AmazonTextractClient(AWSCredentials awsCredentials)
The client requests are authenticated using the AWSCredentials
provided in this constructor. Static AWSCredentials can be passed for
quick testing. However, it is strongly recommended to use Amazon Cognito
vended temporary credentials for use in production. This can be achieved
by using AWSMobileClient
. Please see
https://aws-amplify.github.io/docs/android/authentication for
instructions on how to enable AWSMobileClient
.
AWSMobileClient.getInstance().initialize(getApplicationContext(), new Callback<UserStateDetails>() { @Override public void onResult(final UserStateDetails details) { AmazonTextractClient client = new AmazonTextractClient(AWSMobileClient.getInstance()); } @Override public void onError(final Exception e) { e.printStackTrace(); } });
All service calls made using this new client object are blocking, and will not return until the service call completes.
awsCredentials
- The AWS credentials (access key ID and secret key)
to use when authenticating with AWS services.public AmazonTextractClient(AWSCredentials awsCredentials, ClientConfiguration clientConfiguration)
The client requests are authenticated using the AWSCredentials
provided in this constructor. Static AWSCredentials can be passed for
quick testing. However, it is strongly recommended to use Amazon Cognito
vended temporary credentials for use in production. This can be achieved
by using AWSMobileClient
. Please see
https://aws-amplify.github.io/docs/android/authentication for
instructions on how to enable AWSMobileClient
.
AWSMobileClient.getInstance().initialize(getApplicationContext(), new Callback<UserStateDetails>() { @Override public void onResult(final UserStateDetails details) { AmazonTextractClient client = new AmazonTextractClient(AWSMobileClient.getInstance()); } @Override public void onError(final Exception e) { e.printStackTrace(); } });
All service calls made using this new client object are blocking, and will not return until the service call completes.
awsCredentials
- The AWS credentials (access key ID and secret key)
to use when authenticating with AWS services.clientConfiguration
- The client configuration options controlling
how this client connects to AmazonTextract (ex: proxy
settings, retry counts, etc.).public AmazonTextractClient(AWSCredentialsProvider awsCredentialsProvider)
The client requests are authenticated using the AWSCredentials
provided by the AWSCredentialsProvider
. Static AWSCredentials can
be passed for quick testing. However, it is strongly recommended to use
Amazon Cognito vended temporary credentials for use in production. This
can be achieved by using AWSMobileClient
. Please see
https://aws-amplify.github.io/docs/android/authentication for
instructions on how to enable AWSMobileClient
.
AWSMobileClient.getInstance().initialize(getApplicationContext(), new Callback<UserStateDetails>() { @Override public void onResult(final UserStateDetails details) { AmazonTextractClient client = new AmazonTextractClient(AWSMobileClient.getInstance()); } @Override public void onError(final Exception e) { e.printStackTrace(); } });
All service calls made using this new client object are blocking, and will not return until the service call completes.
awsCredentialsProvider
- The AWS credentials provider which will
provide credentials to authenticate requests with AWS
services.public AmazonTextractClient(AWSCredentialsProvider awsCredentialsProvider, ClientConfiguration clientConfiguration)
The client requests are authenticated using the AWSCredentials
provided by the AWSCredentialsProvider
. Static AWSCredentials can
be passed for quick testing. However, it is strongly recommended to use
Amazon Cognito vended temporary credentials for use in production. This
can be achieved by using AWSMobileClient
. Please see
https://aws-amplify.github.io/docs/android/authentication for
instructions on how to enable AWSMobileClient
.
AWSMobileClient.getInstance().initialize(getApplicationContext(), new Callback<UserStateDetails>() { @Override public void onResult(final UserStateDetails details) { AmazonTextractClient client = new AmazonTextractClient(AWSMobileClient.getInstance()); } @Override public void onError(final Exception e) { e.printStackTrace(); } });
All service calls made using this new client object are blocking, and will not return until the service call completes.
awsCredentialsProvider
- The AWS credentials provider which will
provide credentials to authenticate requests with AWS
services.clientConfiguration
- The client configuration options controlling
how this client connects to AmazonTextract (ex: proxy
settings, retry counts, etc.).@Deprecated public AmazonTextractClient(AWSCredentialsProvider awsCredentialsProvider, ClientConfiguration clientConfiguration, com.amazonaws.metrics.RequestMetricCollector requestMetricCollector)
All service calls made using this new client object are blocking, and will not return until the service call completes.
awsCredentialsProvider
- The AWS credentials provider which will
provide credentials to authenticate requests with AWS
services.clientConfiguration
- The client configuration options controlling
how this client connects to AmazonTextract (ex: proxy
settings, retry counts, etc.).requestMetricCollector
- optional request metric collectorpublic AmazonTextractClient(AWSCredentialsProvider awsCredentialsProvider, ClientConfiguration clientConfiguration, HttpClient httpClient)
The client requests are authenticated using the AWSCredentials
provided by the AWSCredentialsProvider
. Static AWSCredentials can
be passed for quick testing. However, it is strongly recommended to use
Amazon Cognito vended temporary credentials for use in production. This
can be achieved by using AWSMobileClient
. Please see
https://aws-amplify.github.io/docs/android/authentication for
instructions on how to enable AWSMobileClient
.
AWSMobileClient.getInstance().initialize(getApplicationContext(), new Callback<UserStateDetails>() { @Override public void onResult(final UserStateDetails details) { AmazonTextractClient client = new AmazonTextractClient(AWSMobileClient.getInstance()); } @Override public void onError(final Exception e) { e.printStackTrace(); } });
All service calls made using this new client object are blocking, and will not return until the service call completes.
awsCredentialsProvider
- The AWS credentials provider which will
provide credentials to authenticate requests with AWS
services.clientConfiguration
- The client configuration options controlling
how this client connects to AmazonTextract (ex: proxy
settings, retry counts, etc.).httpClient
- A http clientpublic AnalyzeDocumentResult analyzeDocument(AnalyzeDocumentRequest analyzeDocumentRequest) throws AmazonServiceException, AmazonClientException
Analyzes an input document for relationships between detected items.
The types of information returned are as follows:
Form data (key-value pairs). The related information is returned in two
Block objects, each of type KEY_VALUE_SET
: a KEY
Block
object and a VALUE Block
object. For
example, Name: Ana Silva Carolina contains a key and value.
Name: is the key. Ana Silva Carolina is the value.
Table and table cell data. A TABLE Block
object contains
information about a detected table. A CELL Block
object is
returned for each cell in a table.
Lines and words of text. A LINE Block
object contains one or
more WORD Block
objects. All lines and words that are
detected in the document are returned (including text that doesn't have a
relationship with the value of FeatureTypes
).
Selection elements such as check boxes and option buttons (radio buttons)
can be detected in form data and in tables. A SELECTION_ELEMENT
Block
object contains information about a selection element,
including the selection status.
You can choose which type of analysis to perform by specifying the
FeatureTypes
list.
The output is returned in a list of Block
objects.
AnalyzeDocument
is a synchronous operation. To analyze
documents asynchronously, use StartDocumentAnalysis.
For more information, see Document Text Analysis.
analyzeDocument
in interface AmazonTextract
analyzeDocumentRequest
- InvalidParameterException
InvalidS3ObjectException
UnsupportedDocumentException
DocumentTooLargeException
BadDocumentException
AccessDeniedException
ProvisionedThroughputExceededException
InternalServerErrorException
ThrottlingException
HumanLoopQuotaExceededException
AmazonClientException
- If any internal errors are encountered
inside the client while attempting to make the request or
handle the response. For example if a network connection is
not available.AmazonServiceException
- If an error response is returned by Amazon
Textract indicating either a problem with the data in the
request, or a server side issue.public DetectDocumentTextResult detectDocumentText(DetectDocumentTextRequest detectDocumentTextRequest) throws AmazonServiceException, AmazonClientException
Detects text in the input document. Amazon Textract can detect lines of
text and the words that make up a line of text. The input document must
be an image in JPEG or PNG format. DetectDocumentText
returns the detected text in an array of Block objects.
Each document page has as an associated Block
of type PAGE.
Each PAGE Block
object is the parent of LINE
Block
objects that represent the lines of detected text on a
page. A LINE Block
object is a parent for each word that
makes up the line. Words are represented by Block
objects of
type WORD.
DetectDocumentText
is a synchronous operation. To analyze
documents asynchronously, use StartDocumentTextDetection.
For more information, see Document Text Detection.
detectDocumentText
in interface AmazonTextract
detectDocumentTextRequest
- InvalidParameterException
InvalidS3ObjectException
UnsupportedDocumentException
DocumentTooLargeException
BadDocumentException
AccessDeniedException
ProvisionedThroughputExceededException
InternalServerErrorException
ThrottlingException
AmazonClientException
- If any internal errors are encountered
inside the client while attempting to make the request or
handle the response. For example if a network connection is
not available.AmazonServiceException
- If an error response is returned by Amazon
Textract indicating either a problem with the data in the
request, or a server side issue.public GetDocumentAnalysisResult getDocumentAnalysis(GetDocumentAnalysisRequest getDocumentAnalysisRequest) throws AmazonServiceException, AmazonClientException
Gets the results for an Amazon Textract asynchronous operation that analyzes text in a document.
You start asynchronous text analysis by calling
StartDocumentAnalysis, which returns a job identifier (
JobId
). When the text analysis operation finishes, Amazon
Textract publishes a completion status to the Amazon Simple Notification
Service (Amazon SNS) topic that's registered in the initial call to
StartDocumentAnalysis
. To get the results of the
text-detection operation, first check that the status value published to
the Amazon SNS topic is SUCCEEDED
. If so, call
GetDocumentAnalysis
, and pass the job identifier (
JobId
) from the initial call to
StartDocumentAnalysis
.
GetDocumentAnalysis
returns an array of Block
objects. The following types of information are returned:
Form data (key-value pairs). The related information is returned in two
Block objects, each of type KEY_VALUE_SET
: a KEY
Block
object and a VALUE Block
object. For
example, Name: Ana Silva Carolina contains a key and value.
Name: is the key. Ana Silva Carolina is the value.
Table and table cell data. A TABLE Block
object contains
information about a detected table. A CELL Block
object is
returned for each cell in a table.
Lines and words of text. A LINE Block
object contains one or
more WORD Block
objects. All lines and words that are
detected in the document are returned (including text that doesn't have a
relationship with the value of the StartDocumentAnalysis
FeatureTypes
input parameter).
Selection elements such as check boxes and option buttons (radio buttons)
can be detected in form data and in tables. A SELECTION_ELEMENT
Block
object contains information about a selection element,
including the selection status.
Use the MaxResults
parameter to limit the number of blocks
that are returned. If there are more results than specified in
MaxResults
, the value of NextToken
in the
operation response contains a pagination token for getting the next set
of results. To get the next page of results, call
GetDocumentAnalysis
, and populate the NextToken
request parameter with the token value that's returned from the previous
call to GetDocumentAnalysis
.
For more information, see Document Text Analysis.
getDocumentAnalysis
in interface AmazonTextract
getDocumentAnalysisRequest
- InvalidParameterException
AccessDeniedException
ProvisionedThroughputExceededException
InvalidJobIdException
InternalServerErrorException
ThrottlingException
InvalidS3ObjectException
AmazonClientException
- If any internal errors are encountered
inside the client while attempting to make the request or
handle the response. For example if a network connection is
not available.AmazonServiceException
- If an error response is returned by Amazon
Textract indicating either a problem with the data in the
request, or a server side issue.public GetDocumentTextDetectionResult getDocumentTextDetection(GetDocumentTextDetectionRequest getDocumentTextDetectionRequest) throws AmazonServiceException, AmazonClientException
Gets the results for an Amazon Textract asynchronous operation that detects text in a document. Amazon Textract can detect lines of text and the words that make up a line of text.
You start asynchronous text detection by calling
StartDocumentTextDetection, which returns a job identifier (
JobId
). When the text detection operation finishes, Amazon
Textract publishes a completion status to the Amazon Simple Notification
Service (Amazon SNS) topic that's registered in the initial call to
StartDocumentTextDetection
. To get the results of the
text-detection operation, first check that the status value published to
the Amazon SNS topic is SUCCEEDED
. If so, call
GetDocumentTextDetection
, and pass the job identifier (
JobId
) from the initial call to
StartDocumentTextDetection
.
GetDocumentTextDetection
returns an array of Block
objects.
Each document page has as an associated Block
of type PAGE.
Each PAGE Block
object is the parent of LINE
Block
objects that represent the lines of detected text on a
page. A LINE Block
object is a parent for each word that
makes up the line. Words are represented by Block
objects of
type WORD.
Use the MaxResults parameter to limit the number of blocks that are
returned. If there are more results than specified in
MaxResults
, the value of NextToken
in the
operation response contains a pagination token for getting the next set
of results. To get the next page of results, call
GetDocumentTextDetection
, and populate the
NextToken
request parameter with the token value that's
returned from the previous call to GetDocumentTextDetection
.
For more information, see Document Text Detection.
getDocumentTextDetection
in interface AmazonTextract
getDocumentTextDetectionRequest
- InvalidParameterException
AccessDeniedException
ProvisionedThroughputExceededException
InvalidJobIdException
InternalServerErrorException
ThrottlingException
InvalidS3ObjectException
AmazonClientException
- If any internal errors are encountered
inside the client while attempting to make the request or
handle the response. For example if a network connection is
not available.AmazonServiceException
- If an error response is returned by Amazon
Textract indicating either a problem with the data in the
request, or a server side issue.public StartDocumentAnalysisResult startDocumentAnalysis(StartDocumentAnalysisRequest startDocumentAnalysisRequest) throws AmazonServiceException, AmazonClientException
Starts the asynchronous analysis of an input document for relationships between detected items such as key-value pairs, tables, and selection elements.
StartDocumentAnalysis
can analyze text in documents that are
in JPEG, PNG, and PDF format. The documents are stored in an Amazon S3
bucket. Use DocumentLocation to specify the bucket name and file
name of the document.
StartDocumentAnalysis
returns a job identifier (
JobId
) that you use to get the results of the operation.
When text analysis is finished, Amazon Textract publishes a completion
status to the Amazon Simple Notification Service (Amazon SNS) topic that
you specify in NotificationChannel
. To get the results of
the text analysis operation, first check that the status value published
to the Amazon SNS topic is SUCCEEDED
. If so, call
GetDocumentAnalysis, and pass the job identifier (
JobId
) from the initial call to
StartDocumentAnalysis
.
For more information, see Document Text Analysis.
startDocumentAnalysis
in interface AmazonTextract
startDocumentAnalysisRequest
- InvalidParameterException
InvalidS3ObjectException
InvalidKMSKeyException
UnsupportedDocumentException
DocumentTooLargeException
BadDocumentException
AccessDeniedException
ProvisionedThroughputExceededException
InternalServerErrorException
IdempotentParameterMismatchException
ThrottlingException
LimitExceededException
AmazonClientException
- If any internal errors are encountered
inside the client while attempting to make the request or
handle the response. For example if a network connection is
not available.AmazonServiceException
- If an error response is returned by Amazon
Textract indicating either a problem with the data in the
request, or a server side issue.public StartDocumentTextDetectionResult startDocumentTextDetection(StartDocumentTextDetectionRequest startDocumentTextDetectionRequest) throws AmazonServiceException, AmazonClientException
Starts the asynchronous detection of text in a document. Amazon Textract can detect lines of text and the words that make up a line of text.
StartDocumentTextDetection
can analyze text in documents
that are in JPEG, PNG, and PDF format. The documents are stored in an
Amazon S3 bucket. Use DocumentLocation to specify the bucket name
and file name of the document.
StartTextDetection
returns a job identifier (
JobId
) that you use to get the results of the operation.
When text detection is finished, Amazon Textract publishes a completion
status to the Amazon Simple Notification Service (Amazon SNS) topic that
you specify in NotificationChannel
. To get the results of
the text detection operation, first check that the status value published
to the Amazon SNS topic is SUCCEEDED
. If so, call
GetDocumentTextDetection, and pass the job identifier (
JobId
) from the initial call to
StartDocumentTextDetection
.
For more information, see Document Text Detection.
startDocumentTextDetection
in interface AmazonTextract
startDocumentTextDetectionRequest
- InvalidParameterException
InvalidS3ObjectException
InvalidKMSKeyException
UnsupportedDocumentException
DocumentTooLargeException
BadDocumentException
AccessDeniedException
ProvisionedThroughputExceededException
InternalServerErrorException
IdempotentParameterMismatchException
ThrottlingException
LimitExceededException
AmazonClientException
- If any internal errors are encountered
inside the client while attempting to make the request or
handle the response. For example if a network connection is
not available.AmazonServiceException
- If an error response is returned by Amazon
Textract indicating either a problem with the data in the
request, or a server side issue.@Deprecated public ResponseMetadata getCachedResponseMetadata(AmazonWebServiceRequest request)
Response metadata is only cached for a limited period of time, so if you need to access this extra diagnostic information for an executed request, you should use this method to retrieve it as soon as possible after executing the request.
getCachedResponseMetadata
in interface AmazonTextract
request
- The originally executed requestCopyright © 2018 Amazon Web Services, Inc. All Rights Reserved.