Published: Dec 15, 2022
Amazon Web Services announced earlier this month Amazon Comprehend for Intelligent Document Processing (IDP). This is a new natural language processing (NLP) service that uses machine learning (ML) to uncover relationships and connections in various document types.
Amazon Comprehend for IDP enables you to classify and extract entities from PDF documents, Microsoft Word files, and images directly without requiring you to extract the text first. Previously, businesses would typically need to pre-process those documents and convert them into some type of machine-readable text which can reduce the quality of the document context.
Some of the things that businesses can do with Amazon Comprehend for IDP include:
You can use Amazon Comprehend for IDP from the AWS Management Console, AWS SDKs, or the AWS Command Line Interface (CLI). Amazon Comprehend offers a free tier covering 50K units of text (5M characters) per API per month. Eligible APIs for the free tier include Key Phrase Extraction, Sentiment, Targeted Sentiment, Entity Recognition, Language Detection, Event Detection, Syntax Analysis, Detect PII, and Contains PII.
Beyond the free tier, Amazon Comprehend for IDP is priced per unit and per feature. Some sample pricing is shown in the table below.
Feature | Up to 10M units | From 10M-50M units | Over 50M units |
Key Phrase Extraction | $0.0001 | $0.00005 | $0.000025 |
Sentiment Analysis | $0.0001 | $0.00005 | $0.000025 |
Targeted Sentiment | $0.0001 | $0.00005 | $0.000025 |
Entity Recognition | $0.0001 | $0.00005 | $0.000025 |
Language Detection | $0.0001 | $0.00005 | $0.000025 |
Event Detection | $0.003 | $0.0015 | $0.00075 |
Syntax Analysis | $0.00005 | $0.000025 | $0.0000125 |
You can estimate your costs for Amazon Comprehend for IDP using the AWS Pricing Calculator. For regional availability, you can refer to the AWS Regional Services List.