AWS Textract

Extract text, tables, and forms from documents

AWS Textract is a powerful AI service from Amazon Web Services designed to automatically extract printed text, handwriting, tables, forms, key-value pairs, and other structured data from scanned documents and images. Textract leverages advanced optical character recognition (OCR) and document analysis to transform documents into actionable data, enabling automation, analytics, compliance, and more.

With AWS Textract, you can:

  • Extract text from images and documents: Recognize printed text and handwriting in formats such as PDF, JPEG, PNG, or TIFF
  • Detect and extract tables: Automatically find tables and output their structured content
  • Parse forms and key-value pairs: Pull structured data from forms, including fields and their corresponding values
  • Identify signatures and layout features: Detect signatures, geometric layout, and relationships between document elements
  • Customize extraction with queries: Extract specific fields and answers using query-based extraction (e.g., "What is the invoice number?")

In Sim, the AWS Textract integration empowers your agents to intelligently process documents as part of their workflows. This unlocks automation scenarios such as data entry from invoices, onboarding documents, contracts, receipts, and more. Your agents can extract relevant data, analyze structured forms, and generate summaries or reports directly from document uploads or URLs. By connecting Sim with AWS Textract, you can reduce manual effort, improve data accuracy, and streamline your business processes with robust document understanding.

Usage Instructions

Integrate AWS Textract into your workflow to extract text, tables, forms, and key-value pairs from documents. Single-page mode supports JPEG, PNG, and single-page PDF. Multi-page mode supports multi-page PDF and TIFF.

Tools

textract_parser

Input

ParameterTypeRequiredDescription
accessKeyIdstringYesAWS Access Key ID
secretAccessKeystringYesAWS Secret Access Key
regionstringYesAWS region for Textract service (e.g., us-east-1)
processingModestringNoDocument type: single-page or multi-page. Defaults to single-page.
filePathstringNoURL to a document to be processed (JPEG, PNG, or single-page PDF).
filefileNoDocument file to be processed (JPEG, PNG, or single-page PDF).
s3UristringNoS3 URI for multi-page processing (s3://bucket/key).
featureTypesarrayNoFeature types to detect: TABLES, FORMS, QUERIES, SIGNATURES, LAYOUT. If not specified, only text detection is performed.
itemsstringNoFeature type
queriesarrayNoCustom queries to extract specific information. Only used when featureTypes includes QUERIES.
itemsobjectNoQuery configuration
propertiesstringNoThe query text
TextstringNoNo description
AliasstringNoNo description

Output

This tool does not produce any outputs.

On this page

Start building today
Trusted by over 60,000 builders.
Build Agentic workflows visually on a drag-and-drop canvas or with natural language.
Get started