cloudmersive.com – ocr

OpenAPI apis-guru text

The powerful Optical Character Recognition (OCR) APIs let you convert scanned images of pages into recognized text.

Homepage: https://api.apis.guru/v2/specs/cloudmersive.com:ocr/v1.json
Provider: cloudmersive.com:ocr / ocr
OpenAPI version: 3.0.0
Spec (JSON): https://api.apis.guru/v2/specs/cloudmersive.com/ocr/v1/openapi.json
Spec (YAML): https://api.apis.guru/v2/specs/cloudmersive.com/ocr/v1/openapi.yaml

Tools (21)

Extracted live via the executor SDK.

imageOcr.imageOcrImageLinesWithLocation

Converts an uploaded image in common formats such as JPEG, PNG into lines/text with location information and other metdata via Optical Character Recognition. This API is intended to be run on scanned documents. If you want to OCR photos (e.g. taken with a smart phone camera), be sure to use the photo/toText API instead, as it is designed to unskew the image first. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
imageOcr.imageOcrImageWordsWithLocation

Converts an uploaded image in common formats such as JPEG, PNG into words/text with location information and other metdata via Optical Character Recognition. This API is intended to be run on scanned documents. If you want to OCR photos (e.g. taken with a smart phone camera), be sure to use the photo/toText API instead, as it is designed to unskew the image first. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
imageOcr.imageOcrPhotoRecognizeBusinessCard

Analyzes a photograph of a business card as input, and outputs key business information such as the name of the person, name of the business, the address of the business, the phone number, the email address and more. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
imageOcr.imageOcrPhotoRecognizeForm

Analyzes a photograph of a form as input, and outputs key business fields and information. Customzie data to be extracted by defining fields for the form. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
imageOcr.imageOcrPhotoRecognizeFormAdvanced

Analyzes a photograph of a form as input, and outputs key business fields and information. Customzie data to be extracted by defining fields for the form. Uses template definitions stored in Cloudmersive Configuration; to configure stored templates in a configuration bucket, log into Cloudmersive Management Portal and navigate to Settings > API Configuration > Create Bucket. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
imageOcr.imageOcrPhotoRecognizeReceipt

Analyzes a photograph of a receipt as input, and outputs key business information such as the name of the business, the address of the business, the phone number of the business, the total of the receipt, the date of the receipt, and more. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
imageOcr.imageOcrPhotoToText

Converts an uploaded photo of a document in common formats such as JPEG, PNG into text via Optical Character Recognition. This API is intended to be run on photos of documents, e.g. taken with a smartphone and supports cases where other content, such as a desk, are in the frame and the camera is crooked. If you want to OCR a scanned image, use the image/toText API call instead as it is designed for scanned images. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
imageOcr.imageOcrPhotoWordsWithLocation

Converts a photo of a document or receipt in common formats such as JPEG, PNG into words/text with location information and other metdata via Optical Character Recognition. This API is intended to be run on photographs of documents. If you want to OCR scanned documents (e.g. taken with a scanner), be sure to use the image/toText API instead, as it is designed for that use case. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
imageOcr.imageOcrPost

Converts an uploaded image in common formats such as JPEG, PNG into text via Optical Character Recognition. This API is intended to be run on scanned documents. If you want to OCR photos (e.g. taken with a smart phone camera), be sure to use the photo/toText API instead, as it is designed to unskew the image first. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
pdfOcr.pdfOcrPdfToLinesWithLocation

Converts a PDF into lines/text with location information and other metdata via Optical Character Recognition. This API is intended to be run on scanned documents. If you want to OCR photos (e.g. taken with a smart phone camera), be sure to use the photo/toText API instead, as it is designed to unskew the image first.
pdfOcr.pdfOcrPdfToWordsWithLocation

Converts a PDF into words/text with location information and other metdata via Optical Character Recognition. This API is intended to be run on scanned documents. If you want to OCR photos (e.g. taken with a smart phone camera), be sure to use the photo/toText API instead, as it is designed to unskew the image first.
pdfOcr.pdfOcrPost

Converts an uploaded PDF file into text via Optical Character Recognition.
preprocessing.preprocessingBinarize

Perform an adaptive binarization algorithm on the input image to prepare it for further OCR operations.
preprocessing.preprocessingBinarizeAdvanced

Perform an advanced adaptive, Deep Learning-based binarization algorithm on the input image to prepare it for further OCR operations. Provides enhanced accuracy than adaptive binarization. Image will be upsampled to 300 DPI if it has a DPI below 300.
preprocessing.preprocessingGetPageAngle

Analyzes a photo or image of a document and identifies the rotation angle of the page.
preprocessing.preprocessingUnrotate

Detect and unrotate an image of a document (e.g. that was scanned at an angle). Great for document scanning applications; once unskewed, this image is perfect for converting to PDF using the Convert API or optical character recognition using the OCR API.
preprocessing.preprocessingUnrotateAdvanced

Detect and unrotate an image of a document (e.g. that was scanned at an angle) using deep learning. Great for document scanning applications; once unskewed, this image is perfect for converting to PDF using the Convert API or optical character recognition using the OCR API.
preprocessing.preprocessingUnskew

Detect and unskew a photo of a document (e.g. taken on a cell phone) into a perfectly square image. Great for document scanning applications; once unskewed, this image is perfect for converting to PDF using the Convert API or optical character recognition using the OCR API.
receipts.receiptsPhotoToCsv

Leverage Deep Learning to automatically turn a photo of a receipt into a CSV file containing the structured information from the receipt.
openapi.previewSpec

Preview an OpenAPI document before adding it as a source
openapi.addSource

Add an OpenAPI source and register its operations as tools