integrations.sh

cloudmersive.com – ocr

OpenAPI apis-guru text

The powerful Optical Character Recognition (OCR) APIs let you convert scanned images of pages into recognized text.

Homepage
https://api.apis.guru/v2/specs/cloudmersive.com:ocr/v1.json
Provider
cloudmersive.com:ocr / ocr
OpenAPI version
3.0.0
Spec (JSON)
https://api.apis.guru/v2/specs/cloudmersive.com/ocr/v1/openapi.json
Spec (YAML)
https://api.apis.guru/v2/specs/cloudmersive.com/ocr/v1/openapi.yaml
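
Since the spec is published as an OpenAPI 3.0.0 document, its operations can be enumerated by walking the `paths` object. The following is a minimal sketch under stated assumptions: `list_operations` and `SAMPLE_SPEC` are hypothetical names introduced here, the sample document is made up for illustration and is not the Cloudmersive spec, and nothing here describes how the executor SDK itself derives tool names.

```python
from typing import Any, Dict, List

# HTTP methods that may appear as operation keys in an OpenAPI 3.0 path item.
HTTP_METHODS = {"get", "put", "post", "delete", "options", "head", "patch", "trace"}


def list_operations(spec: Dict[str, Any]) -> List[str]:
    """Return one 'METHOD path -> operationId' string per operation, sorted."""
    found = []
    for path, item in spec.get("paths", {}).items():
        for method, operation in item.items():
            # Path items can also hold non-operation keys such as "parameters".
            if method.lower() in HTTP_METHODS and isinstance(operation, dict):
                op_id = operation.get("operationId", "(no operationId)")
                found.append(f"{method.upper()} {path} -> {op_id}")
    return sorted(found)


# Made-up miniature document for illustration; not the Cloudmersive spec.
SAMPLE_SPEC = {
    "openapi": "3.0.0",
    "paths": {
        "/example/toText": {
            "parameters": [],
            "post": {"operationId": "Example_ToText"},
        },
        "/example/angle": {"post": {"operationId": "Example_GetAngle"}},
    },
}

for line in list_operations(SAMPLE_SPEC):
    print(line)
```

To try this on the real document, one option is to download the Spec (JSON) file listed above, parse it with `json.load`, and pass the resulting dict to `list_operations`.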

Tools (21)

Extracted live via the executor SDK.

  • imageOcr.imageOcrImageLinesWithLocation

    Converts an uploaded image in a common format such as JPEG or PNG into lines/text with location information and other metadata via Optical Character Recognition. This API is intended to be run on scanned documents. If you want to OCR photos (e.g. taken with a smartphone camera), be sure to use the photo/toText API instead, as it is designed to unskew the image first. Note: to use a free-tier API key with this API, you must first add a credit card to your account (required for security reasons).

  • imageOcr.imageOcrImageWordsWithLocation

    Converts an uploaded image in a common format such as JPEG or PNG into words/text with location information and other metadata via Optical Character Recognition. This API is intended to be run on scanned documents. If you want to OCR photos (e.g. taken with a smartphone camera), be sure to use the photo/toText API instead, as it is designed to unskew the image first. Note: to use a free-tier API key with this API, you must first add a credit card to your account (required for security reasons).

  • imageOcr.imageOcrPhotoRecognizeBusinessCard

    Analyzes a photograph of a business card and outputs key business information such as the name of the person, the name of the business, the address of the business, the phone number, the email address, and more. Note: to use a free-tier API key with this API, you must first add a credit card to your account (required for security reasons).

  • imageOcr.imageOcrPhotoRecognizeForm

    Analyzes a photograph of a form and outputs key business fields and information. Customize the data to be extracted by defining fields for the form. Note: to use a free-tier API key with this API, you must first add a credit card to your account (required for security reasons).

  • imageOcr.imageOcrPhotoRecognizeFormAdvanced

    Analyzes a photograph of a form and outputs key business fields and information. Customize the data to be extracted by defining fields for the form. Uses template definitions stored in Cloudmersive Configuration; to configure stored templates in a configuration bucket, log into the Cloudmersive Management Portal and navigate to Settings > API Configuration > Create Bucket. Note: to use a free-tier API key with this API, you must first add a credit card to your account (required for security reasons).

  • imageOcr.imageOcrPhotoRecognizeReceipt

    Analyzes a photograph of a receipt and outputs key business information such as the name of the business, the address of the business, the phone number of the business, the total of the receipt, the date of the receipt, and more. Note: to use a free-tier API key with this API, you must first add a credit card to your account (required for security reasons).

  • imageOcr.imageOcrPhotoToText

    Converts an uploaded photo of a document in a common format such as JPEG or PNG into text via Optical Character Recognition. This API is intended to be run on photos of documents (e.g. taken with a smartphone) and supports cases where other content, such as a desk, is in the frame and the camera is crooked. If you want to OCR a scanned image, use the image/toText API call instead, as it is designed for scanned images. Note: to use a free-tier API key with this API, you must first add a credit card to your account (required for security reasons).

  • imageOcr.imageOcrPhotoWordsWithLocation

    Converts a photo of a document or receipt in a common format such as JPEG or PNG into words/text with location information and other metadata via Optical Character Recognition. This API is intended to be run on photographs of documents. If you want to OCR scanned documents (e.g. produced with a scanner), be sure to use the image/toText API instead, as it is designed for that use case. Note: to use a free-tier API key with this API, you must first add a credit card to your account (required for security reasons).

  • imageOcr.imageOcrPost

    Converts an uploaded image in a common format such as JPEG or PNG into text via Optical Character Recognition. This API is intended to be run on scanned documents. If you want to OCR photos (e.g. taken with a smartphone camera), be sure to use the photo/toText API instead, as it is designed to unskew the image first. Note: to use a free-tier API key with this API, you must first add a credit card to your account (required for security reasons).

  • pdfOcr.pdfOcrPdfToLinesWithLocation

    Converts a PDF into lines/text with location information and other metadata via Optical Character Recognition. This API is intended to be run on scanned documents. If you want to OCR photos (e.g. taken with a smartphone camera), be sure to use the photo/toText API instead, as it is designed to unskew the image first.

  • pdfOcr.pdfOcrPdfToWordsWithLocation

    Converts a PDF into words/text with location information and other metadata via Optical Character Recognition. This API is intended to be run on scanned documents. If you want to OCR photos (e.g. taken with a smartphone camera), be sure to use the photo/toText API instead, as it is designed to unskew the image first.

  • pdfOcr.pdfOcrPost

    Converts an uploaded PDF file into text via Optical Character Recognition.

  • preprocessing.preprocessingBinarize

    Perform an adaptive binarization algorithm on the input image to prepare it for further OCR operations.

  • preprocessing.preprocessingBinarizeAdvanced

    Perform an advanced adaptive, Deep Learning-based binarization algorithm on the input image to prepare it for further OCR operations. Provides higher accuracy than adaptive binarization. The image will be upsampled to 300 DPI if its DPI is below 300.

  • preprocessing.preprocessingGetPageAngle

    Analyzes a photo or image of a document and identifies the rotation angle of the page.

  • preprocessing.preprocessingUnrotate

    Detect and unrotate an image of a document (e.g. that was scanned at an angle). Great for document scanning applications; once unskewed, this image is perfect for converting to PDF using the Convert API or optical character recognition using the OCR API.

  • preprocessing.preprocessingUnrotateAdvanced

    Detect and unrotate an image of a document (e.g. that was scanned at an angle) using deep learning. Great for document scanning applications; once unskewed, this image is perfect for converting to PDF using the Convert API or optical character recognition using the OCR API.

  • preprocessing.preprocessingUnskew

    Detect and unskew a photo of a document (e.g. taken on a cell phone) into a perfectly square image. Great for document scanning applications; once unskewed, this image is perfect for converting to PDF using the Convert API or optical character recognition using the OCR API.

  • receipts.receiptsPhotoToCsv

    Leverage Deep Learning to automatically turn a photo of a receipt into a CSV file containing the structured information from the receipt.

  • openapi.previewSpec

    Preview an OpenAPI document before adding it as a source.

  • openapi.addSource

    Add an OpenAPI source and register its operations as tools.
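
The descriptions above repeatedly distinguish scanned images, photos, and PDFs, and plain text from words or lines with location. One way to capture that guidance is a small lookup table. This is only a sketch: the tool names are copied from the list above, but the `source` and `detail` labels and the `pick_tool` helper are hypothetical, and pairing `imageOcr.imageOcrPost` with the "image/toText" call mentioned in the descriptions is an inference from its description, not something the page states.

```python
from typing import Dict, Tuple

# Keys are hypothetical (source, detail) labels introduced for this sketch;
# values are tool names copied from the list above.
TOOLS: Dict[Tuple[str, str], str] = {
    ("scan", "text"): "imageOcr.imageOcrPost",
    ("scan", "words"): "imageOcr.imageOcrImageWordsWithLocation",
    ("scan", "lines"): "imageOcr.imageOcrImageLinesWithLocation",
    ("photo", "text"): "imageOcr.imageOcrPhotoToText",
    ("photo", "words"): "imageOcr.imageOcrPhotoWordsWithLocation",
    ("pdf", "text"): "pdfOcr.pdfOcrPost",
    ("pdf", "words"): "pdfOcr.pdfOcrPdfToWordsWithLocation",
    ("pdf", "lines"): "pdfOcr.pdfOcrPdfToLinesWithLocation",
}


def pick_tool(source: str, detail: str = "text") -> str:
    """Return the listed tool name for a kind of input and level of detail.

    Raises ValueError when the list has no matching tool, for example
    lines-with-location for photos.
    """
    try:
        return TOOLS[(source, detail)]
    except KeyError:
        raise ValueError(
            f"no listed tool for source={source!r}, detail={detail!r}"
        ) from None


print(pick_tool("photo"))         # plain text from a smartphone photo
print(pick_tool("pdf", "lines"))  # lines with location from a PDF
```

A table keeps the scan-versus-photo choice explicit and makes gaps in the list visible as errors instead of silently falling back to a tool designed for a different kind of input.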