cloudmersive.com – ocr
The powerful Optical Character Recognition (OCR) APIs let you convert scanned images of pages into recognized text.
- Homepage
- https://api.apis.guru/v2/specs/cloudmersive.com:ocr/v1.json
- Provider
- cloudmersive.com:ocr / ocr
- OpenAPI version
- 3.0.0
- Spec (JSON)
- https://api.apis.guru/v2/specs/cloudmersive.com/ocr/v1/openapi.json
- Spec (YAML)
- https://api.apis.guru/v2/specs/cloudmersive.com/ocr/v1/openapi.yaml
Tools (21)
Extracted live via the executor SDK.
-
imageOcr.imageOcrImageLinesWithLocationConverts an uploaded image in common formats such as JPEG, PNG into lines/text with location information and other metdata via Optical Character Recognition. This API is intended to be run on scanned documents. If you want to OCR photos (e.g. taken with a smart phone camera), be sure to use the photo/toText API instead, as it is designed to unskew the image first. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
-
imageOcr.imageOcrImageWordsWithLocationConverts an uploaded image in common formats such as JPEG, PNG into words/text with location information and other metdata via Optical Character Recognition. This API is intended to be run on scanned documents. If you want to OCR photos (e.g. taken with a smart phone camera), be sure to use the photo/toText API instead, as it is designed to unskew the image first. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
-
imageOcr.imageOcrPhotoRecognizeBusinessCardAnalyzes a photograph of a business card as input, and outputs key business information such as the name of the person, name of the business, the address of the business, the phone number, the email address and more. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
-
imageOcr.imageOcrPhotoRecognizeFormAnalyzes a photograph of a form as input, and outputs key business fields and information. Customzie data to be extracted by defining fields for the form. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
-
imageOcr.imageOcrPhotoRecognizeFormAdvancedAnalyzes a photograph of a form as input, and outputs key business fields and information. Customzie data to be extracted by defining fields for the form. Uses template definitions stored in Cloudmersive Configuration; to configure stored templates in a configuration bucket, log into Cloudmersive Management Portal and navigate to Settings > API Configuration > Create Bucket. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
-
imageOcr.imageOcrPhotoRecognizeReceiptAnalyzes a photograph of a receipt as input, and outputs key business information such as the name of the business, the address of the business, the phone number of the business, the total of the receipt, the date of the receipt, and more. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
-
imageOcr.imageOcrPhotoToTextConverts an uploaded photo of a document in common formats such as JPEG, PNG into text via Optical Character Recognition. This API is intended to be run on photos of documents, e.g. taken with a smartphone and supports cases where other content, such as a desk, are in the frame and the camera is crooked. If you want to OCR a scanned image, use the image/toText API call instead as it is designed for scanned images. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
-
imageOcr.imageOcrPhotoWordsWithLocationConverts a photo of a document or receipt in common formats such as JPEG, PNG into words/text with location information and other metdata via Optical Character Recognition. This API is intended to be run on photographs of documents. If you want to OCR scanned documents (e.g. taken with a scanner), be sure to use the image/toText API instead, as it is designed for that use case. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
-
imageOcr.imageOcrPostConverts an uploaded image in common formats such as JPEG, PNG into text via Optical Character Recognition. This API is intended to be run on scanned documents. If you want to OCR photos (e.g. taken with a smart phone camera), be sure to use the photo/toText API instead, as it is designed to unskew the image first. Note: for free tier API keys, it is required to add a credit card to your account for security reasons, to use the free tier key with this API.
-
pdfOcr.pdfOcrPdfToLinesWithLocationConverts a PDF into lines/text with location information and other metdata via Optical Character Recognition. This API is intended to be run on scanned documents. If you want to OCR photos (e.g. taken with a smart phone camera), be sure to use the photo/toText API instead, as it is designed to unskew the image first.
-
pdfOcr.pdfOcrPdfToWordsWithLocationConverts a PDF into words/text with location information and other metdata via Optical Character Recognition. This API is intended to be run on scanned documents. If you want to OCR photos (e.g. taken with a smart phone camera), be sure to use the photo/toText API instead, as it is designed to unskew the image first.
-
pdfOcr.pdfOcrPostConverts an uploaded PDF file into text via Optical Character Recognition.
-
preprocessing.preprocessingBinarizePerform an adaptive binarization algorithm on the input image to prepare it for further OCR operations.
-
preprocessing.preprocessingBinarizeAdvancedPerform an advanced adaptive, Deep Learning-based binarization algorithm on the input image to prepare it for further OCR operations. Provides enhanced accuracy than adaptive binarization. Image will be upsampled to 300 DPI if it has a DPI below 300.
-
preprocessing.preprocessingGetPageAngleAnalyzes a photo or image of a document and identifies the rotation angle of the page.
-
preprocessing.preprocessingUnrotateDetect and unrotate an image of a document (e.g. that was scanned at an angle). Great for document scanning applications; once unskewed, this image is perfect for converting to PDF using the Convert API or optical character recognition using the OCR API.
-
preprocessing.preprocessingUnrotateAdvancedDetect and unrotate an image of a document (e.g. that was scanned at an angle) using deep learning. Great for document scanning applications; once unskewed, this image is perfect for converting to PDF using the Convert API or optical character recognition using the OCR API.
-
preprocessing.preprocessingUnskewDetect and unskew a photo of a document (e.g. taken on a cell phone) into a perfectly square image. Great for document scanning applications; once unskewed, this image is perfect for converting to PDF using the Convert API or optical character recognition using the OCR API.
-
receipts.receiptsPhotoToCsvLeverage Deep Learning to automatically turn a photo of a receipt into a CSV file containing the structured information from the receipt.
-
openapi.previewSpecPreview an OpenAPI document before adding it as a source
-
openapi.addSourceAdd an OpenAPI source and register its operations as tools