API documentation for the vision_v1p2beta1.types module.
AnnotateFileResponse: Response to a single file annotation request. A file may contain one or more images, which individually have their own responses.
Individual responses to images found within the file.
AnnotateImageRequest: Request for performing Google Cloud Vision API tasks over a user-provided image, with user-requested features.
Requested features.
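A minimal sketch of constructing such a request with the Python client. This assumes a pre-2.0 google-cloud-vision release, where the generated types live under vision_v1p2beta1.types; the bucket path is a placeholder::

    from google.cloud import vision_v1p2beta1 as vision

    # One AnnotateImageRequest pairs a single image with the features to run.
    request = vision.types.AnnotateImageRequest(
        image=vision.types.Image(
            source=vision.types.ImageSource(image_uri="gs://my-bucket/photo.jpg"),
        ),
        features=[
            vision.types.Feature(type=vision.enums.Feature.Type.LABEL_DETECTION),
            vision.types.Feature(type=vision.enums.Feature.Type.LANDMARK_DETECTION),
        ],
    )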
AnnotateImageResponse: Response to an image annotation request.
If present, landmark detection has completed successfully.
If present, label detection has completed successfully.
If present, text (OCR) detection or document (OCR) text detection has completed successfully. This annotation provides the structural hierarchy for the OCR detected text.
If present, image properties were extracted successfully.
If present, web detection has completed successfully.
If present, contextual information is needed to understand where this image comes from.
Any: API documentation for the vision_v1p2beta1.types.Any class.
AsyncAnnotateFileRequest: An offline file annotation request.
Required. Requested features.
Required. The desired output location and metadata (e.g. format).
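A sketch of assembling one such request, under the same pre-2.0 client assumption; both gs:// URIs are placeholders::

    from google.cloud import vision_v1p2beta1 as vision

    async_request = vision.types.AsyncAnnotateFileRequest(
        features=[vision.types.Feature(
            type=vision.enums.Feature.Type.DOCUMENT_TEXT_DETECTION)],
        input_config=vision.types.InputConfig(
            gcs_source=vision.types.GcsSource(uri="gs://my-bucket/report.pdf"),
            mime_type="application/pdf",  # only application/pdf and image/tiff are supported
        ),
        output_config=vision.types.OutputConfig(
            gcs_destination=vision.types.GcsDestination(uri="gs://my-bucket/ocr/"),
            batch_size=20,
        ),
    )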
AsyncAnnotateFileResponse: The response for a single offline file annotation request.
AsyncBatchAnnotateFilesRequest: Multiple async file annotation requests are batched into a single service call.
AsyncBatchAnnotateFilesResponse: Response to an async batch file annotation request.
BatchAnnotateImagesRequest: Multiple image annotation requests are batched into a single service call.
BatchAnnotateImagesResponse: Response to a batch image annotation request.
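A sketch of issuing a batched call, assuming request_a and request_b are AnnotateImageRequest objects built as in the earlier sketch; responses come back in the same order as the requests::

    from google.cloud import vision_v1p2beta1 as vision

    client = vision.ImageAnnotatorClient()
    response = client.batch_annotate_images(requests=[request_a, request_b])
    for image_response in response.responses:
        for label in image_response.label_annotations:
            print(label.description, label.score)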
Block: Logical element on the page.
The bounding box for the block. The vertices are in the order of top-left, top-right, bottom-right, bottom-left. When a rotation of the bounding box is detected, the rotation is represented as around the top-left corner as defined when the text is read in the 'natural' orientation. For example, when the text is horizontal it might look like::

    0----1
    |    |
    3----2

When it's rotated 180 degrees around the top-left corner it becomes::

    2----3
    |    |
    1----0

and the vertex order will still be (0, 1, 2, 3).
Detected block type (text, image, etc.) for this block.
BoolValue: API documentation for the vision_v1p2beta1.types.BoolValue class.
BoundingPoly: A bounding polygon for the detected image annotation.
The bounding polygon normalized vertices.
BytesValue: API documentation for the vision_v1p2beta1.types.BytesValue class.
CancelOperationRequest: API documentation for the vision_v1p2beta1.types.CancelOperationRequest class.
Color: API documentation for the vision_v1p2beta1.types.Color class.
ColorInfo: Color information consists of RGB channels, score, and the fraction of the image that the color occupies.
Image-specific score for this color. Value in range [0, 1].
CropHint: Single crop hint that is used to generate a new crop when serving an image.
Confidence of this being a salient region. Range [0, 1].
CropHintsAnnotation: Set of crop hints that are used to generate new crops when serving images.
CropHintsParams: Parameters for crop hints annotation request.
DeleteOperationRequest: API documentation for the vision_v1p2beta1.types.DeleteOperationRequest class.
DominantColorsAnnotation: Set of dominant colors and their corresponding scores.
DoubleValue: API documentation for the vision_v1p2beta1.types.DoubleValue class.
EntityAnnotation: Set of detected entity features.
The language code for the locale in which the entity textual description is expressed.
Overall score of the result. Range [0, 1].
The relevancy of the ICA (Image Content Annotation) label to the image. For example, the relevancy of "tower" is likely higher to an image containing the detected "Eiffel Tower" than to an image containing a detected distant towering building, even though the confidence that there is a tower in each image may be the same. Range [0, 1].
The location information for the detected entity. Multiple LocationInfo elements can be present because one location may indicate the location of the scene in the image, and another location may indicate the location of the place where the image was taken. Location information is usually present for landmarks.
FaceAnnotation: A face annotation object contains the results of face detection.
The fd_bounding_poly bounding polygon is tighter than the boundingPoly, and encloses only the skin part of the face. Typically, it is used to eliminate the face from any image analysis that detects the "amount of skin" visible in an image. It is not based on the landmarker results, only on the initial face detection, hence the fd (face detection) prefix.
Roll angle, which indicates the amount of clockwise/anti-clockwise rotation of the face relative to the image vertical about the axis perpendicular to the face. Range [-180,180].
Pitch angle, which indicates the upwards/downwards angle that the face is pointing relative to the image's horizontal plane. Range [-180,180].
Face landmarking confidence. Range [0, 1].
Sorrow likelihood.
Surprise likelihood.
Blurred likelihood.
Feature: The type of Google Cloud Vision API detection to perform, and the maximum number of results to return for that type. Multiple Feature objects can be specified in the features list.
Maximum number of results of this type. Does not apply to TEXT_DETECTION, DOCUMENT_TEXT_DETECTION, or CROP_HINTS.
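For illustration, a Feature asking for at most five labels (pre-2.0 client assumed)::

    from google.cloud import vision_v1p2beta1 as vision

    # max_results caps LABEL_DETECTION here; it would be ignored for
    # TEXT_DETECTION, DOCUMENT_TEXT_DETECTION, and CROP_HINTS.
    label_feature = vision.types.Feature(
        type=vision.enums.Feature.Type.LABEL_DETECTION,
        max_results=5,
    )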
FloatValue: API documentation for the vision_v1p2beta1.types.FloatValue class.
GcsDestination: The Google Cloud Storage location where the output will be written to.
GcsSource: The Google Cloud Storage location where the input will be read from.
GetOperationRequest: API documentation for the vision_v1p2beta1.types.GetOperationRequest class.
Image: Client image to perform Google Cloud Vision API tasks over.
Google Cloud Storage image location, or publicly-accessible image URL. If both content and source are provided for an image, content takes precedence and is used to perform the image annotation request.
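A sketch of the precedence rule (placeholder file and bucket names)::

    from google.cloud import vision_v1p2beta1 as vision

    with open("photo.jpg", "rb") as f:
        image_bytes = f.read()

    image = vision.types.Image(
        content=image_bytes,
        source=vision.types.ImageSource(image_uri="gs://my-bucket/photo.jpg"),
    )
    # With both fields populated, the service annotates the raw bytes in
    # `content` and ignores `source`.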
ImageAnnotationContext: If an image was produced from a file (e.g. a PDF), this message gives information about the source of that image.
If the file was a PDF or TIFF, this field gives the page number within the file used to produce the image.
ImageContext: Image context and/or feature-specific parameters.
List of languages to use for TEXT_DETECTION. In most cases, an empty value yields the best results since it enables automatic language detection. For languages based on the Latin alphabet, setting language_hints is not needed. In rare cases, when the language of the text in the image is known, setting a hint will help get better results (although it will be a significant hindrance if the hint is wrong). Text detection returns an error if one or more of the specified languages is not one of the supported languages (https://cloud.google.com/vision/docs/languages).
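A sketch of passing a hint, assuming the text is known to be Japanese and reusing the image from the previous sketch::

    from google.cloud import vision_v1p2beta1 as vision

    # Usually leave language_hints unset; a wrong hint hurts accuracy.
    context = vision.types.ImageContext(language_hints=["ja"])
    request = vision.types.AnnotateImageRequest(
        image=image,
        features=[vision.types.Feature(type=vision.enums.Feature.Type.TEXT_DETECTION)],
        image_context=context,
    )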
Parameters for web detection.
ImageProperties: Stores image properties, such as dominant colors.
ImageSource: External image source (Google Cloud Storage or web URL image location).
The URI of the source image. Can be either:
1. A Google Cloud Storage URI of the form gs://bucket_name/object_name. Object versioning is not supported. See Google Cloud Storage Request URIs (https://cloud.google.com/storage/docs/reference-uris) for more info.
2. A publicly-accessible image HTTP/HTTPS URL. When fetching images from HTTP/HTTPS URLs, Google cannot guarantee that the request will be completed. Your request may fail if the specified host denies the request (e.g. due to request throttling or DOS prevention), or if Google throttles requests to the site for abuse prevention. You should not depend on externally-hosted images for production applications.
When both gcs_image_uri and image_uri are specified, image_uri takes precedence.
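The two forms side by side (placeholder URIs)::

    from google.cloud import vision_v1p2beta1 as vision

    # Form 1: a Cloud Storage object.
    gcs_source = vision.types.ImageSource(gcs_image_uri="gs://my-bucket/photo.jpg")

    # Form 2: a publicly reachable URL; the fetch may fail if the host refuses.
    web_source = vision.types.ImageSource(image_uri="https://example.com/photo.jpg")
    # If both fields were set on one ImageSource, image_uri would win.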
InputConfig: The desired input location and metadata.
The type of the file. Currently only "application/pdf" and "image/tiff" are supported. Wildcards are not supported.
Int32Value: API documentation for the vision_v1p2beta1.types.Int32Value class.
Int64Value: API documentation for the vision_v1p2beta1.types.Int64Value class.
LatLng: API documentation for the vision_v1p2beta1.types.LatLng class.
LatLongRect: Rectangle determined by min and max LatLng pairs.
Max lat/long pair.
ListOperationsRequest: API documentation for the vision_v1p2beta1.types.ListOperationsRequest class.
ListOperationsResponse: API documentation for the vision_v1p2beta1.types.ListOperationsResponse class.
LocationInfo: Detected entity location information.
NormalizedVertex: A vertex represents a 2D point in the image; NormalizedVertex coordinates are relative to the original image and range from 0 to 1.
X coordinate.
Operation: API documentation for the vision_v1p2beta1.types.Operation class.
OperationInfo: API documentation for the vision_v1p2beta1.types.OperationInfo class.
OperationMetadata: Contains metadata for the BatchAnnotateImages operation.
The time when the batch request was received.
OutputConfig: The desired output location and metadata.
The max number of response protos to put into each output JSON file on GCS. The valid range is [1, 100]. If not specified, the default value is 20. For example, for one pdf file with 100 pages, 100 response protos will be generated. If batch_size = 20, then 5 json files each containing 20 response protos will be written under the prefix gcs_destination.uri. Currently, batch_size only applies to GcsDestination, with potential future support for other output configurations.
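A sketch of kicking off the offline job and the resulting file count, reusing the async_request built earlier (pre-2.0 client assumed)::

    from google.cloud import vision_v1p2beta1 as vision

    client = vision.ImageAnnotatorClient()
    # 100 pages with batch_size=20 -> ceil(100 / 20) = 5 JSON output files,
    # each holding 20 response protos, written under gcs_destination.uri.
    operation = client.async_batch_annotate_files(requests=[async_request])
    operation.result(timeout=300)  # block until the offline annotation finishes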
Page: Detected page from OCR.
Page width. For PDFs the unit is points. For images (including TIFFs) the unit is pixels.
List of blocks of text, images etc on this page.
Paragraph: Structural unit of text representing a number of words in certain order.
The bounding box for the paragraph. The vertices are in the order of top-left, top-right, bottom-right, bottom-left. When a rotation of the bounding box is detected, the rotation is represented as around the top-left corner as defined when the text is read in the 'natural' orientation. For example, when the text is horizontal it might look like::

    0----1
    |    |
    3----2

When it's rotated 180 degrees around the top-left corner it becomes::

    2----3
    |    |
    1----0

and the vertex order will still be (0, 1, 2, 3).
Confidence of the OCR results for the paragraph. Range [0, 1].
Position: A 3D position in the image, used primarily for Face detection landmarks. A valid Position must have both x and y coordinates. The position coordinates are in the same scale as the original image.
Y coordinate.
Property: A Property consists of a user-supplied name/value pair.
Value of the property.
SafeSearchAnnotation: Set of features pertaining to the image, computed by computer vision methods over safe-search verticals (for example, adult, spoof, medical, violence).
Spoof likelihood. The likelihood that a modification was made to the image's canonical version to make it appear funny or offensive.
Likelihood that this image contains violent content.
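A sketch of acting on the verdicts, assuming `response` is an AnnotateImageResponse from a SAFE_SEARCH_DETECTION request (pre-2.0 client assumed)::

    from google.cloud import vision_v1p2beta1 as vision

    annotation = response.safe_search_annotation
    likelihood = vision.enums.Likelihood
    # Likelihood values are ordered, so >= comparisons express thresholds.
    if annotation.adult >= likelihood.LIKELY or annotation.violence >= likelihood.LIKELY:
        print("image flagged by SafeSearch")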
Status: API documentation for the vision_v1p2beta1.types.Status class.
StringValue: API documentation for the vision_v1p2beta1.types.StringValue class.
Symbol: A single symbol representation.
The bounding box for the symbol. The vertices are in the order of top-left, top-right, bottom-right, bottom-left. When a rotation of the bounding box is detected, the rotation is represented as around the top-left corner as defined when the text is read in the 'natural' orientation. For example, when the text is horizontal it might look like::

    0----1
    |    |
    3----2

When it's rotated 180 degrees around the top-left corner it becomes::

    2----3
    |    |
    1----0

and the vertex order will still be (0, 1, 2, 3).
Confidence of the OCR results for the symbol. Range [0, 1].
TextAnnotation: TextAnnotation contains a structured representation of OCR extracted text. The hierarchy of an OCR extracted text structure is like this: TextAnnotation -> Page -> Block -> Paragraph -> Word -> Symbol. Each structural component, starting from Page, may further have its own properties. Properties describe detected languages, breaks, etc. Please refer to the TextAnnotation.TextProperty message definition below for more detail.
UTF-8 text detected on the pages.
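A sketch of walking that hierarchy, assuming `response` is an AnnotateImageResponse from a DOCUMENT_TEXT_DETECTION request::

    document = response.full_text_annotation
    for page in document.pages:
        for block in page.blocks:
            for paragraph in block.paragraphs:
                words = [
                    "".join(symbol.text for symbol in word.symbols)
                    for word in paragraph.words
                ]
                print(" ".join(words), paragraph.confidence)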
Timestamp: API documentation for the vision_v1p2beta1.types.Timestamp class.
UInt32Value: API documentation for the vision_v1p2beta1.types.UInt32Value class.
UInt64Value: API documentation for the vision_v1p2beta1.types.UInt64Value class.
Vertex: A vertex represents a 2D point in the image; vertex coordinates are in the same scale as the original image.
X coordinate.
WaitOperationRequest: API documentation for the vision_v1p2beta1.types.WaitOperationRequest class.
WebDetection: Relevant information for the image from the Internet.
Fully matching images from the Internet. Can include resized copies of the query image.
Web pages containing the matching images from the Internet.
Best guess text labels for the request image.
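A sketch of reading these fields, assuming `response` is an AnnotateImageResponse from a WEB_DETECTION request::

    web = response.web_detection
    for image_match in web.full_matching_images:
        print("full match:", image_match.url)
    for page in web.pages_with_matching_images:
        print("page:", page.url)
    for label in web.best_guess_labels:
        print("best guess:", label.label)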
WebDetectionParams: Parameters for web detection request.
Word: A word representation.
The bounding box for the word. The vertices are in the order of top-left, top-right, bottom-right, bottom-left. When a rotation of the bounding box is detected, the rotation is represented as around the top-left corner as defined when the text is read in the 'natural' orientation. For example, when the text is horizontal it might look like::

    0----1
    |    |
    3----2

When it's rotated 180 degrees around the top-left corner it becomes::

    2----3
    |    |
    1----0

and the vertex order will still be (0, 1, 2, 3).
Confidence of the OCR results for the word. Range [0, 1].