Machine learning APIs for Google Cloud Platform

Google Cloud Platform (GCP) is considered one of the Big 3 cloud platforms, alongside Microsoft Azure and AWS. It is a widely used cloud solution whose AI capabilities let you design and develop smart models that turn your data into insights at an affordable cost.

(The following excerpt is taken from the book ‘Cloud Analytics with Google Cloud Platform‘ authored by Sanket Thodge.)

GCP offers many machine learning APIs. Here we take a look at the three most popular:

Cloud Speech API

This powerful API from GCP converts speech to text using a neural network model. It recognizes over 100 languages and can filter unwanted noise from audio, and inappropriate content from the resulting text, in a variety of environments. It supports context-aware recognition and works on any device and any platform, including IoT. Its features include Automatic Speech Recognition (ASR), Global Vocabulary, Streaming Recognition, Word Hints, Real-Time Audio support, Noise Robustness, Inappropriate Content Filtering, and support for integration with other GCP APIs.

The architecture of the Cloud Speech API is as follows:

In other words, this model uses ML to convert speech to text.

The components used by the Speech API are:

  • REST API or Google Remote Procedure Call (gRPC) API
  • Google Cloud Client Library
  • JSON API
  • Python
  • Cloud DataLab
  • Cloud Data Storage
  • Cloud Endpoints
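
As a rough illustration of how the REST and JSON pieces above fit together, the sketch below builds the JSON body for a synchronous recognition request. The endpoint and field names follow the public v1 REST interface as we understand it. The helper name build_recognize_request, the audio encoding, the sample rate, and the phrase hint are assumptions made for the example, and no request is actually sent.

```python
import base64
import json

# Public v1 endpoint for synchronous recognition. Calling it requires an API
# key or OAuth token; this sketch only builds the request body.
SPEECH_ENDPOINT = "https://speech.googleapis.com/v1/speech:recognize"


def build_recognize_request(audio_bytes, language_code="en-US", hints=None):
    """Return a JSON-serializable body for a speech:recognize call.

    build_recognize_request is a hypothetical helper for this article,
    not part of any Google client library.
    """
    config = {
        "encoding": "LINEAR16",       # assumed: raw 16-bit PCM audio
        "sampleRateHertz": 16000,     # assumed sample rate
        "languageCode": language_code,
        "profanityFilter": True,      # inappropriate content filtering
    }
    if hints:
        # Word hints bias recognition toward phrases we expect to hear.
        config["speechContexts"] = [{"phrases": list(hints)}]
    return {
        "config": config,
        # Short audio clips are sent inline as base64 text.
        "audio": {"content": base64.b64encode(audio_bytes).decode("ascii")},
    }


# Dummy bytes stand in for a real audio clip.
body = build_recognize_request(b"\x00\x01" * 8, hints=["GovCloud"])
print(json.dumps(body, indent=2))
```

In a real application the Google Cloud Client Library would normally build and send this request for you; the raw body is shown only to make the moving parts visible.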

The applications of the model include:

  • Voice user interfaces
  • Domotic appliance control
  • Preparation of structured documents
  • Aircraft / direct voice outputs
  • Speech to text processing
  • Telecommunication

Usage is billed in 15-second increments. The first 60 minutes per month are free of charge; beyond that, usage is charged at $0.006 per 15 seconds.
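
The pricing rule can be turned into a small calculation. The sketch below assumes one reading of it: each request is rounded up to whole 15-second increments, the first 60 minutes (240 increments) in a month are free, and every further increment costs $0.006. The function name and the sample workload are made up for illustration, so check the current price list before relying on the numbers.

```python
import math

INCREMENT_SECONDS = 15                            # assumed billing increment
FREE_INCREMENTS = 60 * 60 // INCREMENT_SECONDS    # 60 free minutes = 240 increments
PRICE_PER_INCREMENT = 0.006                       # USD, the figure quoted above


def monthly_speech_cost(request_durations_seconds):
    """Estimate one month's cost from a list of per-request audio lengths."""
    # Each request is rounded up to a whole number of increments.
    increments = sum(
        math.ceil(seconds / INCREMENT_SECONDS)
        for seconds in request_durations_seconds
    )
    # Only increments beyond the free allowance are billed.
    billable = max(0, increments - FREE_INCREMENTS)
    return round(billable * PRICE_PER_INCREMENT, 2)


# Hypothetical workload: 500 requests of 40 seconds each.
print(monthly_speech_cost([40] * 500))
```

Because of the rounding, many short requests cost more than the same amount of audio sent in fewer, longer requests.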

Now that we have covered the concepts and applications of the model, let’s look at some use cases where it can be implemented:

  • Solving crimes with voice recognition: AGNITIO, a voice biometrics specialist, partnered with Morpho (Safran) to bring Voice ID technology into its multimodal suite of criminal identification products.
  • Buying products and services with the sound of your voice: Mobile payments are among the most popular and mainstream applications of biometrics in general, and voice recognition has made its way into this highly competitive arena.
  • A hands-free AI assistant that knows who you are: Nearly every mobile phone nowadays ships with voice recognition software built on machine learning algorithms.

Cloud Translation API

Machine Translation (MT) is a branch of natural language processing (NLP), which is itself a part of artificial intelligence, and it has been a main focus of the NLP community for many years. MT deals with translating text in a source language into text in a target language. The Cloud Translation API provides a simple programmatic interface for translating an input string from one language into a target language. It is highly responsive, scalable, and dynamic in nature.

This API enables translation among 100+ languages and can automatically detect the source language with high accuracy. It can also translate the contents of a web page directly, so the text need not be extracted from the document first. The Translation API offers programmatic access, text translation, language detection, continuous updates, adjustable quota, and affordable pricing.

The following image shows the architecture of the translation model:

In other words, the Cloud Translation API is an adaptive machine translation algorithm.

The components used by this model are:

  • REST API
  • Cloud DataLab
  • Cloud data storage
  • Python, Ruby
  • Clients Library
  • Cloud Endpoints
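
To make the REST component concrete, here is a minimal sketch of the body for a v2 translate request. The endpoint and the q, target, source, and format fields are those of the public v2 REST interface as we understand it. The helper name build_translate_request and the sample strings are assumptions for illustration, and the request is only built, not sent.

```python
import json

# Public v2 endpoint. Calling it requires an API key or OAuth token;
# this sketch only builds the request body.
TRANSLATE_ENDPOINT = "https://translation.googleapis.com/language/translate/v2"


def build_translate_request(texts, target, source=None, is_html=False):
    """Return a JSON-serializable body for a translate call.

    build_translate_request is a hypothetical helper for this article.
    """
    body = {
        "q": list(texts),                         # one or more strings to translate
        "target": target,                         # language code, e.g. "es"
        "format": "html" if is_html else "text",  # HTML markup can be passed as-is
    }
    if source is not None:
        # Leaving the source out asks the service to detect the language.
        body["source"] = source
    return body


body = build_translate_request(["Hello, world"], target="es")
print(json.dumps(body, indent=2))
```

Omitting source is what exercises the automatic language detection described earlier.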

The most important application of the model is the conversion of a regional language to a foreign language.

The cost of text translation and language detection is $20 per 1 million characters.
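
As a quick worked example of that rate, the sketch below estimates the cost of a batch of strings. It assumes every character, including whitespace and punctuation, counts toward the total. The function name and the sample batch are illustrative only.

```python
PRICE_PER_MILLION_CHARS = 20.00   # USD, the rate quoted above


def translation_cost(texts):
    """Estimate the cost in USD of translating the given strings."""
    # Assumption: every character is billable, whitespace included.
    total_chars = sum(len(text) for text in texts)
    return total_chars * PRICE_PER_MILLION_CHARS / 1_000_000


# Hypothetical batch: 2,000 product descriptions of 250 characters each.
print(translation_cost(["x" * 250] * 2000))
```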

Use cases

Now, as we have learned about the concepts and applications of the API, let’s learn two use cases where it has been successfully implemented:

  • Rule-based Machine Translation
  • Local Tissue Response to Injury and Trauma

We will discuss each of these use cases in the following sections.

Rule-based Machine Translation

The steps to implement rule-based Machine Translation successfully are as follows:

  1. Input text
  2. Parsing
  3. Tokenization
  4. Apply the rules to extract the meaning of each prepositional phrase
  5. Map each word of the input language to the corresponding word of the target language
  6. Frame the sentence in the target language
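
The steps above can be sketched with a deliberately tiny English-to-Spanish translator. This is a toy under stated assumptions, not how the Cloud Translation API works internally: the lexicon is a hypothetical six-word dictionary, and the prepositional-phrase rules of step 4 are replaced by a single reordering rule (an adjective moves after its noun).

```python
import re

# Hypothetical toy lexicon: source word -> (part of speech, target word).
LEXICON = {
    "the": ("DET", "el"),
    "red": ("ADJ", "rojo"),
    "book": ("NOUN", "libro"),
    "is": ("VERB", "está"),
    "on": ("PREP", "en"),
    "floor": ("NOUN", "suelo"),
}


def tokenize(text):
    """Steps 1-3: take the input text and split it into lowercase words."""
    return re.findall(r"[a-z']+", text.lower())


def translate(text):
    # Step 5: look up each token; unknown words pass through unchanged.
    tagged = [LEXICON.get(token, ("UNK", token)) for token in tokenize(text)]

    # Step 4 (simplified): one structural rule, ADJ NOUN becomes NOUN ADJ.
    i = 0
    while i < len(tagged) - 1:
        if tagged[i][0] == "ADJ" and tagged[i + 1][0] == "NOUN":
            tagged[i], tagged[i + 1] = tagged[i + 1], tagged[i]
            i += 2
        else:
            i += 1

    # Step 6: frame the target sentence.
    sentence = " ".join(word for _, word in tagged)
    return sentence[:1].upper() + sentence[1:]


print(translate("The red book is on the floor."))
```

Real rule-based systems need far larger dictionaries plus agreement and word-sense rules, which is one reason statistical and neural approaches took over.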

Local tissue response to injury and trauma

We can learn about the Machine Translation process from the responses of a local tissue to injuries and trauma. The human body follows a process similar to Machine Translation when dealing with injuries. We can roughly describe the process as follows:

  1. Hemorrhaging from lesioned vessels and blood clotting
  2. Blood-borne physiological components, leaking from the usually closed sanguineous compartment, are recognized as foreign material by the surrounding tissue since they are not tissue-specific
  3. Inflammatory response mediated by macrophages (and more rarely by foreign-body giant cells)
  4. Resorption of blood clot
  5. Ingrowth of blood vessels and fibroblasts, and the formation of granulation tissue
  6. Deposition of an unspecific but biocompatible type of repair (scar) tissue by fibroblasts

Cloud Vision API

Cloud Vision API is a powerful image analysis tool that enables users to understand the content of an image. It finds various attributes or categories of an image, such as labels, web, text, document, properties, and safe search, and returns them for that image as JSON. The labels field contains many sub-categories, such as text, line, font, area, graphics, screenshots, and points. The web field covers how much of the image is graphics, what percentage is text or empty area, and whether the image is partially or fully matched anywhere on the web.

The document field consists of blocks of the image with detailed descriptions, and the properties field shows the colors used in the image. Safe search flags any unwanted or inappropriate content in the image. The main features of this API are label detection, explicit content detection, logo and landmark detection, face detection, and web detection. To extract text, the API uses Optical Character Recognition (OCR), which is supported for many languages. It does not support face recognition.

The architecture for the Cloud Vision API is as follows:

We can summarize the functionality of the API as extracting quantitative information from images: it takes an image as input and returns numbers and text as output.

The components used in the API are:

  • Client Library
  • REST API
  • RPC API
  • OCR Language Support
  • Cloud Storage
  • Cloud Endpoints
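
To show how these components meet in practice, the sketch below builds the JSON body for an images:annotate request over REST. The endpoint and the feature type names are those of the public v1 interface as we understand it. The helper name build_annotate_request, the default feature selection, and the dummy image bytes are assumptions for illustration, and nothing is sent over the network.

```python
import base64
import json

# Public v1 endpoint. Calling it requires an API key or OAuth token;
# this sketch only builds the request body.
VISION_ENDPOINT = "https://vision.googleapis.com/v1/images:annotate"

# Feature types matching the detections described in this section.
FEATURE_TYPES = (
    "LABEL_DETECTION", "TEXT_DETECTION", "DOCUMENT_TEXT_DETECTION",
    "FACE_DETECTION", "LANDMARK_DETECTION", "LOGO_DETECTION",
    "SAFE_SEARCH_DETECTION", "IMAGE_PROPERTIES", "WEB_DETECTION",
)


def build_annotate_request(image_bytes,
                           features=("LABEL_DETECTION", "TEXT_DETECTION"),
                           max_results=5):
    """Return a JSON-serializable body asking for the given detections.

    build_annotate_request is a hypothetical helper for this article.
    """
    unknown = set(features) - set(FEATURE_TYPES)
    if unknown:
        raise ValueError(f"unsupported feature types: {sorted(unknown)}")
    return {
        "requests": [{
            # The image is sent inline as base64 text.
            "image": {"content": base64.b64encode(image_bytes).decode("ascii")},
            "features": [
                {"type": name, "maxResults": max_results} for name in features
            ],
        }]
    }


# Dummy bytes stand in for a real image file.
print(json.dumps(build_annotate_request(b"\x89PNG"), indent=2))
```

Several detections can be requested for the same image in one call, which keeps round trips down on slow connections.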

Applications of the API include:

  • Industrial Robotics
  • Cartography
  • Geology
  • Forensics and Military
  • Medical and Healthcare

Cost: Free of charge for the first 1,000 units per month; after that, pay as you go.

Use cases

This technique can be successfully implemented in:

  • Image detection using an Android or iOS mobile device
  • Retinal Image Analysis (Ophthalmology)

We will discuss each of these use cases in the following topics.

Image detection using Android or iOS mobile device

Cloud Vision API can be successfully implemented to detect images using your smartphone. The steps to do this are simple:

  1. Input the image
  2. Run the Cloud Vision API
  3. Execute the methods for detecting Face, Label, Text, Web, and Document properties
  4. Generate the response in the form of a phrase or string
  5. Populate a text view with the image details
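
Steps 4 and 5 can be sketched as a small function that turns one annotation response into the string a mobile app would place in its text view. The field names (labelAnnotations, textAnnotations, faceAnnotations, webDetection) follow the JSON the API returns as we understand it. The function name and the sample response are invented for illustration.

```python
def summarize_annotations(response):
    """Turn one annotation response (a dict) into display text."""
    lines = []

    labels = response.get("labelAnnotations", [])
    if labels:
        parts = [f"{item['description']} ({item['score']:.0%})" for item in labels]
        lines.append("Labels: " + ", ".join(parts))

    texts = response.get("textAnnotations", [])
    if texts:
        # The first text annotation carries the full detected text.
        lines.append("Text: " + texts[0]["description"].strip())

    faces = response.get("faceAnnotations", [])
    if faces:
        lines.append(f"Faces detected: {len(faces)}")

    entities = response.get("webDetection", {}).get("webEntities", [])
    names = [entity["description"] for entity in entities if entity.get("description")]
    if names:
        lines.append("Web: " + ", ".join(names))

    return "\n".join(lines) if lines else "No details detected"


# Hypothetical response for a photo of a street sign.
sample = {
    "labelAnnotations": [
        {"description": "sign", "score": 0.98},
        {"description": "street", "score": 0.91},
    ],
    "textAnnotations": [{"description": "STOP\n"}, {"description": "STOP"}],
}
print(summarize_annotations(sample))
```

On Android or iOS the same logic would be written in Kotlin, Java, or Swift; Python is used here only to keep the example short.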

Retinal Image Analysis – ophthalmology

Similarly, the API can also be used to analyze retinal images. The steps to implement this are as follows:

  1. Input the images of an eye
  2. Estimate the retinal biomarkers
  3. Process the image to remove the affected portion without losing necessary information
  4. Identify the location of specific structures
  5. Identify the boundaries of the object
  6. Find similar regions in two or more images
  7. Quantify the damaged portion of the retina in the image

You can learn a lot more about the machine learning capabilities of GCP on their official documentation page.

If you found the above excerpt useful, make sure you check out our book ‘Cloud Analytics with Google Cloud Platform‘ for more information on why GCP is a top cloud solution for machine learning and AI.

( This sponsored post is part of a series designed to highlight recently published Packt books about leading technologies and software applications. The opinions expressed are solely those of the author and do not represent the views of GovCloud Network, GovCloud Network Partners.)
