This is the repo for swyx's blog - Blog content is created in github issues, then posted on swyx.io as blog pages! Comment/watch to follow along my blog within GitHub
Face detection (within the photo, here is the location of the face)
OCR (read text from photos)
Explicit content detection (violence/adult)
Landmark detection (that's the Eiffel tower!)
Local Detection (not sure)
Other Cloud Vision features:
crop hints - suggested crop dimensions
web annotations - suggested other metadata to search about your page - eg from a photo of an iconic car, it can tell you the model of car, what film it was from, where it probably is. And can give you other matching images to back it up.
Cloud event trigger walkthrough
Cloud storage -> Cloud Functions -> Cloud vision API
NLP: extract entities from a sentence, sentiment analysis, syntax analysis (parse sentence to a lemma so you can see the parts of speech dependency graph)
Speech API
Speech to text transcription in 110 languages.
Azar - uses cloud speech api and cloud translation api to talk
Also gives timestamp of each word on top of transcript.
source: devto devToUrl: "https://dev.to/swyx/serverless-machine-learning-at-google-cp9" devToReactions: 9 devToReadingTime: 2 devToPublishedAt: "2017-12-02T22:25:33.493Z" devToViewsCount: 633
title: Serverless Machine Learning at Google published: true description: Serverless Machine Learning at Google tags: serverless, machine learning
Google can tell dogs from mops. Can you?
Bret McGowen presented on Serverless machine learning at Google. You can watch his full talk here but here are my notes.
Serverless
Four principles:
Serverless at Google:
Machine Learning
Machine learning is using many examples to answer questions.
Machine Learning at Google:
Specifics on capabilities of Cloud Vision API:
Other Cloud Vision features:
Cloud event trigger walkthrough
Cloud storage -> Cloud Functions -> Cloud vision API
NLP: extract entities from a sentence, sentiment analysis, syntax analysis (parse sentence to a lemma so you can see the parts of speech dependency graph)
Speech API
Speech to text transcription in 110 languages.
Azar - uses cloud speech api and cloud translation api to talk Also gives timestamp of each word on top of transcript.
Video Intelligence API
Look through the whole video to label things.
Links