Domain 2C: AWS Managed AI Services

8 min read 1668 words

AWS Managed AI Services

Exam Domain: 2 — ML Model Development (26%) Task: Choose the right modeling approach — use managed services before building custom

When Managed vs Custom?

AWS AI Services Landscape

Decision Tree:
  "Can an AWS managed service solve this?"
       │
       YES → Use managed service (faster, cheaper, no ML expertise needed)
       │
       NO → "Can a SageMaker built-in algorithm solve this?"
              │
              YES → Use built-in algorithm
              │
              NO → Build custom model (bring your own container)

Exam tip: AWS always prefers the simplest, most managed solution. If a managed service can do it, that’s the answer.

Think of it this way: Choosing between managed services and custom models is like deciding whether to cook dinner. If you can buy a ready-made meal (managed service) that does the job, don’t spend hours cooking from scratch (custom model). AWS exam questions are designed to reward the person who reaches for the simplest tool first — the more managed, the better.

Natural Language Processing (NLP)

Amazon Comprehend

Purpose: Extract insights from text — no ML experience needed.

Feature	What It Does
Entity Recognition	People, places, organizations, dates, quantities
Sentiment Analysis	Positive, negative, neutral, mixed
Key Phrase Extraction	Important phrases from text
Language Detection	Identify language (100+ languages)
Topic Modeling	Discover topics across document collections
PII Detection	Identify and redact personal information
Syntax Analysis	Parts of speech (noun, verb, adjective)
Custom Classification	Train custom text classifiers
Custom Entity Recognition	Train to find domain-specific entities

Comprehend Use Cases:
  Customer reviews   → Sentiment analysis
  Support tickets    → Topic modeling + classification
  Medical records    → Comprehend Medical (HIPAA-eligible)
  Legal documents    → Entity extraction + PII detection

ELI5: Comprehend is like having a super-fast reader processing thousands of reviews per second. Feed it a customer review and it reports back: “this person is angry (sentiment), they mention John Smith in New York (entities), they’re talking about returning a product (key phrases), and their phone number is in there (PII).” No ML expertise needed — it’s all pre-built.

Comprehend Medical — specialized for healthcare:

Extract medical conditions, medications, dosages, procedures
HIPAA-eligible
Works with ICD-10-CM, RxNorm ontologies

Amazon Translate

Neural machine translation (75+ languages)
Real-time and batch translation
Custom Terminology — force specific translations (brand names, technical terms)
Active Custom Translation — fine-tune for domain-specific language
Use case: website localization, multilingual chatbots

Amazon Transcribe

Purpose: Speech-to-text (ASR — Automatic Speech Recognition)

Feature	Details
Streaming transcription	Real-time audio → text
Batch transcription	Process audio files from S3
Custom Vocabulary	Add domain terms (medical, legal)
Vocabulary Filtering	Remove/mask unwanted words
Speaker Diarization	Identify who said what
Channel Identification	Separate speakers on different audio channels
Automatic Language ID	Detect language automatically
Call Analytics	Sentiment, talk time, interruptions, issues
Transcribe Medical	HIPAA-eligible medical transcription

ELI5: Transcribe is Siri’s brain without Siri’s personality. It converts audio to text and can even figure out who said what (Speaker Diarization). Think of a call center: one recording, two speakers, and Transcribe produces a labeled transcript — “Agent: How can I help? Customer: My order is missing.” That labeled transcript can then feed into Comprehend for sentiment analysis.

Amazon Polly

Purpose: Text-to-speech (TTS)

60+ voices, 30+ languages
SSML support — control pronunciation, pauses, emphasis, speed
Lexicon — custom pronunciation for specific words
Neural TTS (NTTS) — more natural, human-like speech
Speech Marks — metadata for lip-syncing (viseme, word, sentence timings)
Use case: accessibility, voice applications, IVR systems

Computer Vision

Amazon Rekognition

Purpose: Image and video analysis.

Feature	What It Does
Object/Scene Detection	Labels objects, scenes, activities
Face Detection & Analysis	Age range, emotion, glasses, gender
Face Comparison	Match faces across images
Face Search	Search against a face collection
Celebrity Recognition	Identify famous people
Text in Image	OCR for text within images
Content Moderation	Detect unsafe/inappropriate content
Custom Labels	Train custom image classifier (as few as 10 images)
Video Analysis	All above + person tracking, path detection
PPE Detection	Detect personal protective equipment

Rekognition Custom Labels:
  You provide:  10+ labeled images per class
  AWS handles:  Transfer learning, training, hosting
  You get:      Custom image classification API
  Use case:     "Is this a defective part?" (manufacturing QA)

ELI5: Rekognition is like a security camera that also has a film degree. It identifies faces, detects whether someone’s wearing a hard hat (PPE Detection), reads text on road signs, flags inappropriate content, and can even recognize celebrities. The “Custom Labels” feature lets you teach it your own categories with as few as 10 images — useful for things like spotting defective parts on a factory line.

Amazon Textract

Purpose: Extract text, forms, and tables from documents.

Feature	Details
Text Detection	Raw text extraction (OCR)
Form Extraction	Key-value pairs from forms
Table Extraction	Structured table data
Expense Analysis	Receipts and invoices
Identity Documents	Passports, driver’s licenses
Lending Analysis	Mortgage and loan documents
Layout Analysis	Document structure (titles, headers, paragraphs)

Textract vs Comprehend: Textract extracts text from images/PDFs. Comprehend analyzes the meaning of text.

ELI5: Regular OCR just “sees” characters — it reads “Name: John” as four separate characters. Textract understands document structure: it knows “Name” is a form field label and “John” is its value. So you get structured data back ({"Name": "John"}), not just a wall of text. That’s the difference between reading and comprehending a form.

Conversational AI

Amazon Lex

Purpose: Build chatbots and voice bots (same tech as Alexa).

Lex Concepts:
┌──────────────────────────────────────────┐
│  Bot                                     │
│  ├─ Intent: "OrderPizza"                 │
│  │   ├─ Utterances: "I want a pizza"     │
│  │   │              "Order me a pizza"    │
│  │   ├─ Slots: {size}, {topping}, {crust}│
│  │   └─ Fulfillment: Lambda function     │
│  │                                       │
│  └─ Intent: "CheckOrderStatus"           │
│      ├─ Utterances: "Where is my order?" │
│      ├─ Slots: {orderId}                 │
│      └─ Fulfillment: Lambda function     │
└──────────────────────────────────────────┘

ASR (speech → text) + NLU (text → intent)
Slot types: built-in (date, number) and custom
Multi-turn conversation support
Integrates with: Lambda, Connect (call center), Cognito (auth)

ELI5: Lex is literally Alexa’s brain — the same technology powers both. An “intent” is what the user wants to do (order a pizza). “Utterances” are different ways they might say it (“I want a pizza”, “Order me a pizza”, “Get me a large pepperoni”). “Slots” are the details you need to collect to fulfill it: size, topping, crust. Once all slots are filled, a Lambda function does the actual work.

Personalization & Recommendations

Amazon Personalize

Purpose: Real-time personalized recommendations.

Concept	Details
User-Item Interaction Data	Who clicked/bought/watched what
Recipe Types	User personalization, similar items, re-ranking
Real-time events	Update recommendations as user interacts
Filters	Exclude already-purchased, apply business rules
Campaigns	Deployed recommendation endpoints

Personalize Data Inputs:
  1. Interactions dataset (required): user-item interactions
  2. Users dataset (optional): user demographics
  3. Items dataset (optional): item metadata

Output: Ranked list of recommended items per user

Use cases: e-commerce product suggestions, content recommendations, personalized search

ELI5: Personalize is the “customers who bought this also bought…” engine. Netflix recommendations, Amazon product suggestions — same underlying collaborative filtering technology, now available as an API you call with a user ID and get back a ranked list of items. You bring your interaction history; AWS handles all the model training and hosting.

Amazon Forecast (retired — but may still appear)

Time series forecasting using AutoML
Built-in algorithms: DeepAR+, CNN-QR, Prophet, ARIMA, NPTS, ETS
Use case: demand forecasting, inventory planning, financial forecasting

Note: Amazon Forecast was retired in 2024. For new projects, use SageMaker Canvas or DeepAR. May still appear in exam questions.

Document & Search Intelligence

Amazon Kendra

Purpose: Intelligent enterprise search using NLP.

Semantic search — understands meaning, not just keywords
Returns specific answers, not just document links
Connectors: S3, SharePoint, Salesforce, databases, websites
FAQ support — question-answer pairs
Access control integration (who can see what)
Use case: internal knowledge base search, customer self-service

Amazon Augmented AI (A2I)

Purpose: Human review workflow for ML predictions.

ML Prediction → Confidence Check
                    │
                    ├─ High confidence → Auto-approve
                    │
                    └─ Low confidence → Route to Human Reviewer
                                            │
                                            └─ Human reviews → Final decision

Built-in integrations: Textract, Rekognition
Custom: any ML model output
Workforce: Mechanical Turk, private team, or third-party
Use case: content moderation review, document verification

ELI5: A2I is the “phone a friend” lifeline for your ML model. When the model isn’t confident enough in its prediction, instead of guessing wrong, it routes the case to a human reviewer. Think of it like a junior doctor flagging uncertain X-rays for a specialist — automation handles the easy cases, humans handle the edge cases.

AI Assistants

Amazon Q Business

Purpose: Enterprise generative AI assistant.

Feature	Details
Data connectors	40+ sources (S3, SharePoint, Jira, Salesforce, Slack)
Access controls	Respects existing permissions (who can see what)
Guardrails	Admin controls for topic blocking
Plugins	Custom actions (create Jira tickets, query databases)
Q Apps	Build lightweight apps from Q Business

Amazon Q Developer

Purpose: AI coding assistant.

Feature	Details
Code completion	Real-time suggestions in IDE
Code generation	Generate functions from comments
Security scanning	Find and fix vulnerabilities
Code transformation	Upgrade Java versions, modernize code
IDE support	VS Code, IntelliJ, PyCharm, Cloud9
Languages	Python, Java, JavaScript, TypeScript, C#, Go, and more

AI Hardware

AWS Custom Chips

Chip	Purpose	Use Case
AWS Trainium	Training ML models	Cost-effective alternative to NVIDIA GPUs (up to 50% savings)
AWS Inferentia	ML inference	High throughput, low latency inference (up to 70% cost savings)
AWS Graviton	General compute	ARM-based, not ML-specific but good for preprocessing

Instance mapping:
  Trainium   → ml.trn1 instances
  Inferentia → ml.inf1, ml.inf2 instances
  NVIDIA GPU → ml.p3 (V100), ml.p4d (A100), ml.g5 (A10G)

Quick Decision: Which Service?

You Need To…	Use
Analyze text sentiment	Comprehend
Extract entities from medical text	Comprehend Medical
Translate text	Translate
Convert speech to text	Transcribe
Convert text to speech	Polly
Analyze images/video	Rekognition
Extract text from documents	Textract
Build a chatbot	Lex
Personalized recommendations	Personalize
Enterprise search	Kendra
Human review of ML output	A2I (Augmented AI)
Enterprise AI assistant	Q Business
AI coding assistant	Q Developer
Custom image classification (few images)	Rekognition Custom Labels
Custom text classification	Comprehend Custom
Detect PII in text	Comprehend
Detect PII in S3	Macie
Content moderation (images)	Rekognition
Detect fraud	Fraud Detector

AWS Managed AI Services#

When Managed vs Custom?#

Natural Language Processing (NLP)#

Amazon Comprehend#

Amazon Translate#

Amazon Transcribe#

Amazon Polly#

Computer Vision#

Amazon Rekognition#

Amazon Textract#

Conversational AI#

Amazon Lex#

Personalization & Recommendations#

Amazon Personalize#

Amazon Forecast (retired — but may still appear)#

Document & Search Intelligence#

Amazon Kendra#

Amazon Augmented AI (A2I)#

AI Assistants#

Amazon Q Business#

Amazon Q Developer#

AI Hardware#

AWS Custom Chips#

Quick Decision: Which Service?#