Domain 2C: AWS Managed AI Services
AWS Managed AI Services
Exam Domain: 2 — ML Model Development (26%) Task: Choose the right modeling approach — use managed services before building custom
When Managed vs Custom?

Decision Tree:
"Can an AWS managed service solve this?"
│
YES → Use managed service (faster, cheaper, no ML expertise needed)
│
NO → "Can a SageMaker built-in algorithm solve this?"
│
YES → Use built-in algorithm
│
NO → Build custom model (bring your own container)
Exam tip: AWS always prefers the simplest, most managed solution. If a managed service can do it, that’s the answer.
Think of it this way: Choosing between managed services and custom models is like deciding whether to cook dinner. If you can buy a ready-made meal (managed service) that does the job, don’t spend hours cooking from scratch (custom model). AWS exam questions are designed to reward the person who reaches for the simplest tool first — the more managed, the better.
Natural Language Processing (NLP)
Amazon Comprehend
Purpose: Extract insights from text — no ML experience needed.
| Feature | What It Does |
|---|---|
| Entity Recognition | People, places, organizations, dates, quantities |
| Sentiment Analysis | Positive, negative, neutral, mixed |
| Key Phrase Extraction | Important phrases from text |
| Language Detection | Identify language (100+ languages) |
| Topic Modeling | Discover topics across document collections |
| PII Detection | Identify and redact personal information |
| Syntax Analysis | Parts of speech (noun, verb, adjective) |
| Custom Classification | Train custom text classifiers |
| Custom Entity Recognition | Train to find domain-specific entities |
Comprehend Use Cases:
Customer reviews → Sentiment analysis
Support tickets → Topic modeling + classification
Medical records → Comprehend Medical (HIPAA-eligible)
Legal documents → Entity extraction + PII detection
ELI5: Comprehend is like having a super-fast reader processing thousands of reviews per second. Feed it a customer review and it reports back: “this person is angry (sentiment), they mention John Smith in New York (entities), they’re talking about returning a product (key phrases), and their phone number is in there (PII).” No ML expertise needed — it’s all pre-built.
Comprehend Medical — specialized for healthcare:
- Extract medical conditions, medications, dosages, procedures
- HIPAA-eligible
- Works with ICD-10-CM, RxNorm ontologies
Amazon Translate
- Neural machine translation (75+ languages)
- Real-time and batch translation
- Custom Terminology — force specific translations (brand names, technical terms)
- Active Custom Translation — fine-tune for domain-specific language
- Use case: website localization, multilingual chatbots
Amazon Transcribe
Purpose: Speech-to-text (ASR — Automatic Speech Recognition)
| Feature | Details |
|---|---|
| Streaming transcription | Real-time audio → text |
| Batch transcription | Process audio files from S3 |
| Custom Vocabulary | Add domain terms (medical, legal) |
| Vocabulary Filtering | Remove/mask unwanted words |
| Speaker Diarization | Identify who said what |
| Channel Identification | Separate speakers on different audio channels |
| Automatic Language ID | Detect language automatically |
| Call Analytics | Sentiment, talk time, interruptions, issues |
| Transcribe Medical | HIPAA-eligible medical transcription |
ELI5: Transcribe is Siri’s brain without Siri’s personality. It converts audio to text and can even figure out who said what (Speaker Diarization). Think of a call center: one recording, two speakers, and Transcribe produces a labeled transcript — “Agent: How can I help? Customer: My order is missing.” That labeled transcript can then feed into Comprehend for sentiment analysis.
Amazon Polly
Purpose: Text-to-speech (TTS)
- 60+ voices, 30+ languages
- SSML support — control pronunciation, pauses, emphasis, speed
- Lexicon — custom pronunciation for specific words
- Neural TTS (NTTS) — more natural, human-like speech
- Speech Marks — metadata for lip-syncing (viseme, word, sentence timings)
- Use case: accessibility, voice applications, IVR systems
Computer Vision
Amazon Rekognition
Purpose: Image and video analysis.
| Feature | What It Does |
|---|---|
| Object/Scene Detection | Labels objects, scenes, activities |
| Face Detection & Analysis | Age range, emotion, glasses, gender |
| Face Comparison | Match faces across images |
| Face Search | Search against a face collection |
| Celebrity Recognition | Identify famous people |
| Text in Image | OCR for text within images |
| Content Moderation | Detect unsafe/inappropriate content |
| Custom Labels | Train custom image classifier (as few as 10 images) |
| Video Analysis | All above + person tracking, path detection |
| PPE Detection | Detect personal protective equipment |
Rekognition Custom Labels:
You provide: 10+ labeled images per class
AWS handles: Transfer learning, training, hosting
You get: Custom image classification API
Use case: "Is this a defective part?" (manufacturing QA)
ELI5: Rekognition is like a security camera that also has a film degree. It identifies faces, detects whether someone’s wearing a hard hat (PPE Detection), reads text on road signs, flags inappropriate content, and can even recognize celebrities. The “Custom Labels” feature lets you teach it your own categories with as few as 10 images — useful for things like spotting defective parts on a factory line.
Amazon Textract
Purpose: Extract text, forms, and tables from documents.
| Feature | Details |
|---|---|
| Text Detection | Raw text extraction (OCR) |
| Form Extraction | Key-value pairs from forms |
| Table Extraction | Structured table data |
| Expense Analysis | Receipts and invoices |
| Identity Documents | Passports, driver’s licenses |
| Lending Analysis | Mortgage and loan documents |
| Layout Analysis | Document structure (titles, headers, paragraphs) |
Textract vs Comprehend: Textract extracts text from images/PDFs. Comprehend analyzes the meaning of text.
ELI5: Regular OCR just “sees” characters — it reads “Name: John” as four separate characters. Textract understands document structure: it knows “Name” is a form field label and “John” is its value. So you get structured data back (
{"Name": "John"}), not just a wall of text. That’s the difference between reading and comprehending a form.
Conversational AI
Amazon Lex
Purpose: Build chatbots and voice bots (same tech as Alexa).
Lex Concepts:
┌──────────────────────────────────────────┐
│ Bot │
│ ├─ Intent: "OrderPizza" │
│ │ ├─ Utterances: "I want a pizza" │
│ │ │ "Order me a pizza" │
│ │ ├─ Slots: {size}, {topping}, {crust}│
│ │ └─ Fulfillment: Lambda function │
│ │ │
│ └─ Intent: "CheckOrderStatus" │
│ ├─ Utterances: "Where is my order?" │
│ ├─ Slots: {orderId} │
│ └─ Fulfillment: Lambda function │
└──────────────────────────────────────────┘
- ASR (speech → text) + NLU (text → intent)
- Slot types: built-in (date, number) and custom
- Multi-turn conversation support
- Integrates with: Lambda, Connect (call center), Cognito (auth)
ELI5: Lex is literally Alexa’s brain — the same technology powers both. An “intent” is what the user wants to do (order a pizza). “Utterances” are different ways they might say it (“I want a pizza”, “Order me a pizza”, “Get me a large pepperoni”). “Slots” are the details you need to collect to fulfill it: size, topping, crust. Once all slots are filled, a Lambda function does the actual work.
Personalization & Recommendations
Amazon Personalize
Purpose: Real-time personalized recommendations.
| Concept | Details |
|---|---|
| User-Item Interaction Data | Who clicked/bought/watched what |
| Recipe Types | User personalization, similar items, re-ranking |
| Real-time events | Update recommendations as user interacts |
| Filters | Exclude already-purchased, apply business rules |
| Campaigns | Deployed recommendation endpoints |
Personalize Data Inputs:
1. Interactions dataset (required): user-item interactions
2. Users dataset (optional): user demographics
3. Items dataset (optional): item metadata
Output: Ranked list of recommended items per user
Use cases: e-commerce product suggestions, content recommendations, personalized search
ELI5: Personalize is the “customers who bought this also bought…” engine. Netflix recommendations, Amazon product suggestions — same underlying collaborative filtering technology, now available as an API you call with a user ID and get back a ranked list of items. You bring your interaction history; AWS handles all the model training and hosting.
Amazon Forecast (retired — but may still appear)
- Time series forecasting using AutoML
- Built-in algorithms: DeepAR+, CNN-QR, Prophet, ARIMA, NPTS, ETS
- Use case: demand forecasting, inventory planning, financial forecasting
Note: Amazon Forecast was retired in 2024. For new projects, use SageMaker Canvas or DeepAR. May still appear in exam questions.
Document & Search Intelligence
Amazon Kendra
Purpose: Intelligent enterprise search using NLP.
- Semantic search — understands meaning, not just keywords
- Returns specific answers, not just document links
- Connectors: S3, SharePoint, Salesforce, databases, websites
- FAQ support — question-answer pairs
- Access control integration (who can see what)
- Use case: internal knowledge base search, customer self-service
Amazon Augmented AI (A2I)
Purpose: Human review workflow for ML predictions.
ML Prediction → Confidence Check
│
├─ High confidence → Auto-approve
│
└─ Low confidence → Route to Human Reviewer
│
└─ Human reviews → Final decision
- Built-in integrations: Textract, Rekognition
- Custom: any ML model output
- Workforce: Mechanical Turk, private team, or third-party
- Use case: content moderation review, document verification
ELI5: A2I is the “phone a friend” lifeline for your ML model. When the model isn’t confident enough in its prediction, instead of guessing wrong, it routes the case to a human reviewer. Think of it like a junior doctor flagging uncertain X-rays for a specialist — automation handles the easy cases, humans handle the edge cases.
AI Assistants
Amazon Q Business
Purpose: Enterprise generative AI assistant.
| Feature | Details |
|---|---|
| Data connectors | 40+ sources (S3, SharePoint, Jira, Salesforce, Slack) |
| Access controls | Respects existing permissions (who can see what) |
| Guardrails | Admin controls for topic blocking |
| Plugins | Custom actions (create Jira tickets, query databases) |
| Q Apps | Build lightweight apps from Q Business |
Amazon Q Developer
Purpose: AI coding assistant.
| Feature | Details |
|---|---|
| Code completion | Real-time suggestions in IDE |
| Code generation | Generate functions from comments |
| Security scanning | Find and fix vulnerabilities |
| Code transformation | Upgrade Java versions, modernize code |
| IDE support | VS Code, IntelliJ, PyCharm, Cloud9 |
| Languages | Python, Java, JavaScript, TypeScript, C#, Go, and more |
AI Hardware
AWS Custom Chips
| Chip | Purpose | Use Case |
|---|---|---|
| AWS Trainium | Training ML models | Cost-effective alternative to NVIDIA GPUs (up to 50% savings) |
| AWS Inferentia | ML inference | High throughput, low latency inference (up to 70% cost savings) |
| AWS Graviton | General compute | ARM-based, not ML-specific but good for preprocessing |
Instance mapping:
Trainium → ml.trn1 instances
Inferentia → ml.inf1, ml.inf2 instances
NVIDIA GPU → ml.p3 (V100), ml.p4d (A100), ml.g5 (A10G)
Quick Decision: Which Service?
| You Need To… | Use |
|---|---|
| Analyze text sentiment | Comprehend |
| Extract entities from medical text | Comprehend Medical |
| Translate text | Translate |
| Convert speech to text | Transcribe |
| Convert text to speech | Polly |
| Analyze images/video | Rekognition |
| Extract text from documents | Textract |
| Build a chatbot | Lex |
| Personalized recommendations | Personalize |
| Enterprise search | Kendra |
| Human review of ML output | A2I (Augmented AI) |
| Enterprise AI assistant | Q Business |
| AI coding assistant | Q Developer |
| Custom image classification (few images) | Rekognition Custom Labels |
| Custom text classification | Comprehend Custom |
| Detect PII in text | Comprehend |
| Detect PII in S3 | Macie |
| Content moderation (images) | Rekognition |
| Detect fraud | Fraud Detector |