← AWS MLA-C01 — ML Engineer Associate

Domain 2C: AWS Managed AI Services

AWS Managed AI Services

Exam Domain: 2 — ML Model Development (26%) Task: Choose the right modeling approach — use managed services before building custom


When Managed vs Custom?

AWS AI Services Landscape

Decision Tree:
  "Can an AWS managed service solve this?"
       │
       YES → Use managed service (faster, cheaper, no ML expertise needed)
       │
       NO → "Can a SageMaker built-in algorithm solve this?"
              │
              YES → Use built-in algorithm
              │
              NO → Build custom model (bring your own container)

Exam tip: AWS always prefers the simplest, most managed solution. If a managed service can do it, that’s the answer.

Think of it this way: Choosing between managed services and custom models is like deciding whether to cook dinner. If you can buy a ready-made meal (managed service) that does the job, don’t spend hours cooking from scratch (custom model). AWS exam questions are designed to reward the person who reaches for the simplest tool first — the more managed, the better.


Natural Language Processing (NLP)

Amazon Comprehend

Purpose: Extract insights from text — no ML experience needed.

FeatureWhat It Does
Entity RecognitionPeople, places, organizations, dates, quantities
Sentiment AnalysisPositive, negative, neutral, mixed
Key Phrase ExtractionImportant phrases from text
Language DetectionIdentify language (100+ languages)
Topic ModelingDiscover topics across document collections
PII DetectionIdentify and redact personal information
Syntax AnalysisParts of speech (noun, verb, adjective)
Custom ClassificationTrain custom text classifiers
Custom Entity RecognitionTrain to find domain-specific entities
Comprehend Use Cases:
  Customer reviews   → Sentiment analysis
  Support tickets    → Topic modeling + classification
  Medical records    → Comprehend Medical (HIPAA-eligible)
  Legal documents    → Entity extraction + PII detection

ELI5: Comprehend is like having a super-fast reader processing thousands of reviews per second. Feed it a customer review and it reports back: “this person is angry (sentiment), they mention John Smith in New York (entities), they’re talking about returning a product (key phrases), and their phone number is in there (PII).” No ML expertise needed — it’s all pre-built.

Comprehend Medical — specialized for healthcare:

  • Extract medical conditions, medications, dosages, procedures
  • HIPAA-eligible
  • Works with ICD-10-CM, RxNorm ontologies

Amazon Translate

  • Neural machine translation (75+ languages)
  • Real-time and batch translation
  • Custom Terminology — force specific translations (brand names, technical terms)
  • Active Custom Translation — fine-tune for domain-specific language
  • Use case: website localization, multilingual chatbots

Amazon Transcribe

Purpose: Speech-to-text (ASR — Automatic Speech Recognition)

FeatureDetails
Streaming transcriptionReal-time audio → text
Batch transcriptionProcess audio files from S3
Custom VocabularyAdd domain terms (medical, legal)
Vocabulary FilteringRemove/mask unwanted words
Speaker DiarizationIdentify who said what
Channel IdentificationSeparate speakers on different audio channels
Automatic Language IDDetect language automatically
Call AnalyticsSentiment, talk time, interruptions, issues
Transcribe MedicalHIPAA-eligible medical transcription

ELI5: Transcribe is Siri’s brain without Siri’s personality. It converts audio to text and can even figure out who said what (Speaker Diarization). Think of a call center: one recording, two speakers, and Transcribe produces a labeled transcript — “Agent: How can I help? Customer: My order is missing.” That labeled transcript can then feed into Comprehend for sentiment analysis.

Amazon Polly

Purpose: Text-to-speech (TTS)

  • 60+ voices, 30+ languages
  • SSML support — control pronunciation, pauses, emphasis, speed
  • Lexicon — custom pronunciation for specific words
  • Neural TTS (NTTS) — more natural, human-like speech
  • Speech Marks — metadata for lip-syncing (viseme, word, sentence timings)
  • Use case: accessibility, voice applications, IVR systems

Computer Vision

Amazon Rekognition

Purpose: Image and video analysis.

FeatureWhat It Does
Object/Scene DetectionLabels objects, scenes, activities
Face Detection & AnalysisAge range, emotion, glasses, gender
Face ComparisonMatch faces across images
Face SearchSearch against a face collection
Celebrity RecognitionIdentify famous people
Text in ImageOCR for text within images
Content ModerationDetect unsafe/inappropriate content
Custom LabelsTrain custom image classifier (as few as 10 images)
Video AnalysisAll above + person tracking, path detection
PPE DetectionDetect personal protective equipment
Rekognition Custom Labels:
  You provide:  10+ labeled images per class
  AWS handles:  Transfer learning, training, hosting
  You get:      Custom image classification API
  Use case:     "Is this a defective part?" (manufacturing QA)

ELI5: Rekognition is like a security camera that also has a film degree. It identifies faces, detects whether someone’s wearing a hard hat (PPE Detection), reads text on road signs, flags inappropriate content, and can even recognize celebrities. The “Custom Labels” feature lets you teach it your own categories with as few as 10 images — useful for things like spotting defective parts on a factory line.

Amazon Textract

Purpose: Extract text, forms, and tables from documents.

FeatureDetails
Text DetectionRaw text extraction (OCR)
Form ExtractionKey-value pairs from forms
Table ExtractionStructured table data
Expense AnalysisReceipts and invoices
Identity DocumentsPassports, driver’s licenses
Lending AnalysisMortgage and loan documents
Layout AnalysisDocument structure (titles, headers, paragraphs)

Textract vs Comprehend: Textract extracts text from images/PDFs. Comprehend analyzes the meaning of text.

ELI5: Regular OCR just “sees” characters — it reads “Name: John” as four separate characters. Textract understands document structure: it knows “Name” is a form field label and “John” is its value. So you get structured data back ({"Name": "John"}), not just a wall of text. That’s the difference between reading and comprehending a form.


Conversational AI

Amazon Lex

Purpose: Build chatbots and voice bots (same tech as Alexa).

Lex Concepts:
┌──────────────────────────────────────────┐
│  Bot                                     │
│  ├─ Intent: "OrderPizza"                 │
│  │   ├─ Utterances: "I want a pizza"     │
│  │   │              "Order me a pizza"    │
│  │   ├─ Slots: {size}, {topping}, {crust}│
│  │   └─ Fulfillment: Lambda function     │
│  │                                       │
│  └─ Intent: "CheckOrderStatus"           │
│      ├─ Utterances: "Where is my order?" │
│      ├─ Slots: {orderId}                 │
│      └─ Fulfillment: Lambda function     │
└──────────────────────────────────────────┘
  • ASR (speech → text) + NLU (text → intent)
  • Slot types: built-in (date, number) and custom
  • Multi-turn conversation support
  • Integrates with: Lambda, Connect (call center), Cognito (auth)

ELI5: Lex is literally Alexa’s brain — the same technology powers both. An “intent” is what the user wants to do (order a pizza). “Utterances” are different ways they might say it (“I want a pizza”, “Order me a pizza”, “Get me a large pepperoni”). “Slots” are the details you need to collect to fulfill it: size, topping, crust. Once all slots are filled, a Lambda function does the actual work.


Personalization & Recommendations

Amazon Personalize

Purpose: Real-time personalized recommendations.

ConceptDetails
User-Item Interaction DataWho clicked/bought/watched what
Recipe TypesUser personalization, similar items, re-ranking
Real-time eventsUpdate recommendations as user interacts
FiltersExclude already-purchased, apply business rules
CampaignsDeployed recommendation endpoints
Personalize Data Inputs:
  1. Interactions dataset (required): user-item interactions
  2. Users dataset (optional): user demographics
  3. Items dataset (optional): item metadata

Output: Ranked list of recommended items per user

Use cases: e-commerce product suggestions, content recommendations, personalized search

ELI5: Personalize is the “customers who bought this also bought…” engine. Netflix recommendations, Amazon product suggestions — same underlying collaborative filtering technology, now available as an API you call with a user ID and get back a ranked list of items. You bring your interaction history; AWS handles all the model training and hosting.

Amazon Forecast (retired — but may still appear)

  • Time series forecasting using AutoML
  • Built-in algorithms: DeepAR+, CNN-QR, Prophet, ARIMA, NPTS, ETS
  • Use case: demand forecasting, inventory planning, financial forecasting

Note: Amazon Forecast was retired in 2024. For new projects, use SageMaker Canvas or DeepAR. May still appear in exam questions.


Document & Search Intelligence

Amazon Kendra

Purpose: Intelligent enterprise search using NLP.

  • Semantic search — understands meaning, not just keywords
  • Returns specific answers, not just document links
  • Connectors: S3, SharePoint, Salesforce, databases, websites
  • FAQ support — question-answer pairs
  • Access control integration (who can see what)
  • Use case: internal knowledge base search, customer self-service

Amazon Augmented AI (A2I)

Purpose: Human review workflow for ML predictions.

ML Prediction → Confidence Check
                    │
                    ├─ High confidence → Auto-approve
                    │
                    └─ Low confidence → Route to Human Reviewer
                                            │
                                            └─ Human reviews → Final decision
  • Built-in integrations: Textract, Rekognition
  • Custom: any ML model output
  • Workforce: Mechanical Turk, private team, or third-party
  • Use case: content moderation review, document verification

ELI5: A2I is the “phone a friend” lifeline for your ML model. When the model isn’t confident enough in its prediction, instead of guessing wrong, it routes the case to a human reviewer. Think of it like a junior doctor flagging uncertain X-rays for a specialist — automation handles the easy cases, humans handle the edge cases.


AI Assistants

Amazon Q Business

Purpose: Enterprise generative AI assistant.

FeatureDetails
Data connectors40+ sources (S3, SharePoint, Jira, Salesforce, Slack)
Access controlsRespects existing permissions (who can see what)
GuardrailsAdmin controls for topic blocking
PluginsCustom actions (create Jira tickets, query databases)
Q AppsBuild lightweight apps from Q Business

Amazon Q Developer

Purpose: AI coding assistant.

FeatureDetails
Code completionReal-time suggestions in IDE
Code generationGenerate functions from comments
Security scanningFind and fix vulnerabilities
Code transformationUpgrade Java versions, modernize code
IDE supportVS Code, IntelliJ, PyCharm, Cloud9
LanguagesPython, Java, JavaScript, TypeScript, C#, Go, and more

AI Hardware

AWS Custom Chips

ChipPurposeUse Case
AWS TrainiumTraining ML modelsCost-effective alternative to NVIDIA GPUs (up to 50% savings)
AWS InferentiaML inferenceHigh throughput, low latency inference (up to 70% cost savings)
AWS GravitonGeneral computeARM-based, not ML-specific but good for preprocessing
Instance mapping:
  Trainium   → ml.trn1 instances
  Inferentia → ml.inf1, ml.inf2 instances
  NVIDIA GPU → ml.p3 (V100), ml.p4d (A100), ml.g5 (A10G)

Quick Decision: Which Service?

You Need To…Use
Analyze text sentimentComprehend
Extract entities from medical textComprehend Medical
Translate textTranslate
Convert speech to textTranscribe
Convert text to speechPolly
Analyze images/videoRekognition
Extract text from documentsTextract
Build a chatbotLex
Personalized recommendationsPersonalize
Enterprise searchKendra
Human review of ML outputA2I (Augmented AI)
Enterprise AI assistantQ Business
AI coding assistantQ Developer
Custom image classification (few images)Rekognition Custom Labels
Custom text classificationComprehend Custom
Detect PII in textComprehend
Detect PII in S3Macie
Content moderation (images)Rekognition
Detect fraudFraud Detector