Chapter 8 Checkpoint

You've covered the modalities beyond text: seeing, generating images, hearing, speaking in real time, video, multimodal retrieval, and how to measure outputs you can't check with a simple equality test (==). This quiz checks that the load-bearing ideas stuck: image-token cost, OCR-free extraction, the diffusion mental model, the voice latency budget, frame sampling, shared embedding spaces, and judge-vs-human evaluation.

The bank holds 12 questions and each visit picks 5 at random, so a retake will usually give you a different set. If you miss one, the result card tells you exactly which page to revisit.

You must pass (≥ 60%) to unlock the Next button and Chapter 9 in the sidebar.

Required checkpoint

Multimodal & Voice AI checkpoint

What's next

→ Continue to Chapter 9: Solo / Indie AI — you now know the disciplines and specializations; next, see how it all assembles into real workflows at every team scale.