Chapter 8 Checkpoint
You've covered the modalities beyond text — seeing, generating images, hearing, speaking in real time, video, multimodal retrieval, and how to measure outputs you can't ==. This quiz checks that the load-bearing ideas stuck: image-token cost, OCR-free extraction, the diffusion mental model, the voice latency budget, frame sampling, shared embedding spaces, and judge-vs-human evaluation.
There are 12 questions in the bank — each visit picks 5 at random, so retaking gives you different ones. If you miss one, the result card tells you exactly which page to revisit.
You must pass (≥ 60%) to unlock the Next button and Chapter 9 in the sidebar.
Multimodal & Voice AI checkpoint
Pass to unlock the Next button belowWhat's next
→ Continue to Chapter 9: Solo / Indie AI — you now know the disciplines and specializations; next, see how it all assembles into real workflows at every team scale.