What is a token, and why does it affect AI cost?
Understand tokenisation and pricing basics before using an API.
100-page programme
The full 100-page programme is complete. All 100 briefs have been drafted, editorially reviewed, and integrated into the live prototype. This page is the accountable record of every brief in the programme — use it to verify coverage, not as a working queue.
48 briefs
Concepts, ideas, explainers, comparisons and knowledge snippets.
Understand tokenisation and pricing basics before using an API.
Decide whether long-context models solve a product problem.
Configure generation settings for consistency or creativity.
Understand embeddings before buying a vector database.
Diagnose why an AI feature feels slow.
Plan capacity before launching an AI feature.
Explain unexpected token usage and cost spikes.
Understand prompt hierarchy and instruction conflicts.
Get valid machine-readable outputs from LLMs.
Assess whether multimodal AI fits a workflow.
Understand what kind of AI work a project actually needs.
Interpret model-size claims and open-model labels.
Learn what lm-eval-harness can and cannot test.
Understand holistic model evaluation.
Choose an evaluation tool for prompts or models.
Diagnose poor RAG answers.
Decide whether to generate test cases with LLMs.
Design a review process for generated outputs.
Use another model to grade model outputs.
Interpret coding model claims.
Understand tool-use and agent benchmark claims.
Evaluate long-context model claims.
Understand benchmark trust issues.
Use public leaderboards without overtrusting them.
Compare models for a specific business use case.
Understand how AI API bills are calculated.
Reduce repeated prompt costs.
Process non-urgent AI jobs cheaply.
Decide whether local/self-hosted inference saves money.
Understand GPU hosting options for open models.
Compare fine-tuning with prompting/RAG from a cost perspective.
Estimate total cost of a document QA system.
Model AI feature profitability.
Cut costs from verbose model outputs.
Save money by routing tasks across models.
Reduce repeated AI calls.
Understand why real costs exceed estimates.
Budget for monitoring AI systems.
Estimate AI costs from token volumes.
Choose between open models and managed APIs.
Choose between cloud marketplace AI and direct provider integration.
Understand why retrieved text or user input can hijack an AI app.
Understand jailbreak risk without panic.
Design guardrails for AI agents with tools.
Handle personal data safely in AI features.
Monitor production LLM quality and incidents.
Improve citations in AI answers.
Prepare for AI-related production incidents.
25 briefs
Workflows with inputs, outputs, checks, failure modes and a stopping point.
Understand AI memory features and their privacy implications.
Decide whether a task needs an “agent” or deterministic automation.
Understand benchmark claims in model launches.
Build a practical evaluation set for an AI feature.
Put AI regression tests into a software pipeline.
Reduce disruption from frequent model releases/deprecations.
Reduce factual errors in LLM outputs.
Diagnose unwanted model refusals.
Understand why high benchmark scores may not translate.
Test an AI feature before launch.
Choose an adaptation strategy for an AI product.
Build reliable tool-using AI workflows.
Understand Model Context Protocol and when it helps.
Decide whether to add a vector database.
Improve retrieval quality in document AI.
Understand reranking after vector search.
Plan a simple document-QA prototype.
Evaluate coding agents for real engineering work.
Extract structured data from unstructured text.
Monitor an LLM app in production.
Design resilient AI features.
Add human review to AI workflows.
Manage changes to prompts in teams.
Use AI over internal policies and procedures.
Understand the site’s editorial trust model.
21 briefs
Dated context pieces: what changed, who should care, and what remains unproved.
Read model cards critically.
Compare major hosted providers without hype.
Understand whether an open model can be used commercially.
Decide whether a compact model fits a task.
Understand reasoning-model trade-offs.
Interpret MoE claims in model launches.
Choose a runtime for running open models.
Understand local model file choices.
Assess privacy and data-use differences between providers.
Add provider flexibility without rewriting apps.
Keep AI apps stable amid provider changes.
Identify places sensitive data can escape.
Choose safety controls for an AI app.
Evaluate AI vendors and platforms.
Understand the controversy around training data and outputs.
Understand AI energy and infrastructure claims.
Understand why GPUs and accelerators affect AI availability and cost.
Reduce switching risk in AI systems.
Evaluate provider reliability claims.
Turn abstract AI principles into processes.
Identify practical AI use cases for small firms.