<?xml version="1.0" encoding="UTF-8"?><urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9" xmlns:news="http://www.google.com/schemas/sitemap-news/0.9" xmlns:xhtml="http://www.w3.org/1999/xhtml" xmlns:image="http://www.google.com/schemas/sitemap-image/1.1" xmlns:video="http://www.google.com/schemas/sitemap-video/1.1"><url><loc>https://thellms.dev/</loc></url><url><loc>https://thellms.dev/about/</loc></url><url><loc>https://thellms.dev/briefs/</loc></url><url><loc>https://thellms.dev/cache/</loc></url><url><loc>https://thellms.dev/cache/a-simple-llm-cost-calculator-editors-can-maintain/</loc></url><url><loc>https://thellms.dev/cache/ai-feature-unit-economics-cost-per-user-task-and-successful-answer/</loc></url><url><loc>https://thellms.dev/cache/api-model-pricing-input-output-cache-batch-costs/</loc></url><url><loc>https://thellms.dev/cache/batch-apis-for-llms-cheaper-slower-and-often-underused/</loc></url><url><loc>https://thellms.dev/cache/benchmark-leaderboards-for-busy-buyers-chatbot-arena-livebench-and-what-to-ignore/</loc></url><url><loc>https://thellms.dev/cache/caching-ai-answers-when-it-is-safe-risky-or-pointless/</loc></url><url><loc>https://thellms.dev/cache/context-windows-explained-why-bigger-is-not-always-better/</loc></url><url><loc>https://thellms.dev/cache/creating-a-model-scorecard-for-your-own-workload/</loc></url><url><loc>https://thellms.dev/cache/embeddings-explained-for-business-search-and-rag/</loc></url><url><loc>https://thellms.dev/cache/fine-tuning-vs-prompting-vs-rag-decision-checklist/</loc></url><url><loc>https://thellms.dev/cache/frontier-gpt-55-llm-ethics-raw-draft/</loc></url><url><loc>https://thellms.dev/cache/function-calling-and-tool-use-where-agents-actually-fail/</loc></url><url><loc>https://thellms.dev/cache/function-calling-benchmarks-why-tool-use-scores-do-not-guarantee-agents-work/</loc></url><url><loc>https://thellms.dev/cache/llm-observability-cost-logs-traces-and-evaluation-storage/</loc></url>
<url><loc>https://thellms.dev/cache/lm-eval-harness-explained-for-non-researchers/</loc></url><url><loc>https://thellms.dev/cache/local-quantized-llm-vs-frontier-model-writing-test/</loc></url><url><loc>https://thellms.dev/cache/local-qwen-llm-ethics-raw-draft/</loc></url><url><loc>https://thellms.dev/cache/long-context-benchmarks-needle-tests-document-qa-and-real-recall/</loc></url><url><loc>https://thellms.dev/cache/model-parameters-and-sizes-why-7b-70b-and-moe-labels-can-mislead/</loc></url><url><loc>https://thellms.dev/cache/model-routing-using-cheap-models-first-without-breaking-quality/</loc></url><url><loc>https://thellms.dev/cache/multimodal-models-explained-text-images-audio-and-video-in-practical-products/</loc></url><url><loc>https://thellms.dev/cache/open-weights-vs-hosted-apis-practical-trade-offs/</loc></url><url><loc>https://thellms.dev/cache/output-tokens-are-expensive-designing-shorter-ai-answers-without-hurting-usefulness/</loc></url><url><loc>https://thellms.dev/cache/prompt-caching-explained-when-repeated-context-becomes-cheaper/</loc></url><url><loc>https://thellms.dev/cache/prompt-length-output-length-and-why-ai-bills-surprise-teams/</loc></url><url><loc>https://thellms.dev/cache/rag-costs-vector-database-embeddings-reranking-and-generation/</loc></url><url><loc>https://thellms.dev/cache/rag-evaluation-checking-retrieval-before-blaming-the-model/</loc></url><url><loc>https://thellms.dev/cache/rate-limits-explained-requests-tokens-tiers-and-hidden-launch-risks/</loc></url><url><loc>https://thellms.dev/cache/structured-outputs-and-json-mode-reliability-limits/</loc></url><url><loc>https://thellms.dev/cache/system-prompts-developer-prompts-and-user-prompts-who-controls-what/</loc></url><url><loc>https://thellms.dev/cache/temperature-top-p-and-deterministic-outputs-what-the-settings-actually-do/</loc></url><url><loc>https://thellms.dev/cache/the-hidden-cost-of-retries-fallbacks-and-validation-loops/</loc></url>
<url><loc>https://thellms.dev/cache/what-is-a-token-and-why-it-affects-ai-cost/</loc></url><url><loc>https://thellms.dev/comparisons/</loc></url><url><loc>https://thellms.dev/comparisons/cloud-ai-platforms-vs-direct-model-apis/</loc></url><url><loc>https://thellms.dev/comparisons/fine-tuning-economics/</loc></url><url><loc>https://thellms.dev/comparisons/gpu-rental-for-llm-inference/</loc></url><url><loc>https://thellms.dev/comparisons/hosted-api-vs-self-hosted-open-model/</loc></url><url><loc>https://thellms.dev/comparisons/model-gateways-and-routers-openrouter-litellm-and-build-vs-buy/</loc></url><url><loc>https://thellms.dev/comparisons/promptfoo-vs-lm-eval-harness-when-each-is-useful/</loc></url><url><loc>https://thellms.dev/contact/</loc></url><url><loc>https://thellms.dev/diff/</loc></url><url><loc>https://thellms.dev/diff/ai-adoption-in-small-businesses-where-llms-help-first/</loc></url><url><loc>https://thellms.dev/diff/ai-energy-use-useful-facts-without-moral-panic/</loc></url><url><loc>https://thellms.dev/diff/ai-slas-and-status-pages-what-reliability-evidence-vendors-publish/</loc></url><url><loc>https://thellms.dev/diff/ai-vendor-lock-in-model-apis-embeddings-vector-stores-and-eval-data/</loc></url><url><loc>https://thellms.dev/diff/changelog-watching-for-ai-teams-deprecations-pricing-and-model-aliases/</loc></url><url><loc>https://thellms.dev/diff/copyright-and-training-data-what-ai-product-teams-can-responsibly-say/</loc></url><url><loc>https://thellms.dev/diff/enterprise-ai-procurement-questions-before-buying-a-platform/</loc></url><url><loc>https://thellms.dev/diff/eu-ai-act-for-llm-buyers-what-to-track-without-overclaiming/</loc></url><url><loc>https://thellms.dev/diff/guardrails-compared-policy-prompts-classifiers-validators-and-permissions/</loc></url><url><loc>https://thellms.dev/diff/hardware-supply-and-inference-economics-why-chips-shape-ai-products/</loc></url><url><loc>https://thellms.dev/diff/local-llm-runtimes-ollama-llama-cpp-vllm-and-tgi-in-plain-english/</loc></url>
<url><loc>https://thellms.dev/diff/meta-llama-and-open-model-licensing-what-builders-must-check/</loc></url><url><loc>https://thellms.dev/diff/mixture-of-experts-models-why-active-parameters-matter/</loc></url><url><loc>https://thellms.dev/diff/nist-ai-rmf-and-genai-guidance-practical-use-for-small-teams/</loc></url><url><loc>https://thellms.dev/diff/openai-anthropic-google-and-mistral-apis-what-comparison-pages-should-measure/</loc></url><url><loc>https://thellms.dev/diff/provider-data-retention-policies-what-api-users-should-compare/</loc></url><url><loc>https://thellms.dev/diff/quantisation-explained-why-model-files-have-q4-q5-and-gguf-labels/</loc></url><url><loc>https://thellms.dev/diff/reasoning-models-what-thinking-modes-change-for-cost-and-latency/</loc></url><url><loc>https://thellms.dev/diff/responsible-ai-policies-that-builders-can-actually-operationalise/</loc></url><url><loc>https://thellms.dev/diff/small-language-models-when-smaller-is-better/</loc></url><url><loc>https://thellms.dev/diff/uk-ai-governance-sources-ico-ncsc-cma-and-dsit-in-one-map/</loc></url><url><loc>https://thellms.dev/diff/what-model-cards-tell-you-and-what-they-do-not/</loc></url><url><loc>https://thellms.dev/disclaimer/</loc></url><url><loc>https://thellms.dev/editorial-policy/</loc></url><url><loc>https://thellms.dev/evals/</loc></url><url><loc>https://thellms.dev/evals/coding-benchmarks-explained/</loc></url><url><loc>https://thellms.dev/evals/contamination-and-leakage/</loc></url><url><loc>https://thellms.dev/evals/helm-style-evaluation/</loc></url><url><loc>https://thellms.dev/evals/how-llm-benchmarks-work-and-what-they-miss/</loc></url><url><loc>https://thellms.dev/evals/human-evaluation-for-llms-rubrics/</loc></url><url><loc>https://thellms.dev/evals/llm-as-a-judge-when-automated-grading-helps-and-when-it-lies/</loc></url><url><loc>https://thellms.dev/evals/synthetic-eval-datasets-useful-shortcut-or-false-confidence/</loc></url><url><loc>https://thellms.dev/glossary/</loc></url>
<url><loc>https://thellms.dev/methodology/</loc></url><url><loc>https://thellms.dev/privacy/</loc></url><url><loc>https://thellms.dev/run/</loc></url><url><loc>https://thellms.dev/run/access-control-for-rag-why-retrieval-permissions-matter-before-generation/</loc></url><url><loc>https://thellms.dev/run/ai-agents-vs-workflows-a-plain-english-difference-for-teams/</loc></url><url><loc>https://thellms.dev/run/ai-coding-agents-what-to-measure-before-trusting-them/</loc></url><url><loc>https://thellms.dev/run/ai-incident-response-what-to-do-when-a-model-gives-harmful-or-wrong-advice/</loc></url><url><loc>https://thellms.dev/run/ai-output-monitoring-what-to-log-sample-and-review/</loc></url><url><loc>https://thellms.dev/run/building-a-minimum-viable-rag-system-without-overengineering/</loc></url><url><loc>https://thellms.dev/run/building-an-internal-ai-policy-bot-safe-pattern-or-risky-shortcut/</loc></url><url><loc>https://thellms.dev/run/chat-history-is-not-memory-how-llm-apps-remember-users/</loc></url><url><loc>https://thellms.dev/run/chunking-documents-for-rag-size-overlap-and-metadata-choices/</loc></url><url><loc>https://thellms.dev/run/citation-quality-in-ai-answers-source-grounded-does-not-mean-source-faithful/</loc></url><url><loc>https://thellms.dev/run/data-leakage-in-llm-apps-logs-prompts-files-and-vendor-retention/</loc></url><url><loc>https://thellms.dev/run/eval-ci-for-ai-apps-testing-prompts-before-every-release/</loc></url><url><loc>https://thellms.dev/run/eval-gaming-when-models-optimise-for-the-test-rather-than-the-task/</loc></url><url><loc>https://thellms.dev/run/fallback-design-what-happens-when-the-ai-call-fails/</loc></url><url><loc>https://thellms.dev/run/golden-datasets-for-llm-products-how-small-regression-sets-prevent-regressions/</loc></url><url><loc>https://thellms.dev/run/hallucination-testing-how-to-build-a-small-regression-set/</loc></url><url><loc>https://thellms.dev/run/human-in-the-loop-ai-approval-queues-that-do-not-become-bottlenecks/</loc></url>
<url><loc>https://thellms.dev/run/inference-vs-training-vs-fine-tuning-three-terms-operators-confuse/</loc></url><url><loc>https://thellms.dev/run/jailbreaks-vs-product-safety-what-operators-can-realistically-control/</loc></url><url><loc>https://thellms.dev/run/latency-in-llm-apps-first-token-total-time-and-user-experience/</loc></url><url><loc>https://thellms.dev/run/llm-observability-basics-traces-prompts-evals-and-feedback-loops/</loc></url><url><loc>https://thellms.dev/run/mcp-explained-tools-resources-prompts-and-the-current-hype-gap/</loc></url><url><loc>https://thellms.dev/run/model-drift-without-training-why-api-behavior-changes-over-time/</loc></url><url><loc>https://thellms.dev/run/pii-handling-for-llm-apps-minimisation-before-redaction/</loc></url><url><loc>https://thellms.dev/run/prompt-injection-explained-for-business-users/</loc></url><url><loc>https://thellms.dev/run/prompt-versioning-treating-prompts-like-production-code/</loc></url><url><loc>https://thellms.dev/run/red-teaming-an-llm-feature-a-practical-first-week-checklist/</loc></url><url><loc>https://thellms.dev/run/refusals-and-over-refusals-testing-whether-safety-blocks-useful-work/</loc></url><url><loc>https://thellms.dev/run/rerankers-explained-the-quiet-quality-layer-in-rag-systems/</loc></url><url><loc>https://thellms.dev/run/safe-prompt-templates-reducing-brittle-instructions-and-hidden-assumptions/</loc></url><url><loc>https://thellms.dev/run/schema-first-ai-extraction-making-llms-useful-for-messy-documents/</loc></url><url><loc>https://thellms.dev/run/the-evidence-led-ai-website-manifesto-how-everything-llm-will-review-claims/</loc></url><url><loc>https://thellms.dev/run/the-model-release-treadmill-how-to-avoid-rebuilding-every-month/</loc></url><url><loc>https://thellms.dev/run/tool-use-safety-stopping-agents-from-taking-dangerous-actions/</loc></url><url><loc>https://thellms.dev/run/vector-databases-when-semantic-search-is-enough-and-when-it-is-not/</loc></url></urlset>