The LLM as a System Component
Tokenizer → context → forward pass → sampling · decoder-only transformer at a systems level · open-weights (Llama 4, Qwen 3, Mistral, Gemma 3, DeepSeek) vs closed-weights · reading a model card without falling for the benchmark.
Build llmpick — multi-model recommender