Variety. Lower = predictable.
Nucleus sampling.
Token shortlist size.
Discourage repeats by count.
Discourage seen tokens.
Reduce verbatim echoes.
Minimum token prob vs best.
Adaptive nucleus filter.
Saved per assistant.