In the LLM, using sampling
When we set the temperature, top K, and top P in the LLM, do you know what they are and what the order for applying them is?
- LLM Samplers Explained
- How to sample from the language model
- What is the actual order of execution when Top-K, Top-P, and Temperature are used together for NLP decoding?
- Token selection strategies: Top-K, Top-P, and Temperature