Afaiu “sampling” here, it is controlled with (not only?) topk and temp parameters in e.g. “text generation web ui”. You may find these in other frontends probably too.
This ofc implies local models and that you have a decent cpu + min 64gb of ram to run above 7b-sized model.