My LLM command-line tool can now access Llama 2 via the Replicate API using a new llm-replicate plugin - I wrote about that here:
https://simonwillison.net/2023/Jul/18/accessing-llama-2/The tool logs all prompts and responses to a SQLite database, so it's great for comparing results from different models.