However, I have a question regarding its specific deployment method: how can I merge the shards of a model stored in the Safetensors format? Specifically, I'm referring to files named 'model-00001-of-00002.safetensors' and 'model-00002-of-00002.safetensors'.
My motivation is straightforward: I want to combine the Safetensors shards and then use the 'convert.py' script from the llama.cpp project to transform the resulting single .safetensors file into the GGUF format. That conversion makes it possible to run the model on WasmEdge.
I appreciate any guidance on this matter. Thank you.