飞搜侠

Ollama模型编译与量化工具 - 飞书文档

https://docs.feishu.cn/v/wiki/RFxuwGwq3ifDqgkMYKkcGB7cn1d/a6

我们需要准备模型文件、克隆ollama和llama.cpp仓库源码、安装依赖、下载camke、代码编译和模型量化、推理测试模型、创建和配置Modelfile、使用ollama框架的命令行工具 ...

阿里云PAI上部署ChatGLM2-6B，如何安装cuda及pytorch？ - 飞书文档

https://docs.feishu.cn/v/wiki/LF0VwQ32NirqdnkHH23cEyxqnjb/ad

__version__) ##编译当前版本的torch使用的cuda版本号print(torch.version.cuda) ... 源码下载：登录github官方地址下载源码或者直接使用git命令clone： $cd ...

Ollama的编译过程中需要注意什么？ - 飞书文档

https://docs.feishu.cn/v/wiki/LrdMwKKt3iZgoYkQlPRcvY1PnXc/ai

Sglang(推荐，速度快，吞吐量高). 源码安装sglang. git clone https://github.com ... txt #将pytorch模型转化为fp16的gguf python3 convert_hf_to_gguf.py models ...

Nuitka如何保障源码安全 - 飞书文档

https://docs.feishu.cn/v/wiki/Kqh1w1CeMis64HkLb4Lc6ODknob/a4

源码不会被反编译（所有的变量名和函数名已经被混淆为不可意会的字符串，原Python ... pytorch，tensorflow，OCC等Pyinstaller极难实现的解决方案，打包时间回到 ...

【边端部署教程】MiniCPM2.0 - 飞书文档

https://docs.feishu.cn/article/wiki/VL5kw9DsEiRDmJkEyTUcydE0nie

下载[MiniCPM pytorch模型](https://huggingface.co/openbmb/MiniCPM-2B-sft ... cpp的源码后编译. git clone https://github.com/ggerganov/llama.cpp cd llama ...

Prefix Caching机制- vLLM源码解析3 - 飞书文档

https://docs.feishu.cn/v/wiki/POYNwdkaRiQCtWkY6D4c4NcQnCd/ah

2024年7月5日 ... ... 构建了一个带有自动微分支持的Tensor 库。在这个过程中，我学到了很多关于PyTorch 的知识，所以我在这里写了一些相关内容。我尝试剥离PyTorch 的许多 ...

阅读著名编译器源代码从何开始？ - 飞书文档

https://docs.feishu.cn/v/wiki/YzyCwQzcUi8R0LkG243cCPRvnwg/ac

You can try out TensorFlow, PyTorch, MXNet, and other popular frameworks, and compare their performance using different compilers. This will help you gain ...

ollama 部署InternLM 实践 - 书生大模型开源社区

https://aicarrier.feishu.cn/wiki/RFxuwGwq3ifDqgkMYKkcGB7cn1d

... ：comefly 前言ollama框架支持多种格式的模型导入，包括但不限于GGUF、PyTorch和Safetensors ... cpp仓库源码、安装依赖、下载camke、代码编译和模型量化、推理测试模型、创建 ...

【边端部署教程】MiniCPM2.0 - 飞书

https://modelbest.feishu.cn/wiki/VL5kw9DsEiRDmJkEyTUcydE0nie

b. 下载[MiniCPM pytorch模型](https://huggingface.co/openbmb/MiniCPM-2B ... cpp的源码后编译. . . 代码块. Bash. git clone https://github.com/ggerganov ...

MiniCPM - Llama3 - V 2.5的python多gpu代码推理要点有哪些？

https://docs.feishu.cn/v/wiki/BcHIwjOLGihJXCkkSdMc2WhbnZf/a4

安装deploy，注意不要使用源码编译。 pip install deploy. 并发推理代码如下：. from lmdeploy import pipeline, TurbomindEngineConfig from lmdeploy.vl import ...