windows wsl2（ubuntu）使用xinference快速部署ai模型

这篇具有很好参考价值的文章主要介绍了windows wsl2（ubuntu）使用xinference快速部署ai模型。希望对大家有所帮助。如果存在错误或未考虑完全的地方，请大家不吝赐教，您也可以点击"举报违法"按钮提交疑问。

xinference介绍

Xorbits Inference（Xinference）是一个性能强大且功能全面的分布式推理框架。可用于大语言模型（LLM），语音识别模型，多模态模型等各种模型的推理。通过 Xorbits Inference，你可以轻松地一键部署你自己的模型或内置的前沿开源模型。无论你是研究者，开发者，或是数据科学家，都可以通过 Xorbits Inference 与最前沿的 AI 模型，发掘更多可能。

windows wsl2（ubuntu）使用xinference快速部署ai模型,AI,人工智能,运维,linux,windows

官方文档：GitHub - xorbitsai/inference: Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

如何安装wsl2 并安装linux子系统

参考文档：windows 使用wsl2安装linux子系统

演示安装ubuntu 22

列出可安装的子系统命令：wsl --list --online

PS C:\Users\linyu> wsl --list --online
以下是可安装的有效分发的列表。
使用 'wsl.exe --install <Distro>' 安装。
NAME                                   FRIENDLY NAME
Ubuntu                                 Ubuntu
Debian                                 Debian GNU/Linux
kali-linux                             Kali Linux Rolling
Ubuntu-18.04                           Ubuntu 18.04 LTS
Ubuntu-20.04                           Ubuntu 20.04 LTS
Ubuntu-22.04                           Ubuntu 22.04 LTS
OracleLinux_7_9                        Oracle Linux 7.9
OracleLinux_8_7                        Oracle Linux 8.7
OracleLinux_9_1                        Oracle Linux 9.1
openSUSE-Leap-15.5                     openSUSE Leap 15.5
SUSE-Linux-Enterprise-Server-15-SP4    SUSE Linux Enterprise Server 15 SP4
SUSE-Linux-Enterprise-15-SP5           SUSE Linux Enterprise 15 SP5
openSUSE-Tumbleweed                    openSUSE Tumbleweed

安装ubuntu 命令：wsl --install -d Ubuntu-22.04

PS C:\Users\linyu> wsl --install -d Ubuntu-22.04
正在安装: Ubuntu 22.04 LTS
已安装 Ubuntu 22.04 LTS。
正在启动 Ubuntu 22.04 LTS...
Installing, this may take a few minutes...
Please create a default UNIX user account. The username does not need to match your Windows username.
For more information visit: https://aka.ms/wslusers

输入账号密码安装完成

Enter new UNIX username:
New password:
Retype new password:
passwd: password updated successfully
Installation successful!

安装显卡驱动与cuda驱动

参考文档：wsl2 ubuntu子系统安装显卡驱动与cuda

安装python虚拟运行环境conda

参考文档：conda环境安装

创建xinference python虚拟运行环境

创建xinference运行目录

mkdir -p /data/xinference

创建环境命令：

conda create -n xinference python==3.10

进入环境：

conda activate xinference

按需安装参考官方文档：GitHub - xorbitsai/inference: Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop. - xorbitsai/inferencehttps://github.com/xorbitsai/inference

本地快速安装：pip install "xinference[all]"

(xinference) root@DESKTOP-TUR5ISE:/data/xinference# pip install "xinference[all]"

安装完成

windows wsl2（ubuntu）使用xinference快速部署ai模型,AI,人工智能,运维,linux,windows

国内拉模型配置环境变量

配置如下环境变量可以从国内的modelscope拉模型默认是从Hugging Face拉取，需要外网。

命令行输入：

export XINFERENCE_MODEL_SRC=modelscope
export HF_ENDPOINT=https://hf-mirror.com

启动服务

启动服务命令：

XINFERENCE_HOME=/data/xinference xinference-local --host 0.0.0.0 --port 9997

查看ip地址

windows wsl2（ubuntu）使用xinference快速部署ai模型,AI,人工智能,运维,linux,windows

访问服务 http://IP地址:9997

windows wsl2（ubuntu）使用xinference快速部署ai模型,AI,人工智能,运维,linux,windows

运行模型

点击小火箭图标启动chatglm3 模型测试

windows wsl2（ubuntu）使用xinference快速部署ai模型,AI,人工智能,运维,linux,windows

后台开始下载模型

windows wsl2（ubuntu）使用xinference快速部署ai模型,AI,人工智能,运维,linux,windows

下载完后就看到模型已经在运行列表中了

windows wsl2（ubuntu）使用xinference快速部署ai模型,AI,人工智能,运维,linux,windows

之后就可以进行调用或对话了。

windows wsl2（ubuntu）使用xinference快速部署ai模型,AI,人工智能,运维,linux,windows 文章来源地址https://www.toymoban.com/news/detail-848017.html

到了这里，关于windows wsl2（ubuntu）使用xinference快速部署ai模型的文章就介绍完了。如果您还想了解更多内容，请在右上角搜索TOY模板网以前的文章或继续浏览下面的相关文章，希望大家以后多多支持TOY模板网！

Toy模板网

windows wsl2（ubuntu）使用xinference快速部署ai模型

xinference介绍

如何安装wsl2 并安装linux子系统

安装显卡驱动与cuda驱动

安装python虚拟运行环境conda

创建xinference python虚拟运行环境

国内拉模型配置环境变量

启动服务

运行模型

觉得文章有用就打赏一下文章作者

支付宝扫一扫打赏

微信扫一扫打赏

支付宝扫一扫领取红包，优惠每天领

二维码1

二维码2