Yining Wang a6a8afc47e Add qwen vl CPU example (#9221)

Co-authored-by: Yuwen Hu <yuwen.hu@intel.com>

2023-10-25 13:22:12 +08:00


BigDL-LLM INT4 Optimization for Large Language Model

You can use the optimize_model API to accelerate general PyTorch models on Intel servers and PCs. This directory contains example scripts to help you get started quickly with BigDL-LLM on some popular open-source community models. Each model has its own dedicated folder with detailed instructions on how to install and run it.
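As a rough illustration of the workflow described above, the sketch below loads a Hugging Face causal language model and passes it through optimize_model with its default settings. The helper names load_optimized_model and generate are hypothetical and exist only for this sketch, and the use of AutoModelForCausalLM is an assumption. Each model folder documents the exact loading code it needs.

```python
def load_optimized_model(model_path: str):
    """Load a causal LM and apply BigDL-LLM's default low-bit optimization.

    `model_path` is a placeholder: a Hugging Face repo id or a local folder.
    """
    # Imports live inside the function so this file can be imported on a
    # machine where bigdl-llm and transformers are not installed yet.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from bigdl.llm import optimize_model

    # Load the original PyTorch model first (assumption: a causal LM).
    model = AutoModelForCausalLM.from_pretrained(
        model_path,
        torch_dtype="auto",
        low_cpu_mem_usage=True,
    )
    # Apply the optimization with default arguments.
    model = optimize_model(model)
    tokenizer = AutoTokenizer.from_pretrained(model_path)
    return model, tokenizer


def generate(model, tokenizer, prompt: str, max_new_tokens: int = 32) -> str:
    """Tokenize a prompt, run generation, and decode the first sequence."""
    input_ids = tokenizer(prompt, return_tensors="pt").input_ids
    output = model.generate(input_ids, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output[0], skip_special_tokens=True)
```

A typical call would be `model, tokenizer = load_optimized_model("path/to/model")` followed by `generate(model, tokenizer, "What is AI?")`. Some models need extra loading arguments or a different model class, so treat this as a starting point rather than the per-model recipe.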

Verified models

Model	Example
LLaMA 2	link
ChatGLM	link
OpenAI Whisper	link
BERT	link
Bark	link
Mistral	link
Flan-T5	link
Phi-1_5	link
Qwen-VL	link

Recommended Requirements

To run the examples, we recommend Intel® Xeon® processors (server) or 12th Gen or later Intel® Core™ processors (client).

For OS, BigDL-LLM supports Ubuntu 20.04 or later, CentOS 7 or later, and Windows 10/11.

Best Known Configuration on Linux

For better performance, it is recommended to set environment variables on Linux with the help of BigDL-Nano:

pip install bigdl-nano
source bigdl-nano-init
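To confirm that the variables took effect, you can inspect the environment from Python in the same shell session. The sketch below is only a convenience check. The variable names in TUNING_VARS are an assumption about what CPU tuning scripts of this kind commonly set, not a documented list of what bigdl-nano-init exports, and report_tuning_env is a hypothetical helper.

```python
import os

# Assumed list of commonly tuned variables; adjust it to match what
# bigdl-nano-init actually reports on your machine.
TUNING_VARS = (
    "OMP_NUM_THREADS",
    "KMP_AFFINITY",
    "KMP_BLOCKTIME",
    "LD_PRELOAD",
    "MALLOC_CONF",
)


def report_tuning_env(environ=None):
    """Return a dict mapping each tuning variable to its value, or None if unset."""
    env = os.environ if environ is None else environ
    return {name: env.get(name) for name in TUNING_VARS}


if __name__ == "__main__":
    for name, value in report_tuning_env().items():
        print(f"{name} = {value if value is not None else '<not set>'}")
```

Run the script in the shell where you sourced bigdl-nano-init. A variable set in another terminal will not be visible here.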
