ipex-llm

History

Zhicun f6fef09933 fix prompt format for llama-2 in langchain (#10637 )		2024-04-03 14:17:34 +08:00
..
chat.py	fix prompt format for llama-2 in langchain (#10637 )	2024-04-03 14:17:34 +08:00
llm_math.py	Remove native_int4 in LangChain examples (#10510 )	2024-03-27 17:48:16 +08:00
rag.py	Remove native_int4 in LangChain examples (#10510 )	2024-03-27 17:48:16 +08:00
README.md	Replace ipex with ipex-llm (#10554 )	2024-03-28 13:54:40 +08:00
voiceassistant.py	Remove native_int4 in LangChain examples (#10510 )	2024-03-27 17:48:16 +08:00

README.md

Langchain Examples

This folder contains examples showcasing how to use langchain with ipex-llm.

Install-IPEX LLM

Ensure ipex-llm is installed by following the IPEX-LLM Installation Guide.

Install Dependences Required by the Examples

pip install langchain==0.0.184
pip install -U chromadb==0.3.25
pip install -U pandas==2.0.3

Example: Chat

The chat example (chat.py) shows how to use LLMChain to build a chat pipeline.

To run the example, execute the following command in the current directory:

python transformers_int4/chat.py -m <path_to_model> [-q <your_question>]

Note: if -q is not specified, it will use What is AI by default.

Example: RAG (Retrival Augmented Generation)

The RAG example (rag.py) shows how to load the input text into vector database, and then use load_qa_chain to build a retrival pipeline.

To run the example, execute the following command in the current directory:

python transformers_int4/rag.py -m <path_to_model> [-q <your_question>] [-i <path_to_input_txt>]

Note: If -i is not specified, it will use a short introduction to Big-DL as input by default. if -q is not specified, What is IPEX LLM? will be used by default.

Example: Math

The math example (math.py) shows how to build a chat pipeline specialized in solving math questions. For example, you can ask What is 13 raised to the .3432 power?

To run the exmaple, execute the following command in the current directory:

python transformers_int4/llm_math.py -m <path_to_model> [-q <your_question>]

Note: if -q is not specified, it will use What is 13 raised to the .3432 power? by default.

Example: Voice Assistant

The voice assistant example (voiceassistant.py) showcases how to use langchain to build a pipeline that takes in your speech as input in realtime, use an ASR model (e.g. Whisper-Medium) to turn speech into text, and then feed the text into large language model to get response.

To run the exmaple, execute the following command in the current directory:

python transformers_int4/voiceassistant.py -m <path_to_model> [-q <your_question>]

Runtime Arguments Explained:

-m MODEL_PATH: Required, the path to the
-r RECOGNITION_MODEL_PATH: Required, the path to the huggingface speech recognition model
-x MAX_NEW_TOKENS: the max new tokens of model tokens input
-l LANGUAGE: you can specify a language such as "english" or "chinese"
-d True|False: whether the model path specified in -m is saved low bit model.

Legacy (Native INT4 examples)

IPEX-LLM also provides langchain integrations using native INT4 mode. Those examples can be foud in native_int4 folder. For detailed instructions of settting up and running native_int4 examples, refer to Native INT4 Examples README.