parent 404e101ded
commit a7bc89b3a1
2 changed files with 2 additions and 26 deletions
@@ -1,5 +1,5 @@
 # Loading GGUF models
-In this directory, you will find examples on how to load GGUF model into `bigdl-llm`. For illustration purposes, we utilize the [llama-2-7b-chat.Q4_0.gguf](https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGUF/tree/main) and [llama-2-7b-chat.Q4_1.gguf](https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGUF/tree/main) as reference LLaMA2 GGUF models.
+In this directory, you will find examples on how to load GGUF model into `bigdl-llm`. For illustration purposes, we utilize the [llama-2-7b-chat.Q4_0.gguf](https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGUF/tree/main) as a reference LLaMA2 GGUF model.
 >Note: Only LLaMA2 family models are currently supported
 
 ## Requirements
@@ -66,15 +66,3 @@ What is AI?
 
 AI is a term used to describe a type of computer software that is designed to perform tasks that typically require human intelligence, such as visual perception, speech
 ```
-
-#### [llama-2-7b-chat.Q4_1.gguf](https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGUF/tree/main)
-```log
-Inference time: xxxx s
--------------------- Output --------------------
-### HUMAN:
-What is AI?
-
-### RESPONSE:
-
-Artificial intelligence (AI) is the field of study focused on creating machines that can perform tasks that typically require human intelligence, such as understanding language,
-```

@@ -1,5 +1,5 @@
 # Loading GGUF models
-In this directory, you will find examples on how to load GGUF model into `bigdl-llm`. For illustration purposes, we utilize the [llama-2-7b-chat.Q4_0.gguf](https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGUF/tree/main) and [llama-2-7b-chat.Q4_1.gguf](https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGUF/tree/main) as reference LLaMA2 GGUF models.
+In this directory, you will find examples on how to load GGUF model into `bigdl-llm`. For illustration purposes, we utilize the [llama-2-7b-chat.Q4_0.gguf](https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGUF/tree/main) as a reference LLaMA2 GGUF model.
 >Note: Only LLaMA2 family models are currently supported
 
 ## Requirements
@@ -63,15 +63,3 @@ What is AI?
 
 AI is a term used to describe a type of computer software that is designed to perform tasks that typically require human intelligence, such as visual perception, speech
 ```
-
-#### [llama-2-7b-chat.Q4_1.gguf](https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGUF/tree/main)
-```log
-Inference time: xxxx s
--------------------- Output --------------------
-### HUMAN:
-What is AI?
-
-### RESPONSE:
-
-Artificial intelligence (AI) is the field of study focused on creating machines that can perform tasks that typically require human intelligence, such as understanding language,
-```
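The README changed in this commit is about loading a `.gguf` file into `bigdl-llm`. As a side note, here is a minimal sketch of a pre-flight check that a downloaded file really is GGUF before handing it to a loader. It is not part of the commit or of `bigdl-llm`; the helper name `read_gguf_version` is hypothetical, and the only format detail it relies on is that a GGUF file begins with the 4-byte magic `GGUF` followed by a little-endian 32-bit version number.

```python
import struct

# First four bytes of every GGUF file.
GGUF_MAGIC = b"GGUF"


def read_gguf_version(path):
    """Return the GGUF format version stored in the header of `path`.

    Hypothetical helper for illustration only. Returns None when the
    file is too short or does not start with the GGUF magic bytes.
    """
    with open(path, "rb") as f:
        header = f.read(8)  # 4 bytes magic + 4 bytes version
    if len(header) < 8 or header[:4] != GGUF_MAGIC:
        return None
    # The version is an unsigned 32-bit little-endian integer.
    (version,) = struct.unpack("<I", header[4:8])
    return version
```

Calling `read_gguf_version("llama-2-7b-chat.Q4_0.gguf")` on a complete download should return an integer, while a truncated file or an HTML error page saved under that name returns `None`.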