ipex-llm/python/llm
Yina Chen dc27b3bc35
Use sdp when rest token seq_len > 1 in llama & mistral (for lookup & spec) (#10790)
* update sdp condition

* update

* fix

* update & test llama

* mistral

* fix style

* update

* fix style

* remove pvc constraint

* update ds on arc

* fix style
2024-04-24 17:24:01 +08:00
dev LLM: add min new token to perf test. (#10869) 2024-04-24 14:32:02 +08:00
example LLM: make pipeline parallel inference example more common (#10786) 2024-04-24 09:28:52 +08:00
portable-zip Fix baichuan-13b issue on portable zip under transformers 4.36 (#10746) 2024-04-12 16:27:01 -07:00
scripts Update Env check Script (#10709) 2024-04-10 15:06:00 +08:00
src/ipex_llm Use sdp when rest token seq_len > 1 in llama & mistral (for lookup & spec) (#10790) 2024-04-24 17:24:01 +08:00
test Add llama3 and phi2 nightly test (#10874) 2024-04-24 16:58:56 +08:00
.gitignore [LLM] add chatglm pybinding binary file release (#8677) 2023-08-04 11:45:27 +08:00
setup.py Support llama-index install option for upstreaming purposes (#10866) 2024-04-23 19:08:29 +08:00
version.txt Update setup.py and add new actions and add compatible mode (#25) 2024-03-22 15:44:59 +08:00