ipex-llm/python/llm/example
Guoqiong Song aa319de5e8 Add streaming-llm using llama2 on CPU (#9265)
Enable streaming-llm to let model take infinite inputs, tested on desktop and SPR10
2023-10-27 01:30:39 -07:00
..
CPU Add streaming-llm using llama2 on CPU (#9265) 2023-10-27 01:30:39 -07:00
GPU Support deepspeed AutoTP (#9230) 2023-10-24 23:46:28 -07:00