* add llama/bloom/gptneox wrapper
* add readme
* upload binary main file
* first commit of CMakeFiles.txt to include llama & gptneox
* initial support of quantize
* update cmake to consider only Linux for now
* support quantize interface
* update based on comment