* except lm_head * remove * support gw lm_head * update * fix * remove run.bat * fix style * support llama3
* first commit * update example * fix style * update example * embedding as const * fix generate * code refactor * meet code review * fix style * change max_output_len to max_context_len * fix all-in-one * fix example * add check for new tokens