* Support directly loading GPTQ models from huggingface
* fix style
* fix tests
* change example structure
* address comments
* fix style
* address comments