* add Windows nightly test workflow
* use GitHub runner to run PR tests
* model load should use lowbit
* remove tmp dir after testing