* update ppl tests
* use load_dataset API
* add exception handling
* add language argument
* address comments