eval.lm_eval
evaluate(model, task_list=None, write_out=True, limit=0, shots=5)
The evaluate function takes a model and evaluates it on the tasks specified in task_list. The results are printed to stdout, and optionally written out to a file.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
model |
Specify the model to be evaluated |
required | |
task_list |
Optional[List[str]]
|
Optional[List[str]]: Specify which tasks to evaluate on |
None
|
write_out |
bool
|
bool: Write the output to a file |
True
|
limit |
int
|
int: Limit the number of examples that are evaluated |
0
|
shots |
int
|
int: Specify how many times to run the model on a given task |
5
|
Returns:
Type | Description |
---|---|
A dictionary with the following keys |
Source code in src/python/easydel/eval/lm_eval.py
8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 |
|