kd.evals.StandaloneLastCheckpoint

class kauldron.evals.StandaloneLastCheckpoint(**kwargs)

Bases: kauldron.evals.run_strategies.Standalone

Runs the evaluation only once, on the last checkpoint, after training has completed.

The evaluation runs as a separate XManager (XM) job. All kxm.Job parameters are optionally supported.

Example:

from kauldron import kd

# Evaluate once, on the last checkpoint, in a separate job on one A100.
kd.evals.Evaluator(
    run=kd.evals.StandaloneLastCheckpoint(platform='a100=1'),
)

If job_group='group_name' is set, all evaluators that share the same job_group run in the same XManager job (to save resources).
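
The grouping behaviour can be illustrated with a small toy model in plain Python. This is only a sketch of the semantics described above, not Kauldron's implementation: EvalSpec and group_into_jobs are hypothetical names, and the rule that an evaluator without a job_group gets a job of its own is an assumption.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class EvalSpec:
    """Hypothetical stand-in for an evaluator and its run strategy."""

    name: str
    job_group: Optional[str] = None  # mirrors the `job_group` argument


def group_into_jobs(evals: list[EvalSpec]) -> dict[str, list[str]]:
    """Maps each job name to the names of the evaluators it would run."""
    jobs: dict[str, list[str]] = defaultdict(list)
    for ev in evals:
        # Assumption: an evaluator without a job_group gets its own job,
        # named after the evaluator itself.
        job_name = ev.job_group if ev.job_group is not None else ev.name
        jobs[job_name].append(ev.name)
    return dict(jobs)


evals = [
    EvalSpec("eval_a", job_group="group_name"),
    EvalSpec("eval_b", job_group="group_name"),
    EvalSpec("eval_c"),
]
jobs = group_into_jobs(evals)
print(jobs)
# → {'group_name': ['eval_a', 'eval_b'], 'eval_c': ['eval_c']}
```

In this model, eval_a and eval_b end up in one job because they share a job_group, while eval_c runs separately.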