Skip to content

easydel

Training Configs

reinforcement_learning.trainer.training_configs

`RewardConfig` `dataclass`

Source code in src/python/easydel/reinforcement_learning/trainer/training_configs.py

@dataclass
class RewardConfig:
    max_length: Optional[int] = None
    """
    The maximum length of the sequences in the batch. This argument is 
    required if you want to use the default data collator.
    """
    gradient_checkpointing: Optional[bool] = True
    """If True, use gradient checkpointing to save memory at the expense of slower backward pass."""
    gradient_checkpointing_kwargs: Optional[dict] = None
    """Keyword arguments to pass to the gradient checkpointing function."""

`gradient_checkpointing: Optional[bool] = True` `class-attribute` `instance-attribute`

If True, use gradient checkpointing to save memory at the expense of slower backward pass.

`gradient_checkpointing_kwargs: Optional[dict] = None` `class-attribute` `instance-attribute`

Keyword arguments to pass to the gradient checkpointing function.

`max_length: Optional[int] = None` `class-attribute` `instance-attribute`

The maximum length of the sequences in the batch. This argument is required if you want to use the default data collator.