ares.goals package

Submodules

Base class for Attack Goal module

class ares.goals.attack_goal.AttackGoal(config: dict[str, Any])[source]

Bases: ABC

Base class to implement attack goals extraction from the desired benchmark specified in the input

abstractmethod run(limit: bool | None = None, first_n: int | None = None) → list[dict[str, Any]][source]: Method to run extraction of attack goals from the desired source, e.g. HarmBench

static template() → dict[str, Any][source]

This method returns a template for the configuration of Attack Goal module

Returns:: dictionary with default values for the configuration of Attack Goal module

Implementation of generic, Panda-based Attack Goal

class ares.goals.generic_attack_goal.GenericAttackGoal(config: dict[str, Any])[source]

Implementation of Attack Goal for loading data from a generic file

run(limit: bool | None = None, first_n: int | None = -1) → list[dict[str, str | int]][source]

Runs seeds extractions from goals and returns a list of dictionaries containing the seeds.

Parameters:

limit – If True, limits the number of seeds to the first N (e.g. LIMIT). If False, returns all results. If None, behaves the same as False. Default is None.
first_n – The number of seeds to return if limit is True. Default is -1 (no limit).

Returns:

A list of dictionaries, where each dictionary represents a seed with label.

Example:

>>> goal = GenericAttackGoal(config=config)
>>> goal.run()
>>> gold.run(limit=True, first_n=10)

static template() → dict[str, Any][source]

This method returns a template for the configuration of GenericAttackGoal module

Returns:: dictionary with default values for the configuration of GenericAttackGoal module

ARES Core attack goals