Alignment Failure

alignmentethicsmisusegoal

The model’s objectives, values, or behaviors diverge from intended or ethical outcomes, leading to harmful or unintended actions.

Technical Details
Impact Level:Medium
Attack Vectors
  • Goal Misalignment: Model pursues objectives contrary to user or organizational intent. [Medium]
  • Ethical Boundary Violation: Model circumvents ethical or legal constraints. [Medium]
Impact Analysis
Risk Score: 7/10
Mitigation Categories