AI Ethics and Human Values
Ethics is not a checkbox AI systems pass or fail. It is an ongoing process of identifying whose interests are at stake, what values are in tension, and how to make decisions under genuine uncertainty about consequences. The field of AI ethics has grown rapidly in response to real harms from deployed systems — but it has also attracted criticism for providing ethical cover without requiring real accountability. This module examines what AI ethics is, what it struggles with, and what it actually takes to build AI systems that respect human values.
The core ethical principles and their tensions
Several frameworks have emerged as anchors for AI ethics discussions. The most widely cited include fairness, accountability, transparency, and safety — often abbreviated as FATE. The EU's approach adds human agency and oversight, technical robustness, privacy, and non-discrimination. Virtually every major AI ethics statement emphasizes some combination of these principles.
The challenge is that these principles are genuinely in tension with each other and with commercial and operational realities. Transparency may compromise safety (adversaries can exploit known vulnerabilities). Fairness across different groups may require different treatment, which itself raises equity questions. Explainability and accuracy are often in inverse relationship — the most accurate models are often the least interpretable. Ethics involves navigating these tensions rather than resolving them once and for all.
Mathematical definitions of fairness are not compatible with each other. Equal accuracy across groups (demographic parity) conflicts with equal error rates across groups (equalized odds), which conflicts with predictive accuracy for individuals. A system can satisfy one definition of fairness while violating another. Choosing among them is a values decision that cannot be made purely technically — it requires deciding whose interests to prioritize and what kind of errors matter most.
The value alignment problem
At its deepest level, AI ethics is concerned with the value alignment problem: ensuring that AI systems pursue goals that are genuinely aligned with human values rather than proxy measures that diverge from what we actually care about. A social media algorithm optimizing for "engagement" aligns with a narrow proxy metric and diverges from the actual human interests users have — in accurate information, meaningful connection, and wellbeing.
The alignment problem becomes more acute as AI systems become more capable. A narrow AI that maximizes click-through rates can cause social harm. A highly capable AI system pursuing subtly misspecified objectives could cause catastrophic harm. The field of AI safety is concerned with these longer-term alignment challenges, while near-term AI ethics focuses on the harms from systems deployed today.
From principles to practice
Specific ethical challenges that matter now
- Predictive systems and self-fulfilling prophecies: When risk scores predict criminal recidivism, housing instability, or child welfare risk, deploying those predictions can create the conditions they predict — denying resources to people flagged as high-risk reinforces the patterns that produced the score
- Consent and data: The data underlying AI systems was collected in contexts where users did not meaningfully consent to its use in training AI — or could not have anticipated that use. Retroactive consent frameworks are inadequate
- Autonomy and manipulation: Personalized AI systems that learn to exploit psychological vulnerabilities to achieve engagement undermine the autonomy that ethical frameworks are designed to protect
- Responsibility gaps: When AI systems cause harm, attribution is diffuse — developers, deployers, users, regulators all bear some responsibility, and this diffusion can enable everyone to point elsewhere
Many AI ethics problems are framed as technical problems with technical solutions — better fairness metrics, more explainable models, more diverse training data. These improvements matter, but they don't resolve the underlying question of whether a system should be built and deployed at all. Some applications of AI — mass surveillance, autonomous lethal weapons, psychological manipulation for engagement — raise ethical objections that better technical implementation cannot address.
Genuine AI ethics is a practice, not a checklist. It involves asking hard questions before deployment: What could go wrong? Who is most vulnerable? What are the reversibility options if harm emerges? It involves building structures that make those questions matter — review processes with real authority to delay or stop deployment, external audits with genuine access, affected community input with actual influence. And it involves ongoing monitoring after deployment, because systems change and their contexts change in ways that produce new harms over time.