00.Q+.530 Trustworthy AI: Fairness, Interpretability and Privacy

Veranstaltungsdetails

Lehrende/r: Mattia Cerrato

Veranstaltungsart: Vorlesung/Übung

Anzeige im Stundenplan: 00.Q+.530

Semesterwochenstunden: 2

Credits: 6,0

Unterrichtssprache: Englisch

Min. | Max. Teilnehmerzahl: - | -

Voraussetzungen / Organisatorisches:
Teilnahmevoraussetzungen
Keine

Anforderungen
Essay

Inhalt:
Machine Learning models, especially based on neural networks, are now part of our everyday life, being deployed in smartphones and embedded systems. Computer Vision algorithms recognize our pets in the photos we take; Natural Language Processing models are able to fix our grammar and even generate human-passing text.

While the new wave of so-called “deep learning” systems displays impressive performance in various tasks, these models are very hard to understand in various ways. To cite one issue among others, the GPT family of models employs 175 billion learnable parameters. When something goes wrong, it is almost impossible to understand “why” a model has made a particular decision.

The technical optimism and excitement around Machine Learning has also pushed businesses to apply it in situations where it may impact people’s well-being directly, such as loan applications, candidate selection for job offers and evaluating the chance of re-offending for people who commited crimes. Computer vision applications based on neural networks have even been employed to judge beauty contests.When we do not fully understand AI algorithms, consequences of their application are hard to predict.In such contexts, opaque models are particularly problematic as there is a concrete risk for discrimination against certain groups of people. Protected characteristics such as gender and ethnicity might be used in the computation of the final decision, which is problematic on moral grounds and unlawful. The existence of the gender pay gap, for example, shows how there are complex correlations between law-protected attributes (gender) and other, non-sensitive attributes such as a person’s yearly salary.

If models are learning from biased data, it follows that they will learn to output biased decisions; if we are unable to explain those decisions, we are left with very little human control over what ultimately is a software process. On top of being philosophically troubling and unethical, recent legislation might see these methodologies as unlawful. Taking the General Data Protection Regulation in the European Union as an example, transparency and fairness have to be guaranteed to individuals who are subject to automatic decision-making software systems.

During this class, you will learn about the current discussion in the AI and ML literature about how to control AI and ML algorithms so that they are trustworthy. We will be focusing on three characteristics especially:

Fairness: how can we learn non-discriminating models from biased data?
Interpretability: how can we make sense of a model’s decisions?
Privacy: how do we make sure that a model trained with our personal data does not leak our data to third parties?

Empfohlene Literatur:
For students without any exposition to AI/machine learning/data mining, the following introductory texts are recommended:

Russell, Norvig. AI: A Modern Approach. 3rd Edition. Chapters 1 and 18.1, 18.2, 18.3.

Alexandre Gefen, Léa Saint-Raymond, Tommaso Venturini. AI for Digital Humanities and Computational Social Sciences. Available on HAL.

Stanford Encyclopedia of Philosophy, AI https://plato.stanford.edu/entries/artificial-intelligence/ and Ethics of AI https://plato.stanford.edu/entries/ethics-ai/

Termine
Datum Von Bis Raum Lehrende/r
1 Di, 25. Okt. 2022 10:15 11:45 04 426 Mattia Cerrato
2 Di, 8. Nov. 2022 10:15 11:45 04 426 Mattia Cerrato
3 Di, 15. Nov. 2022 10:15 11:45 04 426 Mattia Cerrato
4 Di, 22. Nov. 2022 10:15 11:45 04 426 Mattia Cerrato
5 Di, 29. Nov. 2022 10:15 11:45 04 426 Mattia Cerrato
6 Di, 6. Dez. 2022 10:15 11:45 04 426 Mattia Cerrato
7 Di, 13. Dez. 2022 10:15 11:45 04 426 Mattia Cerrato
8 Di, 20. Dez. 2022 10:15 11:45 04 426 Mattia Cerrato
9 Di, 10. Jan. 2023 10:15 11:45 04 426 Mattia Cerrato
10 Di, 17. Jan. 2023 10:15 11:45 04 426 Mattia Cerrato
11 Di, 24. Jan. 2023 10:15 11:45 04 426 Mattia Cerrato
12 Di, 31. Jan. 2023 10:15 11:45 04 426 Mattia Cerrato
13 Di, 7. Feb. 2023 10:15 11:45 04 426 Mattia Cerrato
Veranstaltungseigene Prüfungen
Beschreibung Datum Lehrende/r Pflicht
1. Essay k.Terminbuchung Ja
Übersicht der Kurstermine
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
Lehrende/r
Mattia Cerrato