| 1 | +--- |
| 2 | +type: regular |
| 3 | +title: Evaluation of generative AI |
| 4 | +subtitle: > |
| 5 | + Overview of Algorithm Audit's expertise pertaining to generative AI. |
| 6 | +image: /images/technical tools/eval-gen-ai/Kader.png |
| 7 | +quick_navigation: |
| 8 | + title: Content overview |
| 9 | + links: |
| 10 | + - title: Introduction |
| 11 | + url: '#intro' |
| 12 | + - title: Validation framework |
| 13 | + url: '#validation-framework' |
| 14 | + - title: Development and usage |
| 15 | + url: '#development-and-usage' |
| 16 | +promo_bar: |
| 17 | + - content: "**\U0001F44B Do you want to learn more about Algorithm Audit's expertise on generative AI? Get in [contact](/en/about/contact).**" |
| 18 | +--- |
| 19 | + |
| 20 | +{{< promo_bar index="0" >}} |
| 21 | + |
| 22 | +<!-- Introduction --> |
| 23 | + |
| 24 | +{{< container_open icon="far fa-file" title="Introduction" id="intro" >}} |
| 25 | + |
| 26 | +Algorithm Audit has broad expertise in evaluating generative AI. We contributed to the development of a validation framework for the responsible use of large language models (LLMs) in public information provision. We also carry out project work for the AI Office of the European Commission to evaluate socio-technical risks of general-purpose AI (GPAI) models. |
| 27 | + |
| 28 | +{{< container_close >}} |
| 29 | + |
| 30 | + |
| 31 | +<!-- Validation framework --> |
| 32 | + |
| 33 | +{{< container_open icon="far fa-check" title="Validation framework" id="validation-framework" >}} |
| 34 | + |
| 35 | +The English version will be available soon. |
| 36 | + |
| 37 | +<!-- {{< embed_pdf url="/pdf-files/technical-tools/UBDT/20260215 Auditing a Dutch Public Sector Risk Profiling Algorithm.pdf" width_mobile_pdf="12" width_desktop_pdf="12" >}} --> |
| 38 | + |
| 39 | +{{< container_close >}} |
| 40 | + |
| 41 | + |
| 42 | +<!-- Development and usage --> |
| 43 | + |
| 44 | +{{< container_open icon="fas fa-terminal" title="Development and usage of the validation framework" id="development-and-usage" >}} |
| 45 | + |
| 46 | +The validation framework 'Responsible use of Large Language Models (LLMs) for public information provision' was developed in collaboration with the Dutch judiciary, Eindhoven University of Technology (TU/e), T&T Data Consultancy and Deloitte. The framework is to a large extent inspired by the LLM pilot [voorrecht-Rechtspraak](https://www.voorrecht-rechtspraak.nl). The project is supported by the program <a href="https://www.sidnfonds.nl/projecten/validatiekader-betrouwbare-llms-voor-publieke-informatievoorziening" target="_blank">‘Responsible AI in de praktijk’</a> of the SIDN Fund and TopSectorICT. |
| 47 | + |
| 48 | +The validation framework is published under the CC BY 4.0 license. |
| 49 | + |
| 50 | +{{< image image1="/images/partner%20logo-cropped/Rechtspraak.png" image2="/images/partner%20logo-cropped/TUEindhoven.svg.png" image3="/images/partner%20logo-cropped/T&TDataConsultancy.png" image4="/images/partner%20logo-cropped/Deloitte.png" width_desktop="3" width_mobile="1" alt1="De Rechtspraak" alt2="Technische Universiteit Eindhoven" alt3="T&T Data Consultancy" alt4="Deloitte" caption1="De Rechtspraak" caption2="Technische Universiteit Eindhoven" caption3="T&T Data Consultancy" caption4="Deloitte" >}} |
| 51 | + |
| 52 | +{{< container_close >}} |