Stefan Baack

Latest research

Towards Best Practices for Open Datasets for LLM Training

January 13, 2025

Openness and AI / AI fairness, accountability, and transparency

Building on community insights from 30 AI dataset experts, this research paper distills best practices for creating open datasets for LLM training. The paper is a collaboration between Mozilla and EleutherAI.

Training Data for the Price of a Sandwich: Common Crawl’s Impact on Generative AI

February 6, 2024

AI bias & discrimination / AI fairness, accountability, and transparency

Mozilla finds that Common Crawl's outsized role in the generative AI boom has improved transparency and competition, but is also contributing to biased and opaque generative AI models.

Internet Health Report 2022

July 18, 2022

Internet health / Internet Health Report / AI fairness, accountability, and transparency

An annual compilation of research and stories explaining what’s key to a healthier internet. In this edition we are narrowing our focus to artificial intelligence.

Browse all projects (7)