Library

Filter
156 results
  • RH-thumbnail-04

    Using LLMs to Bridge the Gaps in QA Test Plans at Firefox

    Jan. 16, 2026
    Suhaib Mujahid, Marco Castelluccio, John Pangas, Ahmad Abdellatif
    Generative AI / Open source

    The study explores using LLMs to automatically generate test plans for Firefox features, aiming to reduce the manual effort and blind spots in traditional QA planning.

  • RH-thumbnail-04

    Automated Generation of Issue-Reproducing Tests by Combining LLMs and Search-Based Testing

    Jan. 16, 2026
    Alberto Bacchelli, Marco Castelluccio, Konstantinos Kitsios
    Generative AI / Open source

    The paper introduces BLAST, a tool that combines large language models with search-based software testing to automatically generate issue-reproducing tests from issue-patch pairs, addressing the common absence of such tests in practice.

  • RH-thumbnail-04

    A Comparison of Conversational Models and Humans in Answering Technical Questions: the Firefox Case

    Jan. 16, 2026
    Marco Castelluccio, Daniel Coutinho, Anita Sarma, Marco Gerosa, Caio Barbosa, Alessandro Garcia, Igor Steinmacher, Joao Correia
    Generative AI / Open source

    The study evaluates Retrieval-Augmented Generation (RAG) with LLMs for assisting Mozilla Firefox developers by comparing human responses, standard LLM, and RAG-enhanced LLM on real developer queries.

  • RH-thumbnail-04

    A Dataset of Performance Measurements and Alerts from Mozilla (Data Artifact)

    Jan. 16, 2026
    Diego Elias Costa, Suhaib Mujahid, Marco Castelluccio, Gregory Mierzwinski, Mohamed Bilel Besbes
    Open source

    The paper introduces a publicly available dataset from Mozilla Firefox that addresses the lack of real-world data for studying performance regressions, combining performance measurements, expert-validated alerts, and rich metadata.

  • RH-thumbnail-04

    Impact of LLM-based review comment generation in practice: A mixed open-/closed-source user study

    Jan. 16, 2026
    Suhaib Mujahid, Marco Castelluccio, Doriane Olewicki, Benjamin Mah, Leuson Da Silva, Arezou Amini, Sarra Habchi, Bram Adams, Foutse Khomh
    Generative AI / Open source

    The study evaluates RevMate, an LLM-based code review assistant, through a large-scale live user study at Mozilla and Ubisoft, analyzing over 587 patch reviews.

  • Dataset Convening logo

    Towards Best Practices for Open Datasets for LLM Training

    Jan. 13, 2025
    Stefan Baack, Stella Biderman, Aviya Skowron, Kasia Odrozek
    Openness and AI / AI fairness, accountability, and transparency

    Building on community insights from 30 AI dataset experts, this research paper distills best practices for creating open datasets for LLM training. The paper is a collaboration between Mozilla and EleutherAI.

  • From screen to skin

    From Skin to Screen Bodily Intgrity in the Digital Age

    Nov. 20, 2024
    Júlia Keserű

    Research examines the collection of “body-centric data” which has experienced a dramatic surge since the COVID-19 pandemic and the rise of sophisticated AI tools.

  • Page from Incomplete Chronicles. AIMZ report.revised_0(1)

    Incomplete Chronicles: Unveiling Data Bias in Maternal Health

    Nov. 18, 2024
    Min’enhle Ncube

    An ethnographic research into the datasets powering maternal healthcare app “DawaMom” used across Zambia and other Southern African countries

  • Public AI cover photo

    Public AI

    Sept. 30, 2024
    Mark Surman, Nik Marda, Jasmine Sun
    Responsible technology

    A vision for a robust ecosystem of initiatives that promote public goods, public orientation, and public use throughout every step of AI development and deployment.

  • RH-thumbnail-03

    Unveiling the Potential of a Conversational Agent in Developer Support: Insights from Mozilla’s PDF.js Project

    July 10, 2024
    Marco Castelluccio
    Open source

    This paper presents an investigation into whether AI can be leveraged to assist developers and guide open source community members. It introduces DevMentorAI, an LLM-based tool that uses a RAG approach to answer developer questions, which is then evaluated with a case study on Mozilla's PDF.js proje

Filter