Working on Openness and Trust in AI
Wikimedia Enterprise invites you to a special social event at NeurIPS 2025 in San Diego this December. This in-person event will explore the intersections between generative AI data and open, trusted datasets.
Representatives from the Wikimedia Foundation (Enterprise team), MLCommons, and the AI Alliance will share insights into their work using technology to advance academic and social missions. Attendees will hear presentations from each organization, highlighting their goals, current projects, and ongoing challenges around trust and responsible data usage in AI.
We’ll update this post with more detailed information as it becomes available.
Get in touch if you’d like to connect with us during the event.
Expected Agenda
- Introductions: Presentations from Wikimedia Enterprise, AI Alliance’s Open Trusted Data Initiative, and MLCommons, covering each organization’s mission and active projects.
- Open Wikimedia Datasets and Licensing: An overview of how Wikipedia datasets are made available with clear open licenses.
- Open Trusted Data Initiative (AI Alliance): A discussion on how generative AI data flows from source to model and the licensing implications of this new paradigm.
- Building an AI Data Ecosystem with Croissant: A look at how the Croissant data standard supports tool development and improves AI deployment success.
- Q&A and Networking: Engage with presenters, ask questions, and enjoy light refreshments.
Event date/time: TBD
Event Location: San Diego Convention Center
About NeurIPS
The NeurIPS conference was founded in 1987 and is now a multi-track interdisciplinary annual meeting that includes invited talks, demonstrations, symposia, and oral and poster presentations of refereed papers. The conference is accompanied by a professional exposition focusing on machine learning in practice, a series of tutorials, and topical workshops that provide a less formal setting for the exchange of ideas.

Previous years at NeurIPS
2024 Highlights
We attended NeurIPS 2024 in Vancouver, BC, alongside the Common Crawl Foundation. Together we explored the intersections between nonprofit organizations and the tech community within the evolving AI and machine learning ecosystem. The event featured two of the most widely used datasets for training large language models.
— Wikimedia Enterprise Team
Photo Credits
San Diego Convention Center, CC BY-SA 3.0, via Wikimedia Commons