
Project Tapestry: Technical Website

Welcome to the technical website for the AI Alliance's Project Tapestry. It presents content from the project's technical documentation and code repository.

For a general introduction to Project Tapestry, including its motivations and goals, see the Tapestry page on the AI Alliance website.


The AI Alliance launched Project Tapestry to build a collaborative foundation for open and sovereign AI. Project Tapestry is an open-source platform designed to enable globally federated development of frontier, open models while preserving sovereignty, local control, and long-term independence.

Why this matters: People will not use models that underperform on their language, legal context, and domain knowledge. Countries, enterprises, and individuals also need AI infrastructure they own and control, with guaranteed data residency, the right to exit, and the ability to operate independently. Tapestry addresses both problems simultaneously: sovereignty is the performance strategy, not a trade-off against it.

  • Join Us! We are looking for collaborators. See our contributing page for details.
  • Use the search box at the top of this page to find specific content.
  • The links for Capitalized Terms go to this glossary. Tapestry-specific terms (e.g., Consortium training, Shared-Base Loop, Sovereign Build) are defined in the in-repo glossary.

This website is for technical contributors. As Project Tapestry evolves, this website will provide links to technical requirements, architecture and design documentation, and implementation source code.

Contribute to Our First Work Streams

Project Tapestry has big plans, and we’re starting with some fundamental building blocks.

  • LLM Cultural Alignment and Re-alignment - help us develop techniques for cultural alignment, initially using the Inglehart–Welzel Cultural Map as a metric. This work stream will implement a corresponding evaluation and run tuning experiments to understand how to shift alignment without compromising general model performance. Prior expertise in evaluation and tuning technologies is especially welcome.
  • Consortium Training - Tapestry’s approach to global model development relies on a balance between centralized and distributed training that preserves the use and privacy requirements for data sets. Help us adapt and develop optimal techniques with ideas from both federated learning and the latest LLM pre-training and post-training methods. Prior expertise in large-scale LLM training, distributed infrastructure, and federated learning is especially welcome.
  • Global Training Data Corpus - a core thesis of Project Tapestry is that bringing together a much more diverse set of data can provide a path to a better frontier base model for all. What unique datasets exist that could be brought to Tapestry model training? They don’t have to be fully open; we will work with you to define and enforce appropriate requirements.
  • Tapestry Model Development Roadmap - coming soon - we want your input!
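
The Consortium Training work stream above mentions ideas from federated learning. As a rough illustration of one such idea, and not a description of Tapestry's actual training method, the sketch below shows federated averaging: each participant trains locally and shares only model weights, which are combined in proportion to the number of local examples. The function name `federated_average`, the weight layout, and the two example sites are all hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical weight layout: parameter name -> flat list of floats.
Weights = Dict[str, List[float]]


def federated_average(updates: List[Tuple[Weights, int]]) -> Weights:
    """Average participant weights, weighted by each participant's example count."""
    if not updates:
        raise ValueError("need at least one participant update")
    total = sum(count for _, count in updates)
    if total <= 0:
        raise ValueError("total example count must be positive")

    first_weights, _ = updates[0]
    averaged: Weights = {}
    for name, values in first_weights.items():
        acc = [0.0] * len(values)
        for weights, count in updates:
            # Each participant contributes in proportion to its data size.
            for i, value in enumerate(weights[name]):
                acc[i] += value * count / total
        averaged[name] = acc
    return averaged


# Two hypothetical participants; raw training data never leaves either site.
site_a = ({"layer": [1.0, 2.0]}, 100)
site_b = ({"layer": [3.0, 4.0]}, 300)
global_weights = federated_average([site_a, site_b])
print(global_weights)  # {'layer': [2.5, 3.5]}
```

Plain averaging like this does not by itself satisfy privacy requirements, since shared weights can leak information about local data; it only illustrates the data-stays-local pattern.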

Project Tapestry Work Groups

Tapestry is designed with data sovereignty requirements first and foremost. This leads to new approaches for distributed model training to build world-class foundation models, as well as support for tuning domain-specific models using sensitive data under carefully governed access.

The following work groups are provisional. Participation is welcome!

| Work Group | Focus |
| --- | --- |
| Base Model Training | Own the shared model capability path: selecting or adopting an initial open-weights base, defining how consortium training improves shared weights, and planning the transition toward consortium-owned base models when the project has sufficient compute, data, and operational maturity. |
| Data Governance | Define how sovereign data can participate in Tapestry without surrendering control. This group owns data sourcing, licensing, stewardship, residency constraints, provenance, contribution rights, and data-quality expectations for national, cultural, industrial, and institutional participants. |
| Deployment and Adoption | Ensure Tapestry-derived models become usable systems, not just trained weights. This group owns serving patterns, product harnesses, integration guidance, participant rollout, developer experience, and adoption feedback loops. |
| Evaluation Certification | Define the evidence that Tapestry models, pipelines, and participants must produce before claims of capability, sovereignty, cultural alignment, safety, or certification are accepted. |
| Governance and Participation | Translate Tapestry’s governance principles into operating mechanics for work groups, participants, contributions, decisions, certification processes, and anti-capture safeguards. |
| Infrastructure and Operations | Own the platform and operating model that lets participants run Tapestry workloads across heterogeneous compute, networks, security regimes, and organizational boundaries. |
| Security and Privacy | Define the technical guarantees that make Tapestry sovereignty enforceable: privacy tiers, secure aggregation, differential privacy, trusted execution, threat models, model-update leakage analysis, and safety-preservation constraints. |
| Sovereign Alignment | Own the participant-specific pipeline that turns a shared capable base into models that reflect local knowledge, values, institutions, domains, and interaction norms. This includes culturally grounded continued pretraining, post-training alignment, instruction tuning, and portability of sovereign contributions. |
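
The Security and Privacy group lists secure aggregation among its techniques. As a toy illustration of the core idea only, and not Tapestry's protocol, the sketch below uses pairwise additive masking: every pair of participants shares a random mask that one adds and the other subtracts, so an aggregator sees only masked updates while their sum equals the true sum. The names `pairwise_masks`, `mask_update`, and `aggregate` are hypothetical, updates are assumed to be quantized to non-negative integers, and the single seeded generator stands in for the pairwise key agreement a real protocol would use.

```python
import random
from typing import Dict, List

MODULUS = 2**32  # all arithmetic is done modulo this value


def pairwise_masks(ids: List[str], length: int, seed: int) -> Dict[str, List[int]]:
    """Build one mask per participant such that all masks sum to zero mod MODULUS."""
    # Assumption: a single seeded RNG replaces per-pair secret key agreement.
    rng = random.Random(seed)
    masks = {pid: [0] * length for pid in ids}
    for i, a in enumerate(ids):
        for b in ids[i + 1:]:
            shared = [rng.randrange(MODULUS) for _ in range(length)]
            for k in range(length):
                # Participant a adds the shared mask, participant b subtracts it.
                masks[a][k] = (masks[a][k] + shared[k]) % MODULUS
                masks[b][k] = (masks[b][k] - shared[k]) % MODULUS
    return masks


def mask_update(update: List[int], mask: List[int]) -> List[int]:
    """Hide an integer update by adding the participant's mask."""
    return [(u + m) % MODULUS for u, m in zip(update, mask)]


def aggregate(masked_updates: List[List[int]]) -> List[int]:
    """Sum masked updates element-wise; the pairwise masks cancel in the total."""
    length = len(masked_updates[0])
    return [sum(u[k] for u in masked_updates) % MODULUS for k in range(length)]


# Three hypothetical participants with small integer updates.
updates = {"a": [1, 2], "b": [10, 20], "c": [100, 200]}
masks = pairwise_masks(list(updates), length=2, seed=0)
masked = {pid: mask_update(updates[pid], masks[pid]) for pid in updates}
total = aggregate(list(masked.values()))
print(total)  # [111, 222]
```

A production protocol would also need to handle participants that drop out mid-round, which this sketch ignores.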

Other Technical Documentation

The rest of the technical documentation is currently maintained in the project repository docs:

Some additional links.

