Link Search Menu Expand Document
AI Alliance Banner
Browse the Datasets   Contribute a new Dataset!

Contribute Your Dataset!

NOTE: Be sure to read all the Dataset Requirements before proceeding. Make sure you agree to all the requirements described there or contact us if you have questions.

Contribution means adding your dataset to our catalog. You can optionally donate the dataset to the Alliance, where we take ownership of a copy at the time of donation and we host it ourselves. Otherwise, you continue to own and host the dataset.

The Contribution Process

The process follows these steps:

  1. Prepare your contribution: Make sure you meet the Dataset Requirements.
  2. Complete the contribution form: Use the form below to submit your dataset for consideration.
  3. Receive feedback from us: After we evaluate the submission, we will provide feedback and request clarifications, where needed.
  4. Upload the data: (Optional) Once your contribution is accepted, you can transfer the data to be hosted in The AI Alliance Hugging Face space or you can continue to host it yourself, for example, in your own Hugging Face space.
  5. Review your submission details: After publication in our catalog, verify that the imformation about your dataset is correct.

License

The Open Trusted Data Initiative is focused on obtaining datasets from submitters who either own or have a broad license from all owners of data included in the dataset. By contributing a dataset to the Initative, you affirm that with respect to the dataset and all of its data, you are either (1) the owner or (2) you have been granted a license by all owner(s) of the data enabling you to license it to others under the Community Data License Agreement - Permissive, Version 2.0, which gives anyone the right to use, modify, copy, and create derivative works of the data and dataset, among other things. Do not contribute any data that was obtained merely by collecting publicly-visible data from the Internet or from other sources that you do not own or to which you do not have a CDLA or compatible license.

By contributing the dataset to the Initiative, you grant anyone a license to the dataset and its data under the Developer Certificate of Origin, Version 1.1 (see also our community contributors page) and the Community Data License Agreement - Permissive, Version 2.0. This does not affect your ownership, copyrights and other interests, and rights to and title to the dataset and its data.

Now to Contribute Your Dataset

Use this form to tell us about your dataset. We will follow up with next steps. Note that some of the fields are also in the dataset card you are asked to submit.

TIP: See the dataset requirements if you have questions about any of the following fields.

Contributions will be accepted soon!
Contact us at data@thealliance.ai for more information.
Leave blank if the location README is the dataset card.
I want the AI Alliance to host this dataset.
  I agree to the terms for contribution.