Link Search Menu Expand Document
AI Alliance Banner
Join Our Initiative   Browse the Datasets   Contribute a New Dataset

Contribute Your Dataset!

NOTE: Be sure to read the Dataset Specification details before proceeding. If you have questions or concerns about the specification, please contact us.

Contribution means adding your dataset to our catalog. You can optionally donate the dataset to the Alliance, where we take ownership of a copy at the time of donation and we host it ourselves. Otherwise, you continue to own and host the dataset.

The Contribution Process

The process follows these steps:

  1. Prepare your contribution: Make sure you meet the Dataset Specification and prepare the dataset card.
  2. Complete the contribution form: Use the form below to submit your dataset for consideration.
  3. Receive feedback from us: After we evaluate the submission, we will provide feedback and request clarifications, where needed.
  4. Transfer the data: (Optional) Once your contribution is accepted, you can transfer the data to be hosted in The AI Alliance Hugging Face space or you can continue to host it yourself, for example, in your own Hugging Face space.
  5. Review your submission details: After publication in our catalog, verify that the imformation about your dataset is correct. Your dataset will be listed on this website’s Catalog page and also listed in the Open Trusted Data Initiative catalog in the AI Alliance Hugging Face space.

License

The Open Trusted Data Initiative is focused on obtaining datasets from submitters who either own or have a unrestricted, free-to-use license from all owners of data included in the dataset. By contributing a dataset to the Initative, you affirm that with respect to the dataset and all of its data, you are either (1) the owner or (2) you have been granted a license by all owner(s) of the data enabling you to license it to others under an acceptable open license, which gives anyone the right to use, modify, copy, and create derivative works of the data and dataset, among other things. Do not contribute any data that was obtained merely by collecting publicly-visible data from the Internet or from other sources that you do not own or to which you do not have a suitable license.

We prefer the Community Data License Agreement - Permissive, Version 2.0 although The Creative Commons License, Version 4.0 - CC BY 4.0 is also sometimes used.

By contributing the dataset to the Initiative, you grant anyone a license to the dataset and its data under the Developer Certificate of Origin, Version 1.1 (see also our community contributors page). This does not affect your ownership, copyrights and other interests, and rights to and title to the dataset and its data.

Contribute Your Dataset

Use this form to tell us about your dataset. We will follow up with next steps. Note that some of the fields are also in the dataset card you are asked to submit.

Contributions will be accepted soon!
Contact us at data@thealliance.ai for more information.
Leave blank if the location README is the dataset card.
I want the AI Alliance to host this dataset.
  I agree to the terms for contribution.