Once the sample/repertoire metadata has been gathered and the sequences have been annotated, it is time to "publish" the data. In the case of iReceptor, this means loading the data into the iReceptor Public Archive (IPA). Both the sample/repertoire metadata is loaded as are all of the sequence annotations. We use the iReceptor Data Loading and Data Curation software and standards to load data into the IPA.
By loading the data into an AIRR compliant repository such as the IPA, the data can be searched and downloaded. In the case of the IPA and other AIRR Compliant repositories, the iReceptor Scientific Gateway can be used to search and federate data.
The iReceptor team manages the iReceptore Data Provenance web page for all of the data in the IPA and its constiuent repositories (the IPA is a "cluster" of repositories). For each repository in the IPA, the provenance page tracks the loading of studies and documents any changes to the data. Such a provenance process is important from a science reproducibility perspective. We want to provide enough information about each data set in the IPA such that a researcher can understand how the data was produced (this curation documentation) as well as any changes that may have occured to the data over time (data provenance documentation).