Data Curation | iReceptor

Data curation to Data loading workflow

Data curation is a fundamental part of making scientific data Findable, Accessible, Interoperable, and Reusable (FAIR). This section of the iReceptor web site describes the Data Curation process used by the iReceptor Team when it curates data into the iReceptor Public Archive (IPA). The iReceptor Team has identified a set of valuable Adaptive Immune Response Repertoire (AIRR-seq) studies and has curated that data as a public resource to the AIRR Community. Because these studies are relatively diverse, the curation process around how these studies were processed is critical to the data being reusable. The iReceptor Team's curation process adheres to the MiAIRR and other standards set out by the AIRR Community. This process includes the capturing of information about who processed the data and how the data was processed before loading into an iReceptor repository.

Note that although this curation process applies to data that is found in publicly available papers, much of the curation process can be applied to your own research. If you are curating data into your own repository (either an iReceptor Turnkey repository or your own local repository), this process (starting at Step 4) could be used to ensure that your data can be shared with and compared to other AIRR-seq data..

The data curation pipeline used to curate data into the iReceptor Public Archive, which is accessible through the iReceptor Gateway, is given below: