News & Notes > Dryad partnership brings easier data sharing to bioRxiv
Dryad partnership brings easier data sharing to bioRxiv
bioRxiv | 2025-09-18
Open data is essential for reproducible science, but it's important it is curated, discoverable and deposited according to FAIR principles to maximize its utility. While bioRxiv has long enabled rapid dissemination of manuscripts and allowed authors to provide links to data elsewhere, pairing preprints with robust, curated datasets ensures that results can be validated, reanalyzed, and extended by the community. The Dryad data repository specializes in hosting data with rich metadata, DOI assignment, and long-term preservation, aligning with the goals of preprint servers like bioRxiv.
By linking Dryad's data publishing platform directly to the bioRxiv submission platform, bioRxiv enables authors to deposit their datasets and associate them with their preprints in a few steps. This helps data publishing become a more integral part of the preprint lifecycle, and readers gain timely access to the data underlying each study.
For preprint authors: a step-by-step guide
If you are submitting a new preprint, you can follow these steps.
- Submit data to Dryad through bioRxiv submissions
As usual, authors should upload their manuscripts, provide titles, abstracts, and select subject categories.
Authors who have already deposited data at Dryad can simply add the data URL to the appropriate field in the bioRxiv submission system.
Authors who have not yet deposited data should complete their bioRxiv submission and then choose "Submit bioRxiv Preprint to External Author Service" from the Author Area, find the relevant manuscript, and select "Dryad" from the dropdown to activate the integrated workflow. Standard Dryad submission costs will apply.
- Automatic metadata transfer
bioRxiv passes core manuscript metadata (i.e., title, author names, affiliations) directly into Dryad's data submission form. This saves time and reduces errors, so authors won't need to retype information already entered during preprint upload.
- Complete the Dryad deposit
Within the same browser window, authors will land on Dryad's upload interface. Here they can:
- Review and enrich metadata (descriptions, keywords, funding sources)
- Upload datasets (up to 2TB, code, and supplementary files)
- Specify any usage licenses
- Complete Dryad's README to allow anyone to easily interpret and reuse the dataset
- Pay the relevant Dryad Publishing Charge if required
- Finalize and publish
Once files are uploaded and metadata added, the author confirms their Dryad deposit. Dryad assigns a unique DOI to the dataset, which then appears as a link on the associated bioRxiv preprint page under “Data/Code”. A green checkmark indicates that the dataset has undergone Dryad's curation process.
- Unified access
Clicking on the Dryad DOI link with verification badge takes the preprint reader to a landing page for the dataset at the Dryad website. This includes downloadable files, structured metadata, and citation instructions. The Dryad dataset page also includes a link back to the bioRxiv preprint in the 'Related Works' section.
Best practices for data sharing
- Prepare the data early: Organize files and draft metadata descriptions before starting a submission; this ensures a swift upload and publication.
- Use standard formats: Wherever possible, deposit data in non-proprietary formats (e.g., CSV, TXT, TIFF) to maximize reuse.
- Write clear metadata: Detailed descriptions, keywords, and methodological notes help other researchers find and understand your data.
- Link methods and code: If the analysis depends on scripts or software, include these alongside raw data to enable exact reproduction.
For more information, visit https://datadryad.org/best_practices.
The Dryad-bioRxiv integration represents another step toward fully open, reproducible preprints. By blending Dryad's data curation with bioRxiv's rapid manuscript sharing, authors can share preprints and data through one process. At the same time, readers benefit from rapid, transparent access to the data underpinning the research. As open science continues to evolve, such integrations will help to advance collaboration and innovation across the life sciences.
