FlyBase is pleased to announce our participation in the Amazon Web Services (AWS) Open Data Sponsorship Program. Through this program, AWS sponsors the storage and data transfer costs for our bulk data repository, enabling us to preserve and provide broader access to nearly two decades of FlyBase archive releases.
What data is available?
The FlyBase S3 bucket (s3ftp.flybase.org) hosts over 4 terabytes of data, including:
- All FlyBase releases from FB2006_01 through the current release
- Genome sequences and annotations for Drosophila melanogaster and other Drosophila species
- Bulk data files (gene summaries, alleles, stocks, ontologies, and more)
- Precomputed reports and historical datasets
This archive is particularly valuable for researchers who need to reproduce analyses from older publications or compare data across release versions.
How to access the data
FlyBase data can be accessed directly via S3, or browsed via HTTPS at
https://s3ftp.flybase.org/. For detailed instructions on downloading bulk data, see our Downloads Overview:
https://wiki.flybase.org/wiki/FlyBase:Downloads_Overview About the AWS Open Data Sponsorship Program
The AWS Open Data Sponsorship Program covers storage and data transfer costs for publicly available, high-value datasets. The program aims to democratize access to data by making it available for analysis in the cloud. FlyBase data is now listed on the Registry of Open Data on AWS alongside other scientific datasets:
https://registry.opendata.aws/ We are grateful to AWS for their generous sponsorship of this data repository, which helps ensure long-term access to FlyBase data for the Drosophila research community.