Cloud-native genomic dataframes and batch computing
hail-is/hail
Hail is an open-source, general-purpose, Python-based data analysis tool with additional data types and methods for working with genomic data.
Hail is built to scale and has first-class support for multi-dimensional structured data, like the genomic data in a genome-wide association study (GWAS).
Hail is exposed as a Python library, using primitives for distributed queries and linear algebra implemented in Scala, Spark, and increasingly C++.
See the documentation for more info on using Hail.
Hail has been widely adopted in academia and industry, including as the analysis platform for the genome aggregation database and UK Biobank rapid GWAS. Learn more about Hail-powered science.
If you'd like to discuss or contribute to the development of methods or infrastructure, please:
- see the For Software Developers section of the installation guide for info on compiling Hail
- chat with us about development in our Zulip chatroom
- visit the Development Forum for longer-form discussions
Hail uses a continuous deployment approach to software development, which means we frequently add new features. We announce changes to Hail on the Discussion Forum. We recommend creating an account there so that you can subscribe to these updates.
Hail is maintained by a team in the Neale lab at the Stanley Center for Psychiatric Research of the Broad Institute of MIT and Harvard and the Analytic and Translational Genetics Unit of Massachusetts General Hospital.
Contact the Hail team at hail@broadinstitute.org.
If you use Hail for published work, please cite the software. You can get a citation for the version of Hail you installed by executing:
    import hail as hl
    print(hl.citation())
The output will look like:
Hail Team. Hail 0.2.13-81ab564db2b4. https://github.com/hail-is/hail/releases/tag/0.2.13.
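To make the shape of that citation explicit, here is a minimal sketch in plain Python that rebuilds the example string from a version identifier. The helper name `format_citation` is hypothetical and is not part of Hail. The sketch assumes the version has the form `<release>-<commit hash>`, as in the example above, and that the release URL uses only the release part. For real citations, rely on `hl.citation()`.

```python
def format_citation(version: str) -> str:
    """Build a citation string in the format shown above (illustrative only)."""
    # Assumption: the version looks like "<release>-<commit hash>",
    # e.g. "0.2.13-81ab564db2b4"; the release tag is the part before the dash.
    release = version.split("-", 1)[0]
    url = f"https://github.com/hail-is/hail/releases/tag/{release}"
    return f"Hail Team. Hail {version}. {url}."


if __name__ == "__main__":
    print(format_citation("0.2.13-81ab564db2b4"))
```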
The Hail team has several sources of funding at the Broad Institute:
- The Stanley Center for Psychiatric Research, which together with the Neale Lab has provided an incredibly supportive and stimulating home.
- Principal Investigators Benjamin Neale and Daniel MacArthur, whose scientific leadership has been essential for solving the right problems.
- Jeremy Wertheimer, whose strategic advice and generous philanthropy have been essential for growing the impact of Hail.
We are grateful for generous support from:
- The National Institute of Diabetes and Digestive and Kidney Diseases
- The National Institute of Mental Health
- The National Human Genome Research Institute
- The Chan Zuckerberg Initiative
We would like to thank Zulip for supporting open-source by providing free hosting, and YourKit, LLC for generously providing free licenses for YourKit Java Profiler for open-source development.