- Notifications
You must be signed in to change notification settings - Fork0
Extracting human gene families from HGNC
License
NotificationsYou must be signed in to change notification settings
dhimmel/hgnc
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
This repository processes thegene family data from HGNC. In the future, the repository may expand its scope to process other types of HGNC data.
1.download.ipynb
downloads HGNC data. Check this notebook to see the last modified dates of downloaded files.2.families.ipynb
constructs the gene family ontology innetworkx
. Annotates gene families with their corresponding Entrez Gene IDs. Gene membership in a family is propagated, e.g. genes belonging to the "Glutamate metabotropic receptors" family also belong to the "Glutamate receptors" family.
download
contains unmodified downloads from the EBI FTP site.data
contains generated datasets.families.graphml
contains a GraphML-formatted network of the HGNC gene family ontology.gene-families.tsv
contains the mapping between gene families and Entrez genes.
Have a question? Submit all feedback or questions viaGitHub issues!