- Notifications
You must be signed in to change notification settings - Fork4
The AgBioData GFF3 working group has developed recommendations to solve common problems in the GFF3 format. We suggest improvements for each of the GFF3 fields, as well as the special cases of modeling functional annotations, and standard protein-coding genes. We welcome discussion of these recommendations from the larger community.
License
NAL-i5K/AgBioData_GFF3_recommendation
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Members
- AgBioData: Ethalinda Cannon, Andrew Farmer, Zhiliang Hu, Rex Nelson, Monica Poelchau, Surya Saha
- Alliance of Genome Resources: Scott Cain, Nathan Dunn, Sierra Moxon, Rob Buels
- NCBI: Vamsi Kodali, Terence Murphy
The GFF3 format is a common, flexible tab-delimited format representing the structure and function of genes or other mapped features. However, with increasing re-use of annotation data, this flexibility has become an obstacle for standardized downstream processing. Common software packages that export annotations in GFF3 format model the same data and metadata in different notations, which puts the burden on end-users to interpret the data model.
The AgBioData consortium is a group of genomics, genetics and breeding databases and partners working towards shared practices and standards. Providing concrete guidelines for generating GFF3, and creating a standard representation of the most common biological data types would provide a major increase in efficiency for AgBioData databases and the genomics research community that use the GFF3 format in their daily operations.
The AgBioData GFF3 working group has developedrecommendationsto solve common problems in the GFF3 format. We suggest improvements for each of the GFF3 fields, as well as the special cases of modeling functional annotations, and standard protein-coding genes. We welcome discussion of theserecommendations from the larger community.
References
- Sequence Ontology gff3 specifications
- AgBiodata consortium
- GFF3 recommendation google doc (comment access)
- NCBI Genbank genomes gff3 specifications
- NCBI Genbank gff3 documentation
GFF3 working group goals
Ultimate goal - to use a GFF3 file from any software or any database in downstream processing tools or applications (e.g. VEP, Tripal, Apollo, ...) WITHOUT having to modify it
1.Databases and software export their GFF3 files in (a) standard way(s)
2.Databases and software know how to import standard information from a GFF3
Summary ofRecommendations
For each column and reserved attribute, we provide the following results from our discussions:
- Change level: The level of change relative to the SO specification. Values are 'No change', 'Recommendation only','minor', 'moderate', 'major'
- Summary: A summary of the GFF3 working group's findings.
- Proposed changes to specification: A list of the proposed changes to the SO specification.
- Rationale: The rationale behind these changes.
- Best Practices: Recommended best practices for this field.
- Validation: How software would validate whether the field is used correctly.
- Example: An example implementation of the field.
Types of changes that we recommend:
We primarily have recommendations on how to
- Interpret the specification
- Model standard data types
Feedback: Please fileissues and tag the relevant contact person.
Acknowledgements
We thank Margaret Woodhouse for the inspiration for this working group, and many other contributors from the larger research community who have provided feedback on earlier versions
About
The AgBioData GFF3 working group has developed recommendations to solve common problems in the GFF3 format. We suggest improvements for each of the GFF3 fields, as well as the special cases of modeling functional annotations, and standard protein-coding genes. We welcome discussion of these recommendations from the larger community.
Topics
Resources
License
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Contributors5
Uh oh!
There was an error while loading.Please reload this page.