Version 16 (modified by linyu, 15 years ago)
GFF3 is a tab-delimited, ontology-aware format for describing genomic features. Representing these features in RDF would let us exchange the data and query it with Semantic Web (SW) technologies.
Participants
- Erick Antezana
- Alberto Labarga
- Yu Lin
- Hideya KAWAJI
- Venkata Satagopam
- Jerven Bolleman
- ...
Scope
Mapping GFF2RDF (proposal)
The following table is in a very immature state...
GFF Element | RDF (XML) | Description |
Column 1: "seqid" | <gff:seqid rdf:about="#ctg123"> | ? |
Column 2: "source" | <gff:source>.</gff:source> | ? |
Column 3: "type" | <gff:type rdf:about="#SO:0000704"> | ? |
Column 4: "start" | <gff:start>1000</gff:start> | ? |
Column 5: "end" | <gff:stop>9000</gff:stop> | ? |
Column 6: "score" | <gff:score>5.8e-42</gff:score> | ? |
Column 7: "strand" | <gff:strand>+</gff:strand> | ? |
Column 8: "phase" | <gff:phase>.</gff:phase> | ? |
Column 9: "attributes" | <gff:attributes><rdf:Description>...</rdf:Description></gff:attributes> | ? |
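The column mapping above can be sketched in a few lines of Python. This is only an illustration of the proposal, not an agreed implementation: the namespace `http://example.org/gff#`, the function names, and the choice of the `ID` attribute as the subject are all assumptions made for the example. Comma-separated multi-value attributes are not split here.

```python
from urllib.parse import unquote

# Hypothetical namespace; the proposal does not define one yet.
GFF_NS = "http://example.org/gff#"

# Property names follow the table above; note that column 5 ("end") maps to gff:stop.
COLUMNS = ["seqid", "source", "type", "start", "stop", "score", "strand", "phase"]


def parse_attributes(field):
    """Split column 9 ('key=value;key=value') into a dict, decoding %XX escapes."""
    attrs = {}
    if field == ".":
        return attrs
    for pair in field.split(";"):
        if not pair:
            continue
        key, _, value = pair.partition("=")
        attrs[unquote(key)] = unquote(value)
    return attrs


def gff_line_to_triples(line):
    """Return (subject, predicate, object) string triples for one GFF3 feature line."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 9:
        raise ValueError("a GFF3 feature line has exactly 9 tab-separated columns")
    attrs = parse_attributes(fields[8])
    # Assumption: use the ID attribute as the subject, else a placeholder local name.
    subject = "#" + attrs.get("ID", "feature")
    triples = []
    for name, value in zip(COLUMNS, fields[:8]):
        if value == ".":  # "." marks an undefined value in GFF3
            continue
        triples.append((subject, GFF_NS + name, value))
    for key, value in attrs.items():
        triples.append((subject, GFF_NS + key, value))
    return triples


if __name__ == "__main__":
    example = "ctg123\t.\tgene\t1000\t9000\t.\t+\t.\tID=gene00001;Name=EDEN"
    for triple in gff_line_to_triples(example):
        print(triple)
```

Undefined columns (".") are skipped rather than emitted as literals; whether the final mapping should do the same is an open question for the table above.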
Tools
- GFF to OWL (Source code not available yet?)
- Chris Mungall's code
Discussion
- Application to other genomic feature formats (BED, etc.)?
- Genomic coordinate system (0-based / 1-based)
- Dasty?
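On the coordinate system item: GFF3 uses 1-based, inclusive start and end positions, while BED uses 0-based, half-open intervals, so any shared RDF representation has to pick one convention or record which one applies. A minimal sketch of the conversion follows; the function names are made up for this example, and zero-length intervals are deliberately rejected to keep it simple.

```python
def gff_to_bed(start, end):
    """Convert a 1-based inclusive GFF3 interval to a 0-based half-open BED interval."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return start - 1, end


def bed_to_gff(chrom_start, chrom_end):
    """Convert a 0-based half-open BED interval to a 1-based inclusive GFF3 interval."""
    if chrom_start < 0 or chrom_end <= chrom_start:
        raise ValueError("expected 0 <= chrom_start < chrom_end")
    return chrom_start + 1, chrom_end


if __name__ == "__main__":
    bed_start, bed_end = gff_to_bed(1000, 9000)
    # The feature length is the same under both conventions.
    assert bed_end - bed_start == 9000 - 1000 + 1
    print(bed_start, bed_end)
```

Only the start position changes between the two conventions; the end value is numerically identical, which makes off-by-one errors easy to miss when data from both formats is mixed.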
Milestones