Version 15 (modified by erick.antezana, 14 years ago)

--

 GFF3 is a tab-delimited, ontology-aware format of genomic features. It should be beneficial to represent such genomic features in RDF so that we can exchange data and make queries with SW technologies.

Participants

  • Erick Antezana
  • Alberto Labarga
  • Yu Lin
  • Hideya KAWAJI
  • Venkata Satagopam
  • Jerven Bolleman
  • ...

Scope

Mapping GFF2RDF (proposal)

The following table is in a very inmature state...

GFF Element RDF (XML) Description
Column 1: "seqid" <gff:seqid rdf:about="#ctg123"> ?
Column 2: "source" <gff:source>1000</gff:source> ?
Column 3: "type" <gff:type rdf:about="#SO:0000704"> ?
Column 4: "start" <gff:start>1000</gff:start> ?
Column 5: "end" <gff:stop>9000</gff:stop> ?
Column 6: "score" <gff:score>5.8e-42</gff:score> ?
Column 7: "strand" <gff:strand>+</gff:strand> ?
Column 8: "phase" <gff:phase>.</gff:phase> ?
Column 9: "attributes" <gff:attributes><rdf:Description>...</rdf:Description></gff:attributes> ?

Tools

Discussion

  • application to general genomic features (BED, etc)?
  • Genomic coordinate system (0-based / 1-based)
  • Dasty?

Milestones