| | 55 | |
| | 56 | In the end I used pellet 2.0.1 for validating the RDF of UniProt,org to see if it is valid according to our OWL file. |
| | 57 | |
| | 58 | ./pellet.sh consistency "http://purl.uniprot.org/uniprot/P12345" "ftp://ftp.uniprot.org//pub/databases/uniprot/current_release/rdf/core.owl" |
| | 59 | |
| | 60 | Which found many errors. Most where datatype errors. e.g. "2009-11" Year-Month is not a valid ISO8601 standard. I changed these to "P2009Y11M" which is valid period encoding. |
| | 61 | The OWL now has a comment describing what is meant here and why we use this. |
| | 62 | |
| | 63 | For uniprot.org we now also have a script that checks a 1000 entries for correctness. |
| | 64 | {{{ |
| | 65 | #!/bin/bash |
| | 66 | cd $PELLETDIR |
| | 67 | LIMIT=1000 |
| | 68 | UNIPROT="http://localhost:8080" |
| | 69 | for ac in `curl "$UNIPROT/uniprot/?query=*&format=list&limit=$LIMIT"`; do |
| | 70 | echo $ac |
| | 71 | `curl "$UNIPROT/uniprot/$ac.rdf" -o $ac.rdf`; |
| | 72 | ./pellet.sh consistency -v $ac.rdf ~/Documents/workspace/uniprot-rdf/src/core.owl |
| | 73 | rm $ac.rdf; |
| | 74 | done}}} |
| | 75 | |