| 55 | |
| 56 | In the end I used Pellet 2.0.1 to check whether the RDF served by uniprot.org is consistent with our OWL file. |
| 57 | |
| 58 | ./pellet.sh consistency "http://purl.uniprot.org/uniprot/P12345" "ftp://ftp.uniprot.org//pub/databases/uniprot/current_release/rdf/core.owl" |
| 59 | |
| 60 | This found many errors, most of them datatype errors. For example, "2009-11" (year-month) was rejected as not being a valid ISO 8601 date value. I changed these to "P2009Y11M", which is a valid period encoding. |
| 61 | The OWL now has a comment describing what is meant here and why we use this. |
| 62 | |
| 63 | For uniprot.org we now also have a script that checks 1000 entries for correctness. |
| 64 | {{{ |
| 65 | #!/bin/bash |
| 66 | cd "$PELLETDIR"  # run from the directory that contains pellet.sh |
| 67 | LIMIT=1000 |
| 68 | UNIPROT="http://localhost:8080" |
| 69 | for ac in $(curl "$UNIPROT/uniprot/?query=*&format=list&limit=$LIMIT"); do |
| 70 | echo "$ac" |
| 71 | curl "$UNIPROT/uniprot/$ac.rdf" -o "$ac.rdf"  # download the entry as RDF |
| 72 | ./pellet.sh consistency -v "$ac.rdf" ~/Documents/workspace/uniprot-rdf/src/core.owl |
| 73 | rm "$ac.rdf" |
| 74 | done}}} |
| 75 | |
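The year-month rewrite mentioned earlier ("2009-11" becoming "P2009Y11M") could be sketched as a small shell helper. This is only an illustration: the function name `to_period` is hypothetical and not part of the UniProt tooling, and keeping the month digits exactly as given (so "2009-03" would become "P2009Y03M") is an assumption.

```shell
#!/bin/bash
# Hypothetical helper (not from the UniProt code base): rewrite a "YYYY-MM"
# literal into the "PnYnM" period form described above.
to_period() {
  local value="$1"
  # Four-digit year, a dash, then a month from 01 to 12.
  local re='^([0-9]{4})-(0[1-9]|1[0-2])$'
  if [[ "$value" =~ $re ]]; then
    # Assumption: year and month digits are copied through unchanged.
    echo "P${BASH_REMATCH[1]}Y${BASH_REMATCH[2]}M"
  else
    echo "not a year-month value: $value" >&2
    return 1
  fi
}
```

Calling `to_period 2009-11` prints `P2009Y11M`; anything that is not a four-digit year followed by a month from 01 to 12 produces an error message on stderr and a non-zero return code.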