Jena parsing problem for the Freebase RDF dump (January 2014)

I am trying to parse the dump file freebase-rdf-2014-01-12-00-00.gz (25 GB) using Jena. Jena has reported a lot of bad data. For example, 150.0 is rejected as invalid, and so are the values true and false. I worked around these by adding double quotes around the decimal numbers and around true/false in the dump file. However, Jena still reports problems (the current one is org.apache.jena.riot.RiotException: [line: 161083, col: 110] Illegal object: [MINUS]).

Is there a way to pre-process this data so that I don't have to fix each problem one at a time? My Java code is:

    // Open TDB dataset
    String directory = "D:/test_dump";
    Dataset dataset = TDBFactory.createDataset(directory);

    // Assume we want the default model, or we could get a named model here
    Model tdb = dataset.getDefaultModel();

    // Read the input file - only needs to be done once
    String source = "D:/test_dump/fixed-freebase-second-rdf.gz";
    FileManager.get().readModel( tdb, source, "N-TRIPLES" ); 
+4
2 answers

The data is Turtle, not N-Triples. (For example, true stands for "true"^^xsd:boolean and -27 for "-27"^^xsd:integer.)

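There are other problems as well, some involving the characters $ and \, which also have to be fixed before the dump parses as RDF.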
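
The shorthand rule mentioned above can be sketched in a few lines of Java. This is only an illustration, not part of Jena: the class name TurtleShorthand and the method expand are hypothetical, and the patterns are my reading of the INTEGER, DECIMAL and DOUBLE tokens in the Turtle grammar. It shows which typed literal a bare token stands for when the file is read as Turtle.

```java
import java.util.regex.Pattern;

/** Hypothetical helper: maps Turtle shorthand literals to their explicit typed form. */
final class TurtleShorthand {
    // Patterns modelled on the INTEGER, DECIMAL and DOUBLE tokens of the Turtle grammar.
    private static final Pattern INTEGER = Pattern.compile("[+-]?[0-9]+");
    private static final Pattern DECIMAL = Pattern.compile("[+-]?[0-9]*\\.[0-9]+");
    private static final Pattern DOUBLE = Pattern.compile(
            "[+-]?([0-9]+\\.[0-9]*|\\.[0-9]+|[0-9]+)[eE][+-]?[0-9]+");

    /** Returns the typed literal a bare token stands for, or the token unchanged. */
    static String expand(String token) {
        if (token.equals("true") || token.equals("false")) {
            return typed(token, "boolean");
        }
        if (INTEGER.matcher(token).matches()) return typed(token, "integer");
        if (DECIMAL.matcher(token).matches()) return typed(token, "decimal");
        if (DOUBLE.matcher(token).matches()) return typed(token, "double");
        return token; // IRIs, quoted literals, blank nodes and anything else
    }

    // Uses the xsd: prefix for readability; strict N-Triples needs the full datatype IRI.
    private static String typed(String lexical, String xsdType) {
        return "\"" + lexical + "\"^^xsd:" + xsdType;
    }
}
```

With this sketch, expand("-27") gives "-27"^^xsd:integer and expand("150.0") gives "150.0"^^xsd:decimal, which is why a parser in Turtle mode accepts values that a strict N-Triples parser rejects.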

+4

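As noted on answers.semanticweb.com, in the question asking whether the Freebase RDF dump is in W3 N-Triples format, the data is not actually N-Triples. For example, true is shorthand for "true"^^xsd:boolean.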

This is not legal N-Triples:

    <http://rdf.freebase.com/ns/g.11vjz1ynm> <http://rdf.freebase.com/ns/measurement_unit.dated_percentage.rate> 4.5 .

That is Notation 3 (N3) or Turtle, not N-Triples. Freebase developer Shawn Simister addressed the format in a message from 2013 (dated the 29th), describing the Freebase RDF dump in terms of both N-Triples and Turtle.

A later message (dated the 31st, 2013) mentions "true"^^xsd:boolean and "false"^^xsd:boolean in connection with N-Triples.

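Be careful with the workaround of adding quotes, which turns true into "true". In RDF, the plain literal "true" is not the same as "true"^^xsd:boolean, which is what a bare true stands for. The better fix is to read the file as Turtle or N3.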
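
If pre-processing is still wanted, a rewrite that keeps the datatype is safer than plain quoting. The following is a minimal sketch under stated assumptions: the class BooleanObjectFixer and its method fixLine are hypothetical names, and it assumes one triple per line, whitespace-separated, ending in a " ." terminator. It handles only bare true/false; numbers would need the same treatment with their own datatypes.

```java
/** Hypothetical helper: rewrites bare boolean objects as typed literals valid in N-Triples. */
final class BooleanObjectFixer {
    private static final String XSD_BOOLEAN =
            "<http://www.w3.org/2001/XMLSchema#boolean>";

    /**
     * Assumes one triple per line, terminated by " .".
     * Lines whose object is not a bare true/false are returned unchanged.
     */
    static String fixLine(String line) {
        String trimmed = line.trim();
        if (!trimmed.endsWith(".")) return line;
        // Drop the terminating dot, then isolate the last whitespace-separated token.
        String body = trimmed.substring(0, trimmed.length() - 1).trim();
        int cut = lastWhitespace(body);
        if (cut < 0) return line;
        String object = body.substring(cut + 1);
        if (!object.equals("true") && !object.equals("false")) return line;
        return body.substring(0, cut + 1) + "\"" + object + "\"^^" + XSD_BOOLEAN + " .";
    }

    // Index of the last whitespace character in s, or -1 if there is none.
    private static int lastWhitespace(String s) {
        for (int i = s.length() - 1; i >= 0; i--) {
            if (Character.isWhitespace(s.charAt(i))) return i;
        }
        return -1;
    }
}
```

For a 25 GB dump, fixLine could be applied line by line while streaming through java.util.zip.GZIPInputStream and GZIPOutputStream, so the file never has to be unpacked or held in memory.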

+2
