Java API Performance Compared to Python with Cypher for Neo4J

Question

I am working with an application that uses a Neo4J graph containing about 10 million nodes. One of the main tasks I perform daily is a batch import of new or updated nodes into the graph, on the order of 1-2 million nodes. After experimenting with Python scripts in combination with the Cypher query language, I decided to try the embedded graph with the Java API to get better performance.

What I found is roughly a 5x improvement using the Java API. I am using Neo4j 2.1.4, which I believe is the latest version. I have read in other posts that the embedded graph is somewhat faster, but that this should or may change in the near future. Can anyone who has observed similar results confirm my findings?

I have included the snippets below to give a general idea of the methods used - the code has been greatly simplified.

Cypher/Python sample:

cnode = self.graph_db.create(node(hash = obj.hash,
    name = obj.title,
    date_created = str(datetime.datetime.now()),
    date_updated = str(datetime.datetime.now())
))

Embedded graph sample using Java:

final Node n = Graph.graphDb.createNode();
for (final Label label : labels){
    n.addLabel(label);
}
for (Map.Entry<String, Object> entry : properties.entrySet()) {
    n.setProperty(entry.getKey(), entry.getValue());
}

Thanks for any insight!

+4

java python neo4j cypher

Kristen Odamtten Sep 22 '14 at 14:10

3 answers

, . , Python , Java, , .

: Python 12 Java. Python 1 , Java 3 . , 2 /(60 - 12) = 60 , .

, , , 48 , Python . , 60 12 - , .

0

Aaron Digulla Sep 22 '14 at 15:01

" " Java Python 3 (http://benchmarksgame.alioth.debian.org/u32/benchmark.php?test=all&lang=java&lang2=python3&data=u32), 5- Java .

0

Stephen C Sep 22 '14 at 15:04

Nigel Small · Accepted Answer · 2014-09-22T16:44:58+0000

What you are actually doing here is comparing the speeds of two different APIs; the two languages are incidental. You are therefore not comparing like for like. The Java core API and the REST API used by Python (and other languages) have different idioms, such as explicit versus implicit transactions. In addition, the network latency associated with the REST API will make a big difference, especially if you make one HTTP call per node.
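To illustrate the point about one HTTP call per node, here is a minimal sketch of grouping many node creations into a few request bodies. It assumes the transactional Cypher HTTP endpoint of a Neo4j 2.x server at its default local address; the `Item` label, the helper names, and the sample rows are hypothetical, and nothing is actually sent over the network.

```python
import json

# Assumed endpoint of a default local Neo4j 2.x server (not taken from the question).
TX_COMMIT_URL = "http://localhost:7474/db/data/transaction/commit"


def chunked(items, size):
    """Yield consecutive slices of `items` with at most `size` elements."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def build_batch_payload(rows, label="Item"):
    """Build one JSON request body that creates one node per row.

    Each row becomes its own Cypher statement, but all statements in the
    payload travel in a single HTTP request and a single transaction.
    """
    statement = "CREATE (n:%s {props})" % label  # Neo4j 2.x parameter syntax
    return json.dumps({
        "statements": [
            {"statement": statement, "parameters": {"props": row}}
            for row in rows
        ]
    })


# Illustrative data only: 2,500 made-up nodes, at most 1,000 per request.
rows = [{"hash": "h%d" % i, "name": "node %d" % i} for i in range(2500)]
payloads = [build_batch_payload(chunk) for chunk in chunked(rows, 1000)]
print(len(payloads))  # 3 request bodies instead of 2,500 separate calls
```

Each payload would then be POSTed to `TX_COMMIT_URL` with a JSON content type. The batch size of 1,000 is an arbitrary choice for the sketch; a suitable value depends on node size and server memory.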

For a more meaningful comparison, compare like with like: use either the Java REST API or Cypher for both tests.

1: REST, API.

2: API REST , API, , .
