Skip to content
Nate Jensen edited this page Jul 12, 2018 · 21 revisions

Since Jep 3.3

Java primitive arrays <-> numpy.ndarrays

Jep supports automatic conversion of Java primitive arrays to numpy.ndarrays. For ease and speed of copying the arrays between the two languages, the Java arrays must be one dimensional. This is to ensure a contiguous block of a known size of memory for the conversion. Conceptually the Java array can be n-dimensional with the dimensions argument, but in memory it must be one dimensional.

Sending a Java primitive array into Python as a numpy.ndarray is easy using the class jep.NDArray. For example:

try(Jep jep = new Jep(new JepConfig().addSharedModules("numpy")) {
   float[] f = new float[] { 1.0f, 2.1f, 3.3f, 4.5f, 5.6f, 6.7f };
   NDArray<float[]> nd = new NDArray<>(f, 3, 2);
   jep.set("x", nd);
}

Inside the interpreter, the variable named x will be a numpy.ndarray with shape (3,2) and dtype float32. If a Java method signature contains an NDArray, it will also be transformed when crossing between the languages. For example:

/**
 * If Python invokes this method, it will receive back a numpy.ndarray.
 */
public NDArray<?> getRawData() {
   // presuming this.rawdata is conceptually only 1-dimensional
   return new NDArray(this.rawdata, this.rawdata.length);
}
/**
 * If Python invokes this method, it should pass in a numpy.ndarray.
 */
public void setData(NDArray<?> dataFromPython) {
   // ignoring dimensions, presuming this.rawdata is conceptually only 1-dimensional
   this.rawdata = dataFromPython.getData();
}

At times Jep will convert a numpy.ndarray into a Java primitive array when the class NDArray does not fit a method signature. For example,

public void setFloatData(float[] dataFromPython) {
   // note that the dimensions are lost
   this.rawdata = dataFromPython;
}

In this scenario the information about the ndarray dimensions will not be sent along.

Direct memory support

When using an NDArray the memory is not shared between numpy and Java, therefore changes in the array in one language are not reflected in the array in the other language. Since Jep 3.7 it is possible to use a DirectNDArray which can be created from a Java NIO buffer object. The Java buffer and the Python ndarray will both use the same memory meaning any changes in one language are immediately visible in the other. For example,

try (Jep jep = new Jep()){
    FloatBuffer data = ByteBuffer.allocateDirect(6*4).asFloatBuffer();
    DirectNDArray<FloatBuffer> nd = new DirectNDArray<>(data, 6);
    jep.set("x", nd);
    jep.eval("x[1] = 700");
    // val will 700 since we set it in python
    float val = data.get(1);
    data.put(4, val + 100);
    // prints 800 since we set in java
    jep.eval("print(x[4])");
}

Numpy gotchas

Numpy currently does not support running in an embedded interpreter, though it mostly works. Since Jep 3.6, you can use the shared modules capability of Jep to work around the known issues. Since Jep 3.8, you can use the shared interpreter capability to work around the known issues.

Known issues:

  • Closing a Jep instance/sub-interpreter breaks some numpy methods. This is the infamous 'NoneType' object is not callable error. You can work around this by using the shared modules capability, using the shared interpreter capability, or by never disposing a Jep sub-interpreter that has imported numpy.
  • Printing arrays sometimes fails. Like the above issue, you can work around it by using shared modules, shared interpreters, or never disposing a Jep sub-interpreter that has imported numpy.
  • Floating point errors can deadlock. You should not be able to get this error if you are using Jep version 3.8 and above. Jep will throw a JepException about unsafe threading in this scenario.
  • Tiny memory leak. The leak, if it still exists in current versions of numpy, is tiny. It was first noticed on much older versions of numpy. We are still trying to prove it and track it down.
Clone this wiki locally