This project builds the JNI layer for Java to call PyTorch C++ APIs.
You can find more information in the src
.
You need to install cmake
and C++ compiler on your machine in order to build
apt-get install -y locales cmake curl unzip software-properties-common
Use the following task to build PyTorch JNI library:
./gradlew compileJNI
gradlew compileJNI
This task will send a Jni library copy to pytorch-engine
model to test locally.
Note: PyTorch C++ library requires CUDA path set in the system.
Use the following task to build pytorch JNI library for GPU:
# compile CUDA 11.X version of JNI
./gradlew compileJNI -Pcu11
# compile CUDA 11.X version of JNI
gradlew compileJNI -Pcu11
It uses clang-format to format the code.
./gradlew formatCpp
- Spin up a EC2 instance for linux, linux-gpu, windows, windows-gpu and cd pytorch/pytorch-native.
- Run ./gradlew compileJNI and resolve all the issues you are facing.
- Raise a PR for the JNI code change and don’t merge it until we have the rest things ready.
- check the dependencies of each JNI library by
otool -L libdjl_torch.dylib
for osx,ldd libdjl_torch.so
for linux anddumpbin /dependents libdjl_torch.dll
for windows. - Compare all dependency libraries to those in downloadPyTorchNativeLib, if we miss copying new dependencies, correct the script.
- modify the version to desired release version in pytorch/pytorch-native/build.gradle and make sure the URL in the task downloadPyTorchNativeLib point to the right, available URL. Usually the URL that is not for the latest version will have %2Bcpu/cuXXX in the end.
- Make corresponding change on build.sh and build.cmd
- Raise PR for script change and get them merge
- Spin up a EC2 instance and
cd pytorch/pytorch-native && ./gradlew dPTNL
cd build/native/lib
and gzip all dependencies (gzip -r download
)- Create a new pytorch-X.X.X in ai.djl/publish bucket with djl-prod account.
aws s3 sync build/native/lib s3://djl-ai/publish/pytorch-X.X.X
- Merge the JNI code change.
- Now every script should point to new PyTorch version except integration and example are still using old pytorch-native version
- Trigger Native JNI S3 PyTorch and resolve issues if any
- Trigger Native Snapshot PyTorch
- Raise a PR to bump up all PyTorch version to new version and add -SNAPSHOT
- Test integration test , example and pytorch-engine unit test with snapshot pytorch-native
- Trigger Native Release PyTorch
- Test integration test, example and pytorch-engine unit test with staging pytorch-native
- Publish to sonatype
- Raise a PR to remove all the -SNAPSHOT
This is adjusted from the comments in PR#2349.
To implement a simple pytorch feature, generally you can do the following steps.
-
Find the c-api in torch library for the feature to add. This can be done by searching in the document like this or searching in the torch cpp source code.
-
Implement the JNI and api's in Java. The JNI can then be compiled with gradle commands. Here is the commands you can use on cpu machine, to compile JNI and run it with java api.
cd engines/pytorch/pytorch-native ./gradlew cleanJNI compileJNI ./gradlew :engines:pytorch:pytorch-jni:clean ./gradlew :engines:pytorch:pytorch-engine:test
Note: In case your need test with GPU, the compilation needs to be the following:
./gradlew cleanJNI ./gradlew compileJNI -Pcu11
-
Document the api and add the unit tests.
-
Format the code and pass the PR tests
Run the following tasks
./gradlew fJ fC checkstyleMain checkstyleTest pmdMain pmdTest ./gradlew test