Demonstrating Native Addons #5
Conversation
Now it's back to the tag pipeline, but we put in the necessary checks to prevent it from running when the commit title is a version release tag.
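As a sketch (not the actual pipeline config), such a guard in GitLab CI could look like this, assuming release commits are titled with a bare semver string:

```yaml
# Hypothetical .gitlab-ci.yml workflow rule: skip pipelines whose commit
# title is a version release tag like "1.2.3".
workflow:
  rules:
    - if: '$CI_COMMIT_TITLE =~ /^[0-9]+\.[0-9]+\.[0-9]+$/'
      when: never
    - when: always
```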
Now the prerelease build is working fine. The final step is the release job, which is done after integration. We will use our original job which produces a GH release and tag, but also combine it with a release job. In fact our …
Also we'd like to auto-merge staging into master when this all passes. This should be done after all the integration runs are done. This will require the …
Actually, automerge via … So another way is just to use … There's also push options that enable auto-creating an MR on GitLab... or using … So keep it simple and just merge into master, and push it up. That will trigger the pipeline on the master branch, like a recursive pipeline. What are the permissions needed to write back to the repo?
For the push back to origin, we may need to authenticate either via SSH or HTTP. I believe HTTP would be the best. Some sort of token needs to be available so that jobs can authenticate to their own project.
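For illustration only (the token variable name and the remote URL are assumptions, not the project's actual settings), an HTTP-authenticated push back to origin could be shaped like this:

```shell
# Hypothetical push-back step: PUSH_TOKEN is assumed to be a CI/CD variable
# holding a token with write access to the repository.
push_back_to_origin() {
  local repo="$1"   # e.g. github.com/MatrixAI/TypeScript-Demo-Lib-Native
  git push "https://${PUSH_TOKEN}@${repo}.git" HEAD:master
}
# In the CI job this would be invoked as:
#   push_back_to_origin github.com/MatrixAI/TypeScript-Demo-Lib-Native
```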
I found https://docs.gitlab.com/ee/ci/jobs/ci_job_token.html but it's a bit more complicated than that. Firstly, I'm not sure if it can even push up to the origin. Alternatively, access tokens exist on each project, but those have to be set up individually for each project. Finally, all of this would only affect the GitLab repository, which is a pull mirror of GitHub. So the actual auth needed is GitHub's. Which is why we wanted to use …
I think we might need to use … We now have a nice way of merging staging to master. Note that further jobs run after merging to master: in this case production deployment, production deployment tests, then the final production release.
Some final issues:
Changing to typescript-demo-lib-native
To test merging from feature-native to staging, I need to use … Generally speaking we would want feature branches to fast-forward into the staging branch, and this is enforced by GitHub. According to https://stackoverflow.com/questions/60597400/how-to-do-a-fast-forward-merge-on-github, GitHub has no way of doing fast-forward merges on PRs. This means:
The original plan was to have linear commits between master and staging, but this is also acceptable for now.
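To make the fast-forward constraint concrete, here is a self-contained sketch (throwaway repo, the branch names mirror ours) of the merge we would do locally, since GitHub PRs cannot do it:

```shell
# Demonstrate a fast-forward merge of feature-native into staging
# in a throwaway repository.
set -e
dir=$(mktemp -d)
cd "$dir"
git init -q
git config user.email ci@example.com
git config user.name ci
git checkout -q -b staging
git commit -q --allow-empty -m "base"
git checkout -q -b feature-native
git commit -q --allow-empty -m "feature work"
git checkout -q staging
# --ff-only refuses to merge unless history stays linear (no merge commit)
git merge --ff-only feature-native
git log -1 --pretty=%s   # prints: feature work
```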

Description
Derived from #2
In helping solve the snapshot isolation problem in MatrixAI/js-db#18, we needed to lift the hood and go into the C++ level of nodejs.
To do this, I need to have a demonstration of how native addons can be done in our demo lib here.
There are 2 ecosystems for building native addons: node-pre-gyp and prebuild.
Of the 2, the prebuild ecosystem is used by UTP and leveldb, so we will continue using that. Its advantages as of 2016 were discussed here: prebuild/prebuild#159
The basic idea is that Node supports a "NAPI" system that enables node applications to call into C++. So it's the FFI system of NodeJS. It's also a bidirectional FFI, as C++ code can call back into JS functions of NodeJS.
The core library is `node-gyp`. In the prebuild ecosystem it is wrapped with `node-gyp-build`, which you'll notice is the one we are already using in this repo. The main feature here is the ability to supply prebuilt binaries instead of expecting the end-user to always compile from source. Further details here: https://nodejs.github.io/node-addon-examples/build-tools/prebuild (it also compares it to node-pre-gyp).
The `node-gyp-build` has to be a `dependency`, not a `devDependency`, because it is used during runtime to automatically find the built shared-object/dynamic library and to load it. It looks like this:
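The elided snippet presumably resembles the standard `node-gyp-build` loader pattern (a sketch, not this repo's exact code; it needs the package installed and a compiled addon to run):

```javascript
// index.js: load the compiled addon for the current platform/arch.
// node-gyp-build searches the prebuilds/ directory first, then falls
// back to the locally compiled build/ output.
const nodeGypBuild = require('node-gyp-build');

// __dirname is the package root containing binding.gyp and prebuilds/
const binding = nodeGypBuild(__dirname);

module.exports = binding;
```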
Internally `nodeGypBuild` ends up calling the `require()` function inside NodeJS, which supports the ability to load `*.node` binaries (the shared objects compiled using the NAPI C++ headers). See: https://github.com/prebuild/node-gyp-build/blob/2e982977240368f8baed3975a0f3b048999af40e/index.js#L6
The `require` is supplied by the NodeJS runtime. If you execute the JS with a different runtime, it may support the CommonJS standard and thus understand the `require` calls, but it may not be compatible with native modules that are compiled with the NAPI headers. This is relevant since you also have to load the binary that matches your OS libraries and CPU architecture. It's all dynamic linking under the hood. This is also why you use `node-gyp-build`, which automates some of this lookup procedure.
As a side-note about bundlers: bundlers are often used as part of the build process that targets web platforms. Since the web platform does not understand `require` calls, bundlers will perform some sort of transclusion. This is also the case when ES6 `import` targets files on disk. Details on this process are here: https://github.com/evanw/esbuild/blob/master/docs/architecture.md#notes-about-linking. Bundlers will often call this "linking", and when targeting web platforms this is basically a form of static linking, since JS running in browsers cannot load JS files from disk. This is also why in some cases one should replace native addons with WASM instead, as bundlers can support static linking of WASM (which is cross-platform) into a web bundle. But some native addons depend on OS features (like databases with persistence), and fundamentally cannot be converted into WASM binaries. In the future, it would make sense to turn our crypto code into WASM binaries. But DB code is likely to always be native, as it has to be persistent. As the web develops and gains extra features, it may eventually be possible for all native code to be done via WASM (but this may be a few years off).

Now the native module itself is just done with a C++ file like `index.cpp`. We should prefer using `.cpp` and `.h` as they are the most portable extensions.

Additionally, there must be a `binding.gyp` file that looks like this:
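The elided `binding.gyp` presumably looked something like this (a sketch following the napi-macros README convention; `somename` and the source path are placeholders):

```gyp
{
  "targets": [{
    "target_name": "somename",
    "sources": ["./index.cpp"],
    "include_dirs": [
      "<!(node -e \"require('napi-macros')\")"
    ]
  }]
}
```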
node-gypand how it should be compiling the C++ code. Thetarget_namespecifies the name of the addon file, so the output result will besomename.node. Thesourcesare self-explanatory. Theinclude_dirsentries have the ability to execute shell commands, in this case, it is usingnode -eto execute a script that will return some string that is a path to C++ headers that will be included during compilation.The C++ code needs to use the NAPI headers, however there's a macro library that makes writing NAPI addons easier: https://github.com/hyperdivision/napi-macros. I've seen this used in the utp-native and classic-level.
The C++ code may look like this:
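A sketch of the elided addon, based on the `times_two` example from the napi-macros README (it matches the function described below; it only compiles against the Node headers, not standalone):

```cpp
#include <node_api.h>
#include <napi-macros.h>

// NAPI_METHOD declares a function callable from JS.
NAPI_METHOD(times_two) {
  NAPI_ARGV(1)                 // expect 1 argument
  NAPI_ARGV_INT32(number, 0)   // coerce argument 0 to an int32

  number *= 2;

  NAPI_RETURN_INT32(number)    // return an int32 back to JS
}

// NAPI_INIT registers the module's exports.
NAPI_INIT() {
  NAPI_EXPORT_FUNCTION(times_two)
}
```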
This ends up exporting a native module containing the `times_two` function that multiplies a number by 2 and returns an `int32` number.

It's also important that `node-gyp-build` is set up as an `install` script (i.e. `"install": "node-gyp-build"` in the `scripts` section) in the `package.json`. This means when you run
`npm install` (which is used to install all the dependencies for an NPM package, or to install a specific NPM package), it will run `node-gyp-build` during the installation process.

This means that currently our `utils.nix` `node2nixDev` expression still requires the `npm install` command. This used to exist, however I removed it during MatrixAI/TypeScript-Demo-Lib#37 thinking it had no effect. But it was confirmed by svanderburg/node2nix#293 (comment) that the `npm install` command is still run in order to execute build scripts. And `node-gyp-build` is now part of the installation process. We should include: https://github.com/svanderburg/node2nix/blob/8264147f506dd2964f7ae615dea65bd13c73c0d0/nix/node-env.nix#L380-L387 with all the necessary flags and parameters too. We may be able to make it work if we hook our build command prior to `npm install`. I imagine that this should be possible since the `npm rebuild` command is executed prior. So we need to investigate this.

In order to make this all work, our Nix environment is going to need all the tools for source compilation. Now according to https://github.com/nodejs/node-gyp#on-unix we will need
`python3`, `make` and `gcc`. Our `shell.nix` naturally has `make` and `gcc` because we are using `pkgs.mkShell`, which must extend from `stdenv.mkDerivation`. However `python3` will be needed as well.

The `node2nix` has some understanding of native dependencies (this is why it also brings in `python` in its generated derivation svanderburg/node2nix#281), and I believe it doesn't actually build from source (except for some overridden dependencies).

Some npm dependencies are brought in via nixpkgs `nodePackages` because the `node2nix` derivation isn't enough to build them (because they have complex native dependencies), such as `node-gyp-build` itself or vercel's `pkg`. This is also why I had to provide `nodePackages.node-gyp-build` in our `buildInputs` overrides in `utils.nix`. It is important that any dependencies acquired via nixpkgs must be the same version we use in our `package.json`. And this is the case for:

Ideally we won't need to do this for our own native packages if `js-db` ends up forking `classic-level` or `leveldown`. I think this trick is only relevant for our "build tools" and not our runtime dependencies.

The remaining problem is cross-compilation, as this only enables building from source if you are on NixOS and/or using Nix. Windows and MacOS will require their own setup. Since our development environment is all Nix-focused, we don't have to worry about those, but end-users who may want to rebuild from scratch will need to set up their development environment based on the information in https://github.com/nodejs/node-gyp. A more pressing question is how we, in our Nix development environment, will be capable of building cross-platform native addons for distribution.
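For reference, a minimal sketch of what the Nix side could provide (hypothetical; our actual `shell.nix` and `utils.nix` differ):

```nix
# Hypothetical shell.nix: mkShell already provides make and gcc via stdenv;
# python3 and node-gyp-build are added for node-gyp source builds.
with import <nixpkgs> {};
mkShell {
  buildInputs = [
    nodejs
    python3
    nodePackages.node-gyp-build
  ];
}
```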
This is where the prebuild ecosystem comes in and in particular https://github.com/prebuild/prebuildify-cross. This is used in leveldb to enable them to build for different platforms, and then save these cross-compiled objects. These objects are then hosted on GitHub releases, and automatically downloaded upon installation for downstream users. In the case they are not downloadable, they are then built from source. https://github.com/Level/classic-level/blob/f4cabe9e6532a876f6b6c2412a94e8c10dc5641a/package.json#L21-L26
However in our Nix based environment, I wonder if we can avoid using docker to do cross compilation, and instead use Nix to provide all the tooling to do cross-compilation. We'll see how this plays out eventually.
Some additional convenience commands now:
Issues Fixed
- `nodejs.src` for `--nodedir` when it can just use the `nodejs` svanderburg/node2nix#295
- `mkShell` should set `NIX_NO_SELF_RPATH = true;` by default NixOS/nixpkgs#173025

Tasks
- [ ] `node-gyp-build`
- [ ] `addOne` for primitives and `setProperty` for reference-passing procedure and `makeArray` for heap allocation
- [ ] `nix` expressions to support `node-gyp-build` and other build scripts, and see if we can eliminate our `postInstall` hook, by relying on `package.json` hooks instead
- [ ] `prebuildify` to precompile binaries and host them on our git release... but this depends on whether `typescript-demo-lib` is used as a library or as an application: if used as an application, then the `pkg` builds are used; if used as a library, then one must install the native binary from the same github release; this means the native binary must be part of the same release page. `pkg` integration may just be a matter of setting the `assets` path in `package.json` to the local `prebuilds` directory.
- [ ] 5. Cross compilation - we must use CI/CD to do cross compilation (not sure about other architectures like ARM) - `prebuildify-cross` or something else that uses Nix
- [ ] `@typescript-eslint` packages to match js-db to avoid the warning message.
- [ ] 8. Update README.md to indicate the 2 branches of typescript-demo-lib, the main and the native branch, where the native branch indicates how to build native addons - this will be done in a separate repo: https://github.com/MatrixAI/TypeScript-Demo-Lib-Native based off https://gitlab.com/MatrixAI/Employees/matrix-team/-/issues/8#note_885403611
- [ ] `pkg` bundle can receive optimisation on which prebuild architectures it bundles; right now it bundles all architectures, when the target architecture implies only a single architecture is required. This can slim the final output `pkg` so it's not storing random unnecessary things. This may mean that `pkg` requires dynamic `--config` to be generated.
- [ ] `nix-build ./release.nix -A application` can use the `prebuilds/` directory as well, as this can unify with `pkg`. That way all things can use the `prebuilds/` directory. But we would want to optimise it with task 10.
- [ ] 12. Ensure that `npm test` can automatically run general tests, and platform-specific tests if detected on the relevant platform - this can be done in polykey as a script

Future Tasks
- [ ] `win-arm64`, `linux-arm64` (linux will require the necessary nix-shell environment)
- [ ] `ldid` or `codesign` - WIP: Demonstrating Native Addons TypeScript-Demo-Lib#38 (comment)
- [ ] `pkg` bundling script so that it doesn't bundle useless `.md` files; right now it's even bundling the `CHANGELOG.md` files - WIP: Demonstrating Native Addons TypeScript-Demo-Lib#38 (comment)
- [ ] `pkg` instead of `zip` archives so you can do stapling and therefore not require the client systems to have access to the internet before running the executable: WIP: Demonstrating Native Addons TypeScript-Demo-Lib#38 (comment)
- [ ] `integration:macos` job - WIP: Demonstrating Native Addons TypeScript-Demo-Lib#38 (comment)
- [ ] `npm test` (it should automatically understand how to conditionally test these things by loading files appropriately in the right platform, or just a script that knows): https://stackoverflow.com/questions/50171932/run-jest-test-suites-in-groups

Final checklist