feat: cuda feature detection. #275

AsakusaRinne · 2023-11-11T11:49:25Z

This PR is supposed to be merged after #258

Note that it adds two APIs for users, which are NativeLibraryConfig.WithLibrary and NativeLibraryConfig.WithMatchRule. To be honest I think the naming is not good but I can't come up with a better one. Any idea?

TODO:

AsakusaRinne · 2023-11-11T11:54:13Z

Is there a good way for us to print some logs in NativeApis now? I think directly outputting to console is not a good idea.

martindevans · 2023-11-11T14:26:18Z

I think you can do something like Logger.GetLogger() now?

LLama/NativeLibraryConfig.cs

martindevans · 2023-11-11T16:24:04Z

LLama/NativeLibraryConfig.cs

+            config.Desc = new Description(libraryPath);
+        }
+
+        /// <summary>


I think it'd be better to split this into various WithX methods rather than having them all in one method (more extensible in the future).

e.g.

WithCuda(bool cuda) WithAvx(AvxLevel avx) WithAutoFallback(bool allow) WithSkipCheck(bool skip)

This format seems good, especially for the ability of chained call. There's one problem that it couldn't notice user the default configuration well. If we just set default value to the parameter, it may looks strange when calling WithAutoFallback().WithSkipCheck(false). 😹

Anyway, I prefer this format, too. However is there any solution about the case above?

martindevans · 2023-11-11T16:25:32Z

LLama/NativeLibraryConfig.cs

+        /// <param name="useCuda"></param>
+        /// <param name="avxLevel"></param>
+        /// <param name="allowFallback">Whether to allow fall-back when your hardware doesn't support your configuration.</param>
+        /// <param name="skipCheck">Whether to skip the check when fallback is allowed. 


I'm not really sure what skipCheck does from this description. it's "skipping the check", but what check?

It's to check if the hardware support the user's choice. Let's ignore the skipCheck parameter first, then there're two basic conditions, allowing fallback and the opposite.
If fallback is allowed, we could choose another library if the user's configuration does not match the hardware. For example, the user specified cuda but there's actually no supported cuda version at that machine.

If fallback is disabled, however, we're supposed to load exactly the library specified. However the real condition may be complex and our logic may not cover all the cases (for example, linux cuda detection). Therefore if sometimes loading a library won't actually have problem but we take it as invalid by mistake, the user could force-load it without any check.

I think by this way we could leave a solution for some potential problems in the future.

martindevans · 2023-11-11T16:26:50Z

Just a few review nits, but overall looks good.

My only criticism is it's less clear than before which binaries are preferred over others. Previously you could just read down the list of TryLoad calls and it was very obvious. Not a fatal flaw, just an observation!

AsakusaRinne · 2023-11-11T18:27:28Z

The linux check has passed but the binaries seemed not to work. I can only load the library when I replaced the binaries with self-compiled ones. Not sure if it's related with #270

AsakusaRinne · 2023-11-11T21:26:40Z

@SignalRT Could you please help to test it on MAC? For nuget package test please download this file.

…_detection

SignalRT · 2023-11-12T08:11:30Z

@AsakusaRinne I get this exception trying to load LLamaSharp:

System.BadImageFormatException: Could not load file or assembly 'LLamaSharp, Version=0.8.0.0, Culture=neutral, PublicKeyToken=null'. An attempt was ma...

System.BadImageFormatException
Could not load file or assembly 'LLamaSharp, Version=0.8.0.0, Culture=neutral, PublicKeyToken=null'. An attempt was made to load a program with an incorrect format.

at LLama.Unittest.BasicTest..ctor()
at System.RuntimeType.CreateInstanceDefaultCtor(Boolean publicOnly, Boolean wrapExceptions)

AsakusaRinne · 2023-11-12T08:21:12Z

@SignalRT It seems to be a problem of LLamaSharp package instead of backend packages. Is your system 32-bit or 64-bit?

These packages are the ones generated by github ci workflow, maybe another try with it?

SignalRT · 2023-11-12T08:50:42Z

@AsakusaRinne, Arm64 (M2 computer). I can run the test in the project, execute the examples, but with your packages or with the packages generated manually in my computer with prepare_release.sh it fails with the same error.

I'm trying to understand the problem....that seems a little weird.

SignalRT · 2023-11-12T09:51:35Z

@AsakusaRinne After a lot of problems, it seems that it's related to some problem with nuget cache. I clear the cache and it works with the packages generated in the CI pipeline

AsakusaRinne · 2023-11-12T11:38:12Z

@AsakusaRinne After a lot of problems, it seems that it's related to some problem with nuget cache. I clear the cache and it works with the packages generated in the CI pipeline

Wow, that's nice! I think we could release it tonight.

@martindevans The CUDA libraries have been replaced the ones with avx 2, is that right?

martindevans · 2023-11-12T15:19:25Z

The CUDA libraries have been replaced the ones with avx 2, is that right?

Yep, the CUDA binaries should now compile with AVX2 by default.

feat: support cuda feature detection.

d03e1db

AsakusaRinne added enhancement New feature or request distribution labels Nov 11, 2023

AsakusaRinne requested review from martindevans and SignalRT November 11, 2023 11:49

martindevans reviewed Nov 11, 2023

View reviewed changes

LLama/NativeLibraryConfig.cs Outdated Show resolved Hide resolved

martindevans reviewed Nov 11, 2023

View reviewed changes

fix: cannot load library under some conditions.

bbbfbd2

AsakusaRinne force-pushed the cuda_detection branch from e11df78 to bbbfbd2 Compare November 11, 2023 17:55

feat: optimize apis for cuda feature detection.

cb5fb21

AsakusaRinne mentioned this pull request Nov 11, 2023

Runtime detection MacOS #258

Merged

5 tasks

build: change nuget configuration for cuda detection.

4d2c5f1

Merge branch 'master' of github.com:AsakusaRinne/LLamaSharp into cuda…

d7675f7

…_detection

AsakusaRinne marked this pull request as ready for review November 12, 2023 04:14

AsakusaRinne added 3 commits November 12, 2023 12:26

fix typo.

502bb73

ci: update ci workflows.

460f507

docs: adjust some descriptions.

da6718c

AsakusaRinne added the minor-release label Nov 12, 2023

AsakusaRinne merged commit 17ec890 into SciSharp:master Nov 12, 2023
5 checks passed

AsakusaRinne mentioned this pull request Nov 12, 2023

Linux cuda crash #270

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: cuda feature detection. #275

feat: cuda feature detection. #275

AsakusaRinne commented Nov 11, 2023 •

edited

Loading

AsakusaRinne commented Nov 11, 2023

martindevans commented Nov 11, 2023

martindevans Nov 11, 2023

AsakusaRinne Nov 11, 2023

martindevans Nov 11, 2023

AsakusaRinne Nov 11, 2023

martindevans commented Nov 11, 2023

AsakusaRinne commented Nov 11, 2023

AsakusaRinne commented Nov 11, 2023

SignalRT commented Nov 12, 2023

AsakusaRinne commented Nov 12, 2023 •

edited

Loading

SignalRT commented Nov 12, 2023

SignalRT commented Nov 12, 2023

AsakusaRinne commented Nov 12, 2023

martindevans commented Nov 12, 2023

feat: cuda feature detection. #275

feat: cuda feature detection. #275

Conversation

AsakusaRinne commented Nov 11, 2023 • edited Loading

AsakusaRinne commented Nov 11, 2023

martindevans commented Nov 11, 2023

martindevans Nov 11, 2023

Choose a reason for hiding this comment

AsakusaRinne Nov 11, 2023

Choose a reason for hiding this comment

martindevans Nov 11, 2023

Choose a reason for hiding this comment

AsakusaRinne Nov 11, 2023

Choose a reason for hiding this comment

martindevans commented Nov 11, 2023

AsakusaRinne commented Nov 11, 2023

AsakusaRinne commented Nov 11, 2023

SignalRT commented Nov 12, 2023

AsakusaRinne commented Nov 12, 2023 • edited Loading

SignalRT commented Nov 12, 2023

SignalRT commented Nov 12, 2023

AsakusaRinne commented Nov 12, 2023

martindevans commented Nov 12, 2023

AsakusaRinne commented Nov 11, 2023 •

edited

Loading

AsakusaRinne commented Nov 12, 2023 •

edited

Loading