OPEN: FP integration #3

Dequino · 2024-09-12T13:17:57Z

Reintegrated immediate float in AbstractDataTypes
Reintegrated float immediate in DeeployTypes
Reintegrated bfloat16, float32, FloatDataTypes to DataTypes
Reintegrated testImmediatePromotionFloat
Reintegrated FloatAdder, tested FP32 on generic platform
Also tested on siracusa, output buffer is the correct but gvsoc also returns an error

Future work:

Integrate PULP-Trainlib floating point kernels as third-party library

Signed-off-by: LasagneArrosto <[email protected]>

Victor-Jung · 2024-09-13T13:09:58Z

Hi Alberto! Since this PR hasn't been merged in the GitLab repo, it's not really a reintegration. Could you make a more detailed description of the proposed feature?

Additionally, I think it's best not to add the PULP-TrainLib kernel integration into this PR.

Dequino · 2024-09-17T09:46:49Z

Hi Victor,

Sure, we can move the pulp-trainlib integration in a separate PR.

I'll be more through on the contents and added features:

Feature addition: Float Immediate abstract type
I've added the Float Immediate to the abstract types to be integrated in deeploy.
At the moment, the workframe has been developed focusing on deploying networks quantized with integer data.
It may be useful to future proof the framework by enabling integration of arbitrary floating-point precision formats. To do so, the first step was to define FloatImmediate class in the AbstractDataTypes, more precisely I've defined a method that checks if an immediate value can be represented by a float format with an arbitrary number of bits reserved for the exponent and fraction part.

To test the validity of this method, I've integrated the testImmediatePromotionFloat to the testTypes.py script, in which we test different immediate values to be represented in float32 and bfloat16 formats, which have been added as FloatImmediate extensions in Deeploy/CommonExtensions/DataTypes. If we want to add a new floating point precision format, we can simply define it here by specifying its total width, number of reserved bits for the fraction, and number of reserved bits for the exponent.

After that, I've tested the correct integration of floats by testing a simple deployment of a dummy model doing an addition between two float tensors. The test model and data is defined in Test/FloatAdder. I've had to edit a number of files to enable float data types, including:

Deeploy/DeeployTypes
Generic/Bindings
Generic/TypeCheckers
Deeploytest/GenerateNetowrk
testUtils/typeMapping

I've also added some minor edits to the documentation, as some line commands were incorrect or incomplete.

If you have more in depth questions about the changes, let me know

Scheremo

Hi Alberto! I had a look at your proposed changes, and I think they are overall in okay shape. My main advise would be to rely on standard library functions for mantissa and exponent extraction (they have python bindings), as the current implementation is hard to follow and seems to be prone to edge cases.

Scheremo · 2024-09-26T09:27:38Z

Deeploy/AbstractDataTypes.py

+class FloatImmediate(Immediate[Union[float, Iterable[float]], _ImmediateType]):
+    typeFraction: int  #: int: Represents the number of bits reserved for the fraction part
+    typeExponent: int  #: int: Represents the number of bits reserved for the exponent part
+    signed: bool  #: bool: Represents whether the underlying float is signed or unsigned (should be removed)


If this is not needed, please remove it :)

Sure, float numbers are all signed after all, I just put the bool here to make sure it wouldn't conflict with anything else in the framework

Scheremo · 2024-09-26T09:28:08Z

Deeploy/AbstractDataTypes.py

+        # The offset added to the exponent
+        return 2**(cls.typeExponent - 1) - 1
+
+    # ADEQUINO: This is a ugly workaround for FP, works for bfloat16 and fp32 because bfloat16 is a truncated fp32


Not sure I understand this comment. What about the code is a workaround?

Scheremo · 2024-09-26T09:30:50Z

Deeploy/AbstractDataTypes.py

+            # Fraction binarization, fails if nbits required > n bits mantissa.
+            # If integer part of immediate is 0, we start counting mantissa bits after we find the first 1 bit.
+            if (int(integer) > 0):
+                for i in range(cls.typeFraction):
+                    f = f * 2
+                    f, fint = math.modf(f)
+                    binarylist.append(str(int(fint)))
+                    if f == 0:
+                        break
+                    elif i == (cls.typeFraction - 1):
+                        return False
+            else:
+                flag = 0
+                count = cls.typeFraction + 1
+                while (count):
+                    f = f * 2
+                    f, fint = math.modf(f)
+                    binarylist.append(str(int(fint)))
+                    if int(fint) == 1 and flag == 0:
+                        flag = 1
+                    if f == 0:
+                        break
+                    if flag == 1:
+                        count = count - 1
+                    if (count == 0):
+                        return False


All of this float to string to list to int casting seems unnecessary to me.
Please use a builtin method to determine the number of mantissa and exponent bits, e.g. frexp (See here: https://www.tutorialspoint.com/c_standard_library/c_function_frexp.htm)

Scheremo · 2024-09-26T09:31:19Z

Deeploy/CommonExtensions/DataTypes.py

@@ -76,10 +76,27 @@ class uint64_t(IntegerImmediate):
    signed = False


+class bfloat16(FloatImmediate):
+    typeName = "float16alt"


I would suggest to keep the typeName equal to the name of the class to avoid confusion.

Scheremo · 2024-09-26T09:32:42Z

Deeploy/Targets/Generic/Templates/FloatAddTemplate.py

+        input_1_offset = 0
+        if hasattr(data_in_1, "_signed") and hasattr(data_in_1, "nLevels"):
+            input_1_offset = (data_in_1._signed == 0) * int(data_in_1.nLevels / 2)
+        input_2_offset = 0
+        if hasattr(data_in_2, "_signed") and hasattr(data_in_2, "nLevels"):
+            input_2_offset = (data_in_2._signed == 0) * int(data_in_2.nLevels / 2)
+        output_offset = 0
+        if hasattr(data_out, "_signed") and hasattr(data_out, "nLevels"):
+            output_offset = -(data_out._signed == 0) * int(data_out.nLevels // 2)
+
+        operatorRepresentation['offset'] = input_1_offset + input_2_offset + output_offset
+
+        return ctxt, operatorRepresentation, []


I don't quite understand the use of nLevels and offset here, those abstractions don't seem to make sense to me for floating point arithmetic.

Scheremo · 2024-09-26T09:33:49Z

Makefile

@@ -76,7 +76,7 @@ echo-bash:
 	@echo "export MEMPOOL_HOME=${MEMPOOL_INSTALL_DIR}"
 	@echo "export CMAKE=$$(which cmake)"
 	@echo "export PATH=${QEMU_INSTALL_DIR}/bin:${BANSHEE_INSTALL_DIR}:\$$PATH"
-	@echo "export PATH=~/.cargo/bin:$PATH"
+	@echo "export PATH=~/.cargo/bin:\$$PATH"


Why is this change necessary?

FrancescoConti · 2024-11-13T17:58:16Z

Closing as superseded by (and included into) #12

alberto and others added 5 commits September 11, 2024 12:06

Added floating point support

ae9f793

Added floatadder fp32 for generic platform

65bdbac

Added float adder test

b67df55

Small corrections for installation guide

6640216

Update Makefile

0ebe43f

Signed-off-by: LasagneArrosto <[email protected]>

Dequino changed the base branch from main to devel September 12, 2024 13:18

alberto added 2 commits September 12, 2024 15:23

Formatted style

33920fa

Merge branch 'dev' of github.com:Dequino/Deeploy into dev

326c793

Victor-Jung changed the title ~~FP integration~~ OPEN: FP integration Sep 13, 2024

Scheremo reviewed Sep 26, 2024

View reviewed changes

FrancescoConti mentioned this pull request Nov 9, 2024

OPEN: FP integration (v2) #12

Merged

FrancescoConti closed this Nov 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OPEN: FP integration #3

OPEN: FP integration #3

Dequino commented Sep 12, 2024

Victor-Jung commented Sep 13, 2024

Dequino commented Sep 17, 2024

Scheremo left a comment

Scheremo Sep 26, 2024

Dequino Sep 26, 2024

Scheremo Sep 26, 2024

Scheremo Sep 26, 2024

Scheremo Sep 26, 2024

Scheremo Sep 26, 2024

Scheremo Sep 26, 2024

FrancescoConti commented Nov 13, 2024

OPEN: FP integration #3

OPEN: FP integration #3

Conversation

Dequino commented Sep 12, 2024

Victor-Jung commented Sep 13, 2024

Dequino commented Sep 17, 2024

Scheremo left a comment

Choose a reason for hiding this comment

Scheremo Sep 26, 2024

Choose a reason for hiding this comment

Dequino Sep 26, 2024

Choose a reason for hiding this comment

Scheremo Sep 26, 2024

Choose a reason for hiding this comment

Scheremo Sep 26, 2024

Choose a reason for hiding this comment

Scheremo Sep 26, 2024

Choose a reason for hiding this comment

Scheremo Sep 26, 2024

Choose a reason for hiding this comment

Scheremo Sep 26, 2024

Choose a reason for hiding this comment

FrancescoConti commented Nov 13, 2024