fix: adjust checks in ForthMachine to prevent segfault when num_items is negative #3209

ariostas · 2024-08-09T14:40:40Z

This PR addresses bug that was found in #3188. For some particular file, the num_items variable ended up being negative and it was causing a segfault when trying to write some data. I adjusted and added some checks to prevent the segfault and make it crash gracefully. Nevertheless, it does not address the issue with reading the file in #3188.

I checked with the Compiler Explorer and both of these options produce the same machine code, so I just used the more expressive one.

if (num_items < 0) num_items = 0;
num_items *= (num_items >= 0)

ariostas · 2024-08-09T14:46:22Z

Maybe to make it more complete it should set current_error_ to some (possibly new) type of error?

jpivarski · 2024-08-09T15:51:41Z

Maybe to make it more complete it should set current_error_ to some (possibly new) type of error?

That was my first thought, since this is another thing that could go wrong. However, for some AwkwardForth type codes, it has effectively been taking the maximum of the stack-read variable and zero for a while now. Turning this into an error would make code development in AwkwardForth more developer-friendly, but most of the development that (I believe) will ever be done in this language has already been done. Its existence was motivated by accelerating Uproot TTree-reading, and as we move to RNTuple, it becomes irrelevant. In principle, it's useful for other non-columnar data formats whose types are static through the dataset but not known before reading (i.e. the type is specified in a file-bound schema). Avro is one example of that, Protobuf, and Thrift are others, maybe also Flatbuffers, but there hasn't been high demand for getting these formats into Awkward Arrays. AwkwardForth is not the best choice for fully dynamic formats (JSON, BSON, etc.) or columnar formats (Parquet), or if a JIT-compiler is available. It's fairly niche—great for the Uproot TTree use-case—but that's done.

So I'd be happy to turn the accidental interpretation of negative num_items as zero into an explicit rule in the language. It can be documented as a short sentence somewhere in here:

awkward/docs/reference/awkwardforth.rst

Lines 1183 to 1220 in af232f5

    
           Single value vs multiple values 
        
           """"""""""""""""""""""""""""""" 
        
           Reading a batch of data in one instruction is faster than reading the same data in many steps. To read a batch of data, prepend the "``*->``" word with a number sign (``#``). This pops a value off the stack to use as the number of items to read. 
        
           The following examples result in the same output: 
        
           .. code-block:: python 
        
               >>> vm = ForthMachine32(""" 
        
               ... input x 
        
               ... output y float32 
        
               ...  
        
               ... 1000000 0 do 
        
               ...   x d-> y 
        
               ... loop 
        
               ... """) 
        
               >>> vm.run({"x": np.arange(1000000) * 1.1}) 
        
               >>> np.asarray(vm["y"]) 
        
               array([0.0000000e+00, 1.1000000e+00, 2.2000000e+00, ..., 1.0999968e+06, 
        
                      1.0999978e+06, 1.0999989e+06], dtype=float32) 
        
           and 
        
           .. code-block:: python 
        
               >>> vm = ForthMachine32(""" 
        
               ... input x 
        
               ... output y float32 
        
               ...  
        
               ... 1000000 x #d-> y 
        
               ... """) 
        
               >>> vm.run({"x": np.arange(1000000) * 1.1}) 
        
               >>> np.asarray(vm["y"]) 
        
               array([0.0000000e+00, 1.1000000e+00, 2.2000000e+00, ..., 1.0999968e+06, 
        
                      1.0999978e+06, 1.0999989e+06], dtype=float32) 
        
           but the second is faster because it involves two Forth instructions and one ``memcpy``.

For tests (for this PR), how about the following? I ran these on the old/broken AwkwardForth on Linux.

Positive num_items; no issues:

>>> vm = awkward.forth.ForthMachine32("input source 5 source #q-> stack")
>>> vm.run({"source": np.array([1, 2, 3, 4, 5])})
>>> vm.stack
[1, 2, 3, 4, 5]

and

>>> vm = awkward.forth.ForthMachine32("input source output sink float64 5 source #q-> sink")
>>> vm.run({"source": np.array([1, 2, 3, 4, 5])})
>>> vm.output("sink")
array([1., 2., 3., 4., 5.])

Negative num_items; some issues (but no segfault on Linux):

>>> vm = awkward.forth.ForthMachine32("input source -5 source #q-> stack")
>>> vm.run({"source": np.array([1, 2, 3, 4, 5])})
>>> vm.stack
[]

and

>>> vm = awkward.forth.ForthMachine32("input source output sink float64 -5 source #q-> sink")
>>> vm.run({"source": np.array([1, 2, 3, 4, 5])})
>>> vm.output("sink")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ValueError: negative dimensions are not allowed

Maybe these would segfault on MacOS, or maybe the segfault only happens deeper in a more complex example. But anyway, with the fix currently implemented in this PR, the first would return the same thing and the second would also return an empty array.

Actually, maybe it would be better to test a negative num_items followed by a positive num_items (or vice-versa), which are showing bad behavior on Linux. (Keep in mind that I'm not running in this PR.)

>>> vm = awkward.forth.ForthMachine32("input source -5 source #q-> stack 5 source #q-> stack")
>>> vm.run({"source": np.array([1, 2, 3, 4, 5])})
>>> vm.stack
[1068248464, 3, -1170705504, 0, 49]

With this PR, the output should be [1, 2, 3, 4, 5] because the first -5 source #q-> stack will now be interpreted as a no-op.

>>> vm = awkward.forth.ForthMachine32("input source output sink float64 -5 source #q-> sink 5 source #q-> sink")
>>> vm.run({"source": np.array([1, 2, 3, 4, 5])})
>>> vm.output("sink")
array([], dtype=float64)

This should also be [1, 2, 3, 4, 5] for the same reason.

This is clearly doing bad things in Linux because a few steps after these steps, I got a core dump.

In a new process, I tested the positive num_items followed by negative num_items, and these should also both return [1, 2, 3, 4, 5], which they do not (without this PR):

>>> vm = awkward.forth.ForthMachine32("input source 5 source #q-> stack -5 source #q-> stack")
>>> vm.run({"source": np.array([1, 2, 3, 4, 5])})
>>> vm.stack
[1, 2, 3, 4, 5]

and

>>> vm = awkward.forth.ForthMachine32("input source output sink float64 5 source #q-> sink -5 source #q-> sink")
>>> vm.run({"source": np.array([1, 2, 3, 4, 5])})
>>> vm.output("sink")
array([], dtype=float64)

ariostas · 2024-08-09T17:31:56Z

Thanks so much, Jim. I added the rule to the documentation and the tests you suggested. I couldn't find if the awkward-cpp tests should go somewhere else, so let me know if I should move them.

jpivarski

This all looks good and I'd say it's ready to merge!

I don't know which awkward-cpp tests you mean, but almost all of the awkward-cpp testing is done in Python through Awkward, anyway. (The major exception is the kernel-testing, but that's unrelated to AwkwardForth.) These tests are definitely sufficient.

ariostas · 2024-08-09T17:42:53Z

Hmm I'll have to check why some Windows tests are failing.

ariostas · 2024-08-09T18:03:15Z

Welp, it's the second time this week where I run into issues by not explicitly specifying dtype. But now it should all be good to go.

ianna

Looks great! Thanks!

jpivarski · 2024-09-12T21:03:07Z

@all-contributors please add @ariostas for code

allcontributors · 2024-09-12T21:03:17Z

@jpivarski

I've put up a pull request to add @ariostas! 🎉

Adjusted checks in case num_items is negative

bd51556

ariostas temporarily deployed to docs August 9, 2024 14:55 — with GitHub Actions Inactive

ariostas added 2 commits August 9, 2024 13:24

Added tests

bf9efb4

Added documentation for new rule

091aa51

jpivarski approved these changes Aug 9, 2024

View reviewed changes

ariostas temporarily deployed to docs August 9, 2024 17:40 — with GitHub Actions Inactive

Specify dtype for numpy arrays

230e4e2

ariostas deployed to docs August 9, 2024 18:00 — with GitHub Actions View deployment

ianna approved these changes Aug 9, 2024

View reviewed changes

ariostas merged commit 6eb8627 into main Aug 12, 2024
40 checks passed

ariostas deleted the ariostas/fix_segfault branch August 12, 2024 13:19

allcontributors bot mentioned this pull request Sep 12, 2024

docs: add ariostas as a contributor for code #3240

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: adjust checks in ForthMachine to prevent segfault when num_items is negative #3209

fix: adjust checks in ForthMachine to prevent segfault when num_items is negative #3209

ariostas commented Aug 9, 2024 •

edited

Loading

ariostas commented Aug 9, 2024

jpivarski commented Aug 9, 2024

ariostas commented Aug 9, 2024

jpivarski left a comment

ariostas commented Aug 9, 2024

ariostas commented Aug 9, 2024

ianna left a comment

jpivarski commented Sep 12, 2024

allcontributors bot commented Sep 12, 2024

fix: adjust checks in ForthMachine to prevent segfault when num_items is negative #3209

fix: adjust checks in ForthMachine to prevent segfault when num_items is negative #3209

Conversation

ariostas commented Aug 9, 2024 • edited Loading

ariostas commented Aug 9, 2024

jpivarski commented Aug 9, 2024

ariostas commented Aug 9, 2024

jpivarski left a comment

Choose a reason for hiding this comment

ariostas commented Aug 9, 2024

ariostas commented Aug 9, 2024

ianna left a comment

Choose a reason for hiding this comment

jpivarski commented Sep 12, 2024

allcontributors bot commented Sep 12, 2024

ariostas commented Aug 9, 2024 •

edited

Loading