Threaded operation #34

dmopalmer · 2023-01-30T20:38:43Z

This is a renewal of an old pull request. (Previously not accepted because @lpsinger intended to implement async instead of threading.)

This supports threaded operation by providing a handler that queues messages.


lpsinger · 2023-01-30T20:48:21Z

@dmopalmer, I'm sorry, but I am still convinced that asyncio is a better direction for this package than threading, because it's so I/O intensive.

dmopalmer · 2023-02-01T18:23:47Z

Asyncio is probably a better direction in theory, but threading is available with this pull request in time for the next LIGO-VIRGO-ETC run. This request doesn't affect the core functionality in any major way, and won't prevent asyncio from being implemented later.

lpsinger · 2023-02-01T19:39:57Z

This PR really contains several mostly-separable changes that could be broken into multiple PRs:

1. Addition of a termination mechanism (beyond mere signals) to listen().
2. Addition of a handler that inserts events into a queue.
3. An extra listener CLI tool that uses multithreading.
4. Sample code in the README file for calling listen() in a thread.

Here are my reactions to each of these:

1. I would accept this in principle, although there are a few problems with this implementation:
   - It assumes that the calling program is using threading. I expect the termination mechanism to be agnostic toward the calling program's style of concurrency, because that decision is a higher-level concern that should not impact this library.
   - It modifies the semantics of the socket timeouts. The impact should be neutral on the protocol timing.
   - The termination mechanism is only checked in the receiving loop. It is not checked during the connect loop.
   - I would prefer (but I do not require) an implementation with asyncio so that it could use task cancellation. This is the same comment that I had in Allow threaded operation #6.
2. I would readily accept this.
3. I require further justification for why the new CLI tool complements the existing pygcn-listen script.
4. I would readily accept this.

dmopalmer · 2023-02-02T00:11:55Z

1. You are right, using a {thread,process}.terminate() is cleaner than the stopevent-sentinel approach and means that the reader doesn't have to timeout to check the sentinel and then jump back to reading.
2. I will keep this
3. The new CLI is more of an example implementation than a useful utility. Instead I can fold it in to pygcn-listen with a --threaded command line argument to avoid burdening the command-line namespace, or I can remove its entry point from the setup.py file, and leave it in-place in the .py file with a comment of how to put it back in setup.py.
4. OK

So if I

1. remove the sentinel code and
2. no change
3. update the threaded_listen_main() to remove the sentinel; use .terminate(); and remove its entry point
4. update the README to use terminate

would that be a pull request you are happy with?

For eventual asyncio upgrade, I haven't found in the documentation how you get individual voevents as they come in. All the tutorials and documentation I have seen assume you want to create a bunch of tasks and await until they complete before using the complete results.

lpsinger · 2023-02-02T03:20:55Z

> 1. You are right, using a {thread,process}.terminate() is cleaner than the stopevent-sentinel approach and means that the reader doesn't have to timeout to check the sentinel and then jump back to reading.
> 2. I will keep this
> 3. The new CLI is more of an example implementation than a useful utility. Instead I can fold it in to pygcn-listen with a --threaded command line argument to avoid burdening the command-line namespace, or I can remove its entry point from the setup.py file, and leave it in-place in the .py file with a comment of how to put it back in setup.py.
> 4. OK
>
> So if I
>
> 1. remove the sentinel code and
> 2. no change

If 2 will help your application, I would accept it as a self-contained PR.

> 3. update the threaded_listen_main() to remove the sentinel; use .terminate(); and remove its entry point

What is the value of introducing a thread here? The pygcn-listen CLI entry point will terminate gracefully anyway if the process receives SIGINT (C-c).

> 4. update the README to use terminate

What is terminate? Python's threading.Thread class doesn't have a terminate method.

> For eventual asyncio upgrade, I haven't found in the documentation how you get individual voevents as they come in. All the tutorials and documentation I have seen assume you want to create a bunch of tasks and await until they complete before using the complete results.

I can think of at least two possible styles of async APIs. The first is that the listen method could return an asynchronous iterator suitable for use in an async for loop in the calling program. The second is that the calling program simply passes a handler callback, as it currently does. The callback need not be asynchronous.
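A minimal sketch of the two styles described above, not tied to pygcn. `fake_packets`, `listen_iter`, and `listen_callback` are hypothetical names, and the packet source is a stand-in for a real socket reader.

```python
import asyncio


async def fake_packets():
    """Hypothetical stand-in for a VOEvent socket reader."""
    for i in range(3):
        await asyncio.sleep(0)  # yield to the event loop, as a real read would
        yield f"<voevent ivorn='ivo://example/{i}'/>".encode()


# Style 1: listen() is an asynchronous iterator consumed with `async for`.
async def listen_iter():
    async for payload in fake_packets():
        yield payload


# Style 2: listen() takes a plain (synchronous) handler callback.
async def listen_callback(handler):
    async for payload in fake_packets():
        handler(payload)


async def main():
    seen_iter = [payload async for payload in listen_iter()]
    seen_cb = []
    await listen_callback(seen_cb.append)
    return seen_iter, seen_cb


seen_iter, seen_cb = asyncio.run(main())
```

In both styles the caller sees each packet as soon as it arrives; the difference is only whether the caller or the library owns the loop.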

I have a working asyncio VOEvent Transport Protocol client here: https://github.com/nasa-gcn/gcn-classic-to-kafka/blob/main/gcn_classic_to_kafka/socket.py

dmopalmer · 2023-02-02T05:25:47Z

Yeah, it appears that the only way to gracefully terminate a thread is cooperatively, such as by the sentinel method, which requires the socket reads to timeout so the sentinel can be checked.
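The cooperative pattern described above can be sketched roughly as follows. This is an illustration using a local socket pair, not the code from this pull request; `read_until_stopped` and `poll_interval` are hypothetical names.

```python
import socket
import threading
import time


def read_until_stopped(sock, stop_event, received, poll_interval=0.05):
    """Read from sock until stop_event is set, waking up periodically."""
    sock.settimeout(poll_interval)
    while not stop_event.is_set():
        try:
            data = sock.recv(4096)
        except socket.timeout:
            continue  # no data yet: loop around and re-check the sentinel
        if not data:
            break  # peer closed the connection
        received.append(data)


server, client = socket.socketpair()
stop_event = threading.Event()
received = []
thread = threading.Thread(
    target=read_until_stopped, args=(server, stop_event, received))
thread.start()

client.sendall(b"packet")

# Wait (up to 2 s) for the reader to pick up the packet, then ask it to stop.
deadline = time.monotonic() + 2.0
while not received and time.monotonic() < deadline:
    time.sleep(0.01)
stop_event.set()
thread.join(timeout=2.0)

server.close()
client.close()
```

The cost of this approach is visible in the loop: the read timeout exists only so that the sentinel can be polled, which is the change in timeout semantics raised earlier in the review.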

Probably the way forward is to not have any way to stop the thread (apart from program termination). YAGNI. The documentation should suggest using processes instead if killing the listener will be necessary.

For 3. the value of introducing a thread is to show the user how to use threads. Probably the extra section in the README is sufficient for that and we can remove threaded_listen_main() altogether. I also wrote that function as a test case for debugging.

lpsinger · 2023-02-02T07:37:09Z

> For 3. the value of introducing a thread is to show the user how to use threads. Probably the extra section in the README is sufficient for that and we can remove threaded_listen_main() altogether. I also wrote that function as a test case for debugging.

But why is it useful to show the user how to use threads? I do not see how this is any different from placing any other Python code in a thread. Surely users can read the Python standard library documentation on threads.

dmopalmer · 2023-02-06T17:47:51Z

The only use in showing the user how to use threads is to show how to use gcn.handlers.queuehandlerfor(), which is covered in the addition to README.md.

> I have a working asyncio VOEvent Transport Protocol client here: https://github.com/nasa-gcn/gcn-classic-to-kafka/blob/main/gcn_classic_to_kafka/socket.py

That client immediately awaits on process() or read(), so I don't see how it provides a non-blocking way to check for the next message.

lpsinger · 2023-02-06T18:25:04Z

> That client immediately awaits on process() or read(), so I don't see how it provides a non-blocking way to check for the next message.

I think you'd just use task cancellation.
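For reference, a minimal sketch of asyncio task cancellation. The `reader` coroutine is a hypothetical stand-in for a client blocked on a socket read; it is not taken from the linked client.

```python
import asyncio


async def reader(received):
    """Hypothetical reader that spends its time awaiting I/O."""
    try:
        while True:
            await asyncio.sleep(3600)  # stands in for awaiting a socket read
            received.append(b"packet")
    except asyncio.CancelledError:
        received.append(b"cleanup")  # e.g. close the socket here
        raise  # re-raise so the task is marked as cancelled


async def main():
    received = []
    task = asyncio.create_task(reader(received))
    await asyncio.sleep(0)  # let the reader start and block on its await
    task.cancel()  # interrupts the pending await inside reader()
    try:
        await task
    except asyncio.CancelledError:
        pass
    return task.cancelled(), received


cancelled, received = asyncio.run(main())
```

Cancellation ends the task at its current await; it does not by itself offer a way to peek for a pending message and then resume.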

dmopalmer · 2023-02-08T20:43:46Z

I don't want to cancel the task and then restart it and lose the messages that came in while it was cancelled.

My use case, which is probably common, is to have a telescope make observations of yesterday's GRB while keeping a VOEvent socket open to quickly change observations to a new LIGO event coming in. So after each exposure I check the message queue for something more interesting.
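A rough sketch of that polling pattern, assuming a listener thread fills a `queue.Queue`. Here a single alert is queued by hand, and `poll_between_exposures` is a hypothetical name.

```python
import queue
import threading


def poll_between_exposures(messages, n_exposures):
    """Run exposures, checking for new alerts after each one."""
    log = []
    for i in range(n_exposures):
        log.append(f"exposure {i}")  # stands in for a long telescope exposure
        try:
            alert = messages.get_nowait()  # never blocks
        except queue.Empty:
            continue  # nothing new: carry on with the current target
        log.append(f"retarget to {alert}")
    return log


messages = queue.Queue()

# A listener thread would normally fill the queue; one alert is queued here.
producer = threading.Thread(target=messages.put, args=("new-event",))
producer.start()
producer.join()

log = poll_between_exposures(messages, 3)
```

Because the listener keeps running between checks, alerts that arrive during an exposure wait in the queue rather than being lost.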

dmopalmer · 2023-04-05T00:22:45Z

I got back to this in preparation for the LIGO/et al. run.

I have stripped out the stopevent capability. The changes consist of a queue handler implementation and a description in the README of how to use it.

dmopalmer · 2023-08-04T03:01:09Z

Just re-pinging this pull request.

lpsinger

I would be content with adding the new handler, but not the threading code sample.

lpsinger · 2023-08-04T11:47:28Z

setup.cfg

@@ -28,6 +28,7 @@ classifiers =
    Programming Language :: Python :: 3.8
    Programming Language :: Python :: 3.9
    Programming Language :: Python :: 3.10
+    Programming Language :: Python :: 3.11


This is a nice addition, but not related to the topic of this PR.

Suggested change

Programming Language :: Python :: 3.11

I don't know why GitHub is showing this in the diff. It's already on the main branch, as of #35. Would you please rebase?

lpsinger · 2023-08-04T11:49:48Z

gcn/handlers.py

+    queue.put((payload, root))
+
+
+def queuehandlerfor(queue):


The other handlers have function names that are verbs or verb phrases. Please rename this one to be consistent. Perhaps `enqueue` or `put_queue`?
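For illustration only, a handler factory of this kind might look like the sketch below. The name `put_queue` and the body are assumptions based on the visible diff fragment, not the code under review.

```python
import queue


def put_queue(q):
    """Return a handler that enqueues each (payload, root) pair on q.

    The name follows the verb-phrase convention; it is hypothetical.
    """
    def handler(payload, root):
        q.put((payload, root))
    return handler


q = queue.Queue()
handler = put_queue(q)

# Call the handler the way a listener would, with placeholder arguments.
handler(b"<xml/>", "root-placeholder")
item = q.get_nowait()
```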

lpsinger · 2023-08-04T11:54:20Z

README.md

+## Threading
+
+You can run the listener in a separate thread or process and pass the packets back in a `Queue`,
+allowing the main program to continue operating while waiting for an event.
+Here is an example:
+
+```python
+#!/usr/bin/env python
+import gcn
+import threading
+import queue
+
+# Set up communications:
+messagequeue = queue.Queue()
+# Create a listen handler to enqueue the (payload, root) tuple
+handler = gcn.handlers.queuehandlerfor(messagequeue)
+
+# Create and start the thread.
+thread = threading.Thread(target=gcn.listen,
+            kwargs=dict(handler=handler))
+thread.start()
+
+# Wait for messages to come in, but do other things if they don't.
+nothingcount=0
+while True:
+    try:
+        # Use block=False if you want to timeout immediately 
+        payload,root = messagequeue.get(timeout=10)
+        print(root.attrib['ivorn'])
+        nothingcount = 0
+    except queue.Empty:
+        # Do idle stuff here.
+        print("Nothing...")
+        nothingcount += 1
+        if nothingcount > 10:
+            print("Quitting due to inactivity")
+            break
+```
+


Please remove this. If someone asked how to run the listener in a thread, this isn't how I would suggest to them that they do it. (I would suggest to them that they use an ordinary handler and just launch gcn.listen in a thread or a subprocess.)

Also, this is missing calls to queue.task_done().
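As a sketch of where `task_done()` fits, assuming the producer wants to wait on `Queue.join()`. The `consumer` function and the `None` sentinel are illustrative choices, not part of the README sample.

```python
import queue
import threading

work = queue.Queue()
processed = []


def consumer():
    while True:
        item = work.get()
        try:
            if item is None:  # sentinel: stop consuming
                return
            processed.append(item)
        finally:
            work.task_done()  # one call for every successful get()


thread = threading.Thread(target=consumer)
thread.start()

for item in ("a", "b", "c"):
    work.put(item)
work.put(None)

work.join()  # returns only after task_done() has been called for every item
thread.join()
```

Without the `task_done()` calls, `work.join()` would block indefinitely.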

lpsinger · 2023-08-04T12:03:55Z

FYI, passing the lxml root object through a queue might not be compatible with queues under multiprocessing.
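One possible workaround, sketched under the assumption that the parsed lxml element cannot be pickled: send only the raw payload bytes and re-parse on the receiving side. The example uses the standard library's `xml.etree.ElementTree` to stay self-contained and simulates the queue transport with `pickle` directly; the payload is made up.

```python
import pickle
import xml.etree.ElementTree as ET

payload = (
    b'<voe:VOEvent xmlns:voe="http://www.ivoa.net/xml/VOEvent/v2.0" '
    b'ivorn="ivo://example/1"/>'
)

# Simulate a multiprocessing queue: pickle on put, unpickle on get.
wire = pickle.dumps(payload)
received = pickle.loads(wire)

# Re-parse on the receiving side instead of sending the parsed tree.
root = ET.fromstring(received)
ivorn = root.attrib["ivorn"]
```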

dmopalmer and others added 17 commits December 1, 2018 11:29

Allow threaded operation, including a handler that places the messages on the queue and an event to stop the thread.

b3349bb

Allow threaded operation, including a handler that places the messages on the queue and an event to stop the thread.

6bcf153

Merge branch 'threaded' of github.com:dmopalmer/pygcn into threaded

ed7e8c4

PEP-8 cleanup.

2725775

Allow threaded operation, including a handler that places the messages on the queue and an event to stop the thread.

fa06b7f

Allow threaded operation, including a handler that places the messages on the queue and an event to stop the thread.

836f229

PEP-8 cleanup.

19e8a5f

Merge branch 'threaded' of github.com:dmopalmer/pygcn into threaded

c4e3615

Merge branch 'master' of github.com:lpsinger/pygcn into threaded

8b46d45

Merge branch 'master' of github.com:lpsinger/pygcn into threaded

0b6635b

Removed accidental duplication. Fixed pep-8 (required by Travis).

2fe080b

Merge branch 'master' of github.com:lpsinger/pygcn into threaded

9de70e9

Merge branch 'main' of github.com:lpsinger/pygcn into threaded101

3828390

Added extra and keyword args to the wrappers.

7b309e7

Merge branch 'handler_args' of github.com:dmopalmer/pygcn into threaded101

6b98dfe

Merge branch 'main' of github.com:lpsinger/pygcn into threaded102

b1f8369

Merge branch 'nasa-gcn:main' into threaded102

4b88669

dmopalmer added 2 commits April 4, 2023 17:54

Removed stopevent capability to simplify threaded operation.

86c4e5c

Readme change

5db02d4

lpsinger requested changes Aug 4, 2023
