Issue 725 #731

GawyWOOOHOOO · 2024-11-16T22:17:58Z

Adding a flag to track whether any process was parallelized
Track execution time for each process
Modify Schedule.run() to generate feedback messages based on the flag and execution time status.

2. Track execution time for each process 3. Modify Schedule.run() to generate feedback messages based on the flag and execution time status.

github-actions · 2024-11-16T22:20:28Z

OS =
CPU =
Ram =
Hash = 71dc3ea
Kernel=
||
|-|-|-|-|-|-|-|-|-|

github-actions · 2024-11-16T22:21:03Z

OS:ubuntu-20.04
Sat Nov 16 22:21:02 UTC 2024
intro: 2/2 tests passed.
interface: 41/41 tests passed.
compiler: 54/54 tests passed.

angelhof · 2024-11-18T01:38:43Z

compiler/pash.py

@@ -35,6 +35,9 @@ def main():
        return_code = preprocess_and_execute_asts(input_script_path, args, input_script_arguments, shell_name)

        log("-" * 40) #log end marker
+
+        if args.debug >= 1:
+            log("Use the '-d 1' option for detailed debugging information.")


I am not exactly sure what this is trying to do here.

angelhof · 2024-11-18T01:39:41Z

compiler/pash_compilation_server.py

@@ -295,6 +295,11 @@ def compile_and_add(self, compiled_script_file, var_file, input_ir_file):
            pass
        else:
            self.running_procs += 1
+
+        if ast_or_ir is not None:
+            compile_success = True


Why is this variable set here? Isn't is set previously?

angelhof

Thanks for the PR! I have left some comments with questions and here are some more needed changes to get this merged:

Rebase against binpash:future which is the branch of PaSh where we make all new changes.
Add a short example usage of this (for example with an echo hi script and with a cat README.md | grep "foo" script in docs/tutorial/tutorial.md
Add a test to check that this behavior happens (for echo hi and cat README.md | grep "foo"). We need to create a new test category in this script (https://github.com/binpash/pash/blob/future/scripts/run_tests.sh) that we can call api_tests. And the test should check that if pash is invoked on these two scripts, its standard error contains these two messages.

angelhof · 2024-11-18T01:46:06Z

compiler/pash_compilation_server.py

+
+        if ast_or_ir is not None:
+            compile_success = True
+            if run_parallel:


This does not indicate whether a fragment of the script was parallelized successfully. compile_success checks whether a script region was compiled successfully (which means that it was successfully translated into a dataflow graph, which means that we have annotations for all commands in it and the annotations for all commands are pure, parallelizable pure, or stateless). However, we need to also check if there was any parallelization transformation applied (which has to be kept as state and checked further in the compiler (see this function: https://github.com/binpash/pash/blob/future/compiler/pash_compiler.py#L227).

angelhof · 2024-11-18T01:50:41Z

compiler/pash_compilation_server.py

@@ -336,9 +341,19 @@ def handle_exit(self, input_cmd):
        ## Get the execution time        
        command_finish_exec_time = datetime.now()
        command_start_exec_time = self.process_id_input_ir_map[process_id].get_start_exec_time()
-        exec_time = (command_finish_exec_time - command_start_exec_time) / timedelta(milliseconds=1)
+        exec_time = (command_finish_exec_time - command_start_exec_time).total_seconds()


why is this changed?

angelhof · 2024-11-18T01:51:58Z

compiler/pash_compilation_server.py

        log("Process:", process_id, "exited. Exec time was:", exec_time)
        self.handle_time_measurement(process_id, exec_time)
+
+        proc_info = self.process_id_input_ir_map[process_id]
+        if proc_info.compiler_config.width > 1:  # Check if it was parallelized


This doesn't check if parallelization was successful, but just whether the compiler would even try to parallelize.

angelhof · 2024-11-18T01:53:50Z

compiler/pash_compiler.py

@@ -92,6 +92,8 @@ def compile_ir(ir_filename, compiled_script_file, args, compiler_config):
    ret = None
    try:
        ret = compile_optimize_output_script(ir_filename, compiled_script_file, args, compiler_config)
+        if ret is None:


I don't think this means that there were no parallelization opportunities for the whole script, but only for this region. Also, I think it might be subsumed by the exception handling. Is this code ever called?

angelhof · 2024-11-18T02:01:06Z

compiler/pash_compilation_server.py

@@ -414,6 +429,16 @@ def run(self):
            self.parse_and_run_cmd(input_cmd)

        self.connection_manager.close()
+        if not self.parallelized_flag:
+            log("No parts of the input script were parallelized. Ensure commands are annotated for parallelization.")


These messages need to be logged no matter the debug level, so they should be given a different level (the way it is now they will only be printed if we have -d 1. Also it would be good to prefix them with [PaSh Warning]. Here is some wordsmithing to make them a bit clearer too:

[PaSh Warning] No region of the script was parallelized. Maybe you are missing relevant annotations? Use -d 1 for more info.

[PaSh Warning] Some script regions were parallelized but their execution times were negligible (<1s). If your script takes a long time maybe annotations are missing from relevant regions. Use -d 1 for more info.

angelhof · 2024-11-18T02:01:55Z

compiler/pash_compilation_server.py

+        if not self.parallelized_flag:
+            log("No parts of the input script were parallelized. Ensure commands are annotated for parallelization.")
+        elif all(
+            proc_info.exec_time is not None and proc_info.exec_time < 1


Have we checked that this ever actually passes? I am skeptical about the all proc_info.exec_time is not None. Is there anyway one of those is None and this becomes false?

angelhof · 2024-11-18T02:02:32Z

compiler/pash_compilation_server.py

+            for proc_info in self.process_id_input_ir_map.values()
+        ):
+            log("Some script fragments were parallelized, but their execution times were negligible.")
+            log("Consider optimizing your script to include longer-running tasks.")


No need for this message, this is not really an optimization that we are asking them to do, but rather making sure that they have annotations for the long-running parts that they care about.

angelhof · 2024-11-18T02:02:44Z

compiler/pash_compilation_server.py

+            log("Some script fragments were parallelized, but their execution times were negligible.")
+            log("Consider optimizing your script to include longer-running tasks.")
+        else:
+            log("Parallelization completed successfully.")


Not necessary, should be deleted.

github-actions · 2024-11-29T23:05:28Z

OS =
CPU =
Ram =
Hash = ba3d0c0
Kernel=
||
|-|-|-|-|-|-|-|-|-|

github-actions · 2024-11-29T23:05:55Z

OS:ubuntu-20.04
Fri Nov 29 23:05:55 UTC 2024
intro: 2/2 tests passed.
interface: 41/41 tests passed.
compiler: 54/54 tests passed.

github-actions · 2024-11-29T23:37:50Z

OS:ubuntu-20.04
Fri Nov 29 23:37:50 UTC 2024
intro: 0/2 tests passed.
interface: 8/41 tests passed.
compiler: 38/54 tests passed.
demo-spell.sh are not identical
hello-world.sh are not identical
test1 are not identical
test2 are not identical
test3 are not identical
test4 are not identical
test5 are not identical
test6 are not identical
test8 are not identical
test9 are not identical
test10 are not identical
test12 are not identical
test13 are not identical
test14 are not identical
test15 are not identical
test16 are not identical
test17 are not identical
test18 are not identical
test_set are not identical
test_set_e are not identical
test_redirect are not identical
test_unparsing are not identical
test_set_e_2 are not identical
test_set_e_3 are not identical
test_new_line_in_var are not identical
test_cmd_sbst are not identical
test_cmd_sbst2 are not identical
test_trap are not identical
test_umask are not identical
test_var_assgn_default are not identical
test_exclam are not identical
test_redir_var_test are not identical
test_star are not identical
test_env_vars are not identical
test_redir_dup are not identical
diff.sh are not identical
diff.sh are not identical
set-diff.sh are not identical
set-diff.sh are not identical
export_var_script.sh are not identical
export_var_script.sh are not identical
comm-par-test.sh are not identical
comm-par-test.sh are not identical
comm-par-test2.sh are not identical
comm-par-test2.sh are not identical
tee_web_index_bug.sh are not identical
tee_web_index_bug.sh are not identical
fun-def.sh are not identical
fun-def.sh are not identical
bigrams.sh are not identical
bigrams.sh are not identical

github-actions · 2024-11-29T23:38:14Z

OS =
CPU =
Ram =
Hash = 4896a16
Kernel=
||
|-|-|-|-|-|-|-|-|-|

github-actions · 2024-11-29T23:55:40Z

OS:ubuntu-20.04
Fri Nov 29 23:55:40 UTC 2024
intro: 0/2 tests passed.
interface: 8/41 tests passed.
compiler: 38/54 tests passed.
demo-spell.sh are not identical
hello-world.sh are not identical
test1 are not identical
test2 are not identical
test3 are not identical
test4 are not identical
test5 are not identical
test6 are not identical
test8 are not identical
test9 are not identical
test10 are not identical
test12 are not identical
test13 are not identical
test14 are not identical
test15 are not identical
test16 are not identical
test17 are not identical
test18 are not identical
test_set are not identical
test_set_e are not identical
test_redirect are not identical
test_unparsing are not identical
test_set_e_2 are not identical
test_set_e_3 are not identical
test_new_line_in_var are not identical
test_cmd_sbst are not identical
test_cmd_sbst2 are not identical
test_trap are not identical
test_umask are not identical
test_var_assgn_default are not identical
test_exclam are not identical
test_redir_var_test are not identical
test_star are not identical
test_env_vars are not identical
test_redir_dup are not identical
diff.sh are not identical
diff.sh are not identical
set-diff.sh are not identical
set-diff.sh are not identical
export_var_script.sh are not identical
export_var_script.sh are not identical
comm-par-test.sh are not identical
comm-par-test.sh are not identical
comm-par-test2.sh are not identical
comm-par-test2.sh are not identical
tee_web_index_bug.sh are not identical
tee_web_index_bug.sh are not identical
fun-def.sh are not identical
fun-def.sh are not identical
bigrams.sh are not identical
bigrams.sh are not identical

github-actions · 2024-11-29T23:56:01Z

OS =
CPU =
Ram =
Hash = c4808ee
Kernel=
||
|-|-|-|-|-|-|-|-|-|

angelhof · 2024-11-30T23:39:36Z

This needs to be rebased for the future branch BTW :)

1. Adding a flag to track whether any process was parallelized

71dc3ea

2. Track execution time for each process 3. Modify Schedule.run() to generate feedback messages based on the flag and execution time status.

angelhof reviewed Nov 18, 2024

View reviewed changes

angelhof mentioned this pull request Nov 18, 2024

Adding a warning/info message when nothing was parallelized successfully #725

Open

angelhof requested changes Nov 18, 2024

View reviewed changes

Revised Informational Message in the pash.py

ba3d0c0

Update pash_compilation_server.py on the comments

4896a16

Update pash_compiler.py

c4808ee

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue 725 #731

Issue 725 #731

GawyWOOOHOOO commented Nov 16, 2024

github-actions bot commented Nov 16, 2024

github-actions bot commented Nov 16, 2024

angelhof Nov 18, 2024

angelhof Nov 18, 2024 •

edited

Loading

angelhof left a comment

angelhof Nov 18, 2024

angelhof Nov 18, 2024

angelhof Nov 18, 2024

angelhof Nov 18, 2024

angelhof Nov 18, 2024

angelhof Nov 18, 2024

angelhof Nov 18, 2024

angelhof Nov 18, 2024

github-actions bot commented Nov 29, 2024

github-actions bot commented Nov 29, 2024

github-actions bot commented Nov 29, 2024

github-actions bot commented Nov 29, 2024

github-actions bot commented Nov 29, 2024

github-actions bot commented Nov 29, 2024

angelhof commented Nov 30, 2024

Issue 725 #731

Are you sure you want to change the base?

Issue 725 #731

Conversation

GawyWOOOHOOO commented Nov 16, 2024

github-actions bot commented Nov 16, 2024

github-actions bot commented Nov 16, 2024

Choose a reason for hiding this comment

angelhof Nov 18, 2024 • edited Loading

Choose a reason for hiding this comment

angelhof left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

github-actions bot commented Nov 29, 2024

github-actions bot commented Nov 29, 2024

github-actions bot commented Nov 29, 2024

github-actions bot commented Nov 29, 2024

github-actions bot commented Nov 29, 2024

github-actions bot commented Nov 29, 2024

angelhof commented Nov 30, 2024

angelhof Nov 18, 2024 •

edited

Loading