CI: 04/22/25 upstream sync #377


Open
wants to merge 887 commits into base: rocm-main

Conversation

rocm-repo-management-api-2[bot]

Daily sync with upstream

Google-ML-Automation and others added 30 commits April 10, 2025 09:16
If the mesh axes are empty, we were setting the mesh to None, resulting in an
error in this test.

This fix provides an empty mesh when the mesh axes in the dumped module are empty.

PiperOrigin-RevId: 746058506
…rying with the correct vma as the operands were.

PiperOrigin-RevId: 746065965
PiperOrigin-RevId: 746117643
This fixes some non-intuitive errors where scalar-shaped values in VREGs were being used in operations that expected SREGs.

PiperOrigin-RevId: 746146037
…pes being enabled by default

PiperOrigin-RevId: 746146834
Adds a new WarpMesh object which, when used in conjunction with core_map, allows the user to drop into warp-level code rather than programming at the warpgroup level.

PiperOrigin-RevId: 746163942
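The warpgroup-to-warp decomposition can be pictured with a small pure-Python model. This is only a conceptual sketch of mapping a per-warp body over the warps in a warpgroup; it is not the actual Pallas/Mosaic WarpMesh or core_map API.

```python
# Conceptual model: a warpgroup of 128 threads splits into 4 warps of 32,
# and a "warp mesh" maps a per-warp body over those warps. Names here are
# illustrative stand-ins, not the real API.
WARPGROUP_SIZE = 128
WARP_SIZE = 32

def core_map_over_warps(body):
    """Run body once per warp, passing the warp index (0..3)."""
    n_warps = WARPGROUP_SIZE // WARP_SIZE
    return [body(warp_id) for warp_id in range(n_warps)]

# Each warp handles its own 32-element slice of a 128-wide row.
results = core_map_over_warps(lambda w: (w * WARP_SIZE, (w + 1) * WARP_SIZE))
print(results)  # [(0, 32), (32, 64), (64, 96), (96, 128)]
```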
This change primarily reduces sharding, although in a few cases it also increases it. Oversharding tests hurts performance, since each test run incurs a startup and teardown cost.

In a few cases, change tests to be non-accelerator tests.

PiperOrigin-RevId: 746164539
Partially addresses jax-ml#18246. If the compile result can also be a future, this code can be used to safely block on that as well.

PiperOrigin-RevId: 746189742
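The "safely block on a value that may be a future" pattern can be sketched with the standard library. The helper name `resolve` is illustrative, not the JAX-internal function.

```python
# Sketch: block on a value that may or may not be a Future, so callers can
# treat eager and asynchronous results uniformly.
from concurrent.futures import Future, ThreadPoolExecutor

def resolve(value):
    """Return value, blocking first if it is a Future."""
    if isinstance(value, Future):
        return value.result()  # blocks until the future completes
    return value

with ThreadPoolExecutor() as pool:
    fut = pool.submit(lambda: 42)
    print(resolve(fut))   # 42, after blocking on the future
    print(resolve("ok"))  # "ok", returned immediately
```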
yashk2810 and others added 28 commits April 18, 2025 13:11
…rly by adding the explicit mesh axis on dim 0.

PiperOrigin-RevId: 749125322
Array serialization in array_serialization.py contains a mixture of
JAX-specific serialization logic and the tensorstore driver. This change
separates the JAX and tensorstore methods, (a) making serialization more
modular and (b) potentially allowing for alternative array serialization
backends in the future.

Additional clean-up changes include:
- making the ocdbt kvstore driver the default in tensorstore
- making array serialization tests more robust, especially on multi-host setups
- explicit tensorstore array chunking to ensure chunk file sizes do not blow up

PiperOrigin-RevId: 749175295
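The separation described above can be sketched as an abstract storage backend behind the array-level logic, with explicit chunking bounding each write. All names here are illustrative, not the actual array_serialization or tensorstore API.

```python
# Sketch: array serialization logic talks to an abstract backend, so the
# tensorstore driver could be swapped for an alternative backend later.
import abc
import math

class StorageBackend(abc.ABC):
    @abc.abstractmethod
    def write_chunk(self, key: str, data: bytes) -> None: ...

class InMemoryBackend(StorageBackend):
    def __init__(self):
        self.store = {}
    def write_chunk(self, key, data):
        self.store[key] = data

def serialize_array(data: bytes, backend: StorageBackend, max_chunk_bytes: int):
    """Write data in explicitly sized chunks so no single chunk blows up."""
    n_chunks = max(1, math.ceil(len(data) / max_chunk_bytes))
    for i in range(n_chunks):
        chunk = data[i * max_chunk_bytes:(i + 1) * max_chunk_bytes]
        backend.write_chunk(f"chunk/{i}", chunk)
    return n_chunks

backend = InMemoryBackend()
print(serialize_array(b"x" * 1000, backend, max_chunk_bytes=256))  # 4
```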
…t to tracing cache after the sharding_in_types config was turned on, which led to `sharding` always being available on `ShapedArray`

PiperOrigin-RevId: 749206500
PiperOrigin-RevId: 749464614
PiperOrigin-RevId: 749779206
Description:
- Copy mlir module before adding new attributes

Fixes jax-ml#27991
…utation-27991

PiperOrigin-RevId: 749811476
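The fix is an instance of the copy-before-mutate pattern. The real change copies an MLIR module before annotating it; in this sketch a plain dict stands in for the module, and the function names are hypothetical.

```python
# Sketch: add attributes to a copy of a shared object instead of mutating
# the caller's instance in place, so repeated lowerings don't interfere.
import copy

def lower_with_attributes(module, new_attrs):
    """Return a lowered module carrying new_attrs, leaving the input untouched."""
    lowered = copy.deepcopy(module)   # copy first, then annotate
    lowered["attributes"].update(new_attrs)
    return lowered

original = {"body": "func @main() ...", "attributes": {}}
lowered = lower_with_attributes(original, {"donated_args": (0,)})
print(original["attributes"])  # {} -- the caller's module is unchanged
```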
…ed op instead of multiple .at[] calls.

PiperOrigin-RevId: 749818535
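The idea of collapsing several indexed updates into one batched operation can be shown with numpy's `np.add.at`, which applies all updates in a single call and handles repeated indices correctly. This mirrors the single-op-instead-of-multiple-`.at[]`-calls change above, though it is not the JAX code in question.

```python
# Sketch: one batched scatter-add instead of a loop of indexed updates.
import numpy as np

x = np.zeros(5)
indices = np.array([0, 2, 2, 4])
updates = np.array([1.0, 1.0, 1.0, 1.0])

# Instead of: for i, u in zip(indices, updates): x[i] += u
np.add.at(x, indices, updates)
print(x)  # [1. 0. 2. 0. 1.]  -- index 2 accumulates both updates
```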
Amend the scheme format and top-level domain.
…bstract eval

This can happen if a user forgets to unwrap a ref!

@asabne had this happen to him today and was confused about what was going on. The prior error was unclear:

AssertionError: (MemRef<None>{float32[2,1024,1024]}, MemRef<None>{float32[1,1024,1024]})
PiperOrigin-RevId: 749979253
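The improvement is replacing a bare assertion with a descriptive error when a ref reaches a primitive that expects a plain array. The sketch below uses illustrative stand-in classes, not JAX internals.

```python
# Sketch: raise a descriptive TypeError when a ref is passed unwrapped,
# instead of a bare AssertionError listing abstract values.
class MemRef:
    def __init__(self, shape):
        self.shape = shape
    def __repr__(self):
        return f"MemRef{self.shape}"

def abstract_eval(*operands):
    for op in operands:
        if isinstance(op, MemRef):
            raise TypeError(
                f"primitive received a ref {op!r}; did you forget to "
                f"unwrap it with ref[...] before passing it in?")
    return operands

try:
    abstract_eval(MemRef((2, 1024, 1024)))
except TypeError as e:
    print(e)
```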
…ray creation when possible

This change makes use of the new
`xla::ifrt::Client::MakeArraysFromHostBufferShards()` API when possible. This
API needs a single call to create a multi-shard IFRT Array (to be wrapped as a
JAX `PyArray`), which provides more optimization opportunities for the runtime
than creating single-device IFRT Arrays and then assembling them. Note
that the `xla::ifrt::Client::MakeArraysFromHostBufferShards()` implementation
in PjRt-IFRT is not yet optimized, so there are no immediate performance
benefits for McJAX.

As an exception, it takes the previous array-assembly path if any shard for
`BatchedDevicePut` is not a host buffer but already a single-device array,
because `xla::ifrt::Client::MakeArraysFromHostBufferShards()` requires all
sharded inputs to be host buffers.

With batching possible at the IFRT level, we now skip the `DevicePutResultFn`
step; `DevicePut` (now `DevicePutWithDevice` and `DevicePutWithSharding`)
internally calls per-shard functions (with the GIL released) and returns a
final IFRT Array.

This change includes a code cleanup for
`xla::DevicePutResult::owning_pybuffer`, which was originally intended to hold
a Python object to keep an IFRT Array valid when it is created from
`DevicePut()` implementations, but this role has been entirely covered by
`on_done_with_host_buffer` function supplied at IFRT Array creation time.

PiperOrigin-RevId: 749989229
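The dispatch between the batched path and the assembly fallback can be sketched in plain Python. Function and class names here are illustrative stand-ins (with numpy arrays playing the role of buffers), not the actual IFRT API.

```python
# Sketch: build a multi-shard array in one batched call when every shard is
# a host buffer; fall back to per-shard assembly when some shard is already
# a single-device array.
import numpy as np

class DeviceArray:  # stand-in for an already-materialized device array
    def __init__(self, data):
        self.data = np.asarray(data)

def make_array(shards):
    if all(not isinstance(s, DeviceArray) for s in shards):
        # Fast path: one batched call over all host buffers.
        return np.stack([np.asarray(s) for s in shards]), "batched"
    # Fallback: assemble from per-shard arrays.
    parts = [s.data if isinstance(s, DeviceArray) else np.asarray(s)
             for s in shards]
    return np.stack(parts), "assembled"

_, path = make_array([[1, 2], [3, 4]])
print(path)  # "batched"
_, path = make_array([[1, 2], DeviceArray([3, 4])])
print(path)  # "assembled"
```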
@rocm-repo-management-api-2 rocm-repo-management-api-2 bot requested a review from a team as a code owner April 22, 2025 06:02
@rocm-repo-management-api-2 rocm-repo-management-api-2 bot enabled auto-merge (rebase) April 22, 2025 06:02