Releases: redpanda-data/redpanda
v23.2.28
Bug Fixes
- Fix a race condition between suffix truncation / delete records and adjacent segment compaction that can lead to crashes and data loss. by @nvartolomei in #17254
- Fix initial_leader_epoch/KIP-320 handling in fetch requests. It was ignored until now, which prevented consumers from correctly detecting suffix truncation. For Redpanda (and Raft), this is a minor problem since suffix truncation is a very improbable event. by @nvartolomei in #17727
- Fixes a bug where config_frontend methods were called on shards other than the controller shard. by @pgellert in #17211
- Prevent detecting leader epoch advancement when state is not up to date by @mmaslankaprv in [#17882](https://github.com/redpanda-data/redpanda/pull/17882)
- Reduces maximum log line size from `1MiB` to `128KiB` to reduce occurrences of memory allocation failures by @michael-redpanda in #17924
- #16612 fixes small inconsistency between Kafka and Redpanda when trying to query end_offset of an empty log by @mmaslankaprv in #17881
- #17238 Fixes a bug in CreateTopicsResponse to now return all the configs of the topic, not just the topic-specific override configs. by @pgellert in #17241
- #17790 Fix a bug that could lead to raft log inconsistencies when 2 out of 3 nodes in a configuration are changed. by @ztlpn in #17797
- prevents partial consumer group recovery by @mmaslankaprv in #17882
- PR #17160 [v23.2.x] compression: Allocate memory for LZ4_compressEnd by @abhijat
- PR #17826 [v23.2.x] CORE-1722: compression: Use preallocated decompression buffers for lz4 by @abhijat
- PR #17881 [v23.2.x] k/replicated_partition: fixed querying end offset of an empty log by @nvartolomei
- PR #17882 Backport of #17673 #17498 #16560 by @mmaslankaprv
Improvements
- Adds a new public metric redpanda_raft_recovery_partition_movement_consumed_bandwidth that tracks how much bandwidth is currently in use for raft recovery. This helps tune raft_learner_recovery_rate. by @bharathv in #17217
- PR #17397 [v23.2.x] k/group: recover leader epoch on leader change by @nvartolomei
- PR #17448 [v23.2.x] tx: fix param ordering in log statement by @nvartolomei
- PR #17577 [v23.2.x] c/topics_frontend: break the loop when dispatching to current leader by @mmaslankaprv
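When tuning raft_learner_recovery_rate, it can help to read the new redpanda_raft_recovery_partition_movement_consumed_bandwidth metric programmatically. The following is a minimal sketch that sums the samples of one metric from Prometheus text-format output. The sample payload, its `shard` label, its values, and the helper name `sum_metric` are illustrative assumptions, not output captured from a real broker.

```python
METRIC = "redpanda_raft_recovery_partition_movement_consumed_bandwidth"

# Hypothetical scrape output; the label name and numbers are made up.
SAMPLE = """\
# HELP redpanda_raft_recovery_partition_movement_consumed_bandwidth (hypothetical help text)
# TYPE redpanda_raft_recovery_partition_movement_consumed_bandwidth gauge
redpanda_raft_recovery_partition_movement_consumed_bandwidth{shard="0"} 1048576
redpanda_raft_recovery_partition_movement_consumed_bandwidth{shard="1"} 524288
some_other_metric 42
"""


def sum_metric(exposition_text: str, metric_name: str) -> float:
    """Sum every sample of metric_name in Prometheus text-format output."""
    total = 0.0
    for line in exposition_text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        if "{" in line:
            # Sample with labels: name{label="value",...} value [timestamp]
            name = line[: line.index("{")]
            rest = line[line.rindex("}") + 1 :]
        else:
            # Sample without labels: name value [timestamp]
            name, _, rest = line.partition(" ")
        if name != metric_name:
            continue
        # The first token after the name/labels is the sample value.
        total += float(rest.split()[0])
    return total


if __name__ == "__main__":
    print(sum_metric(SAMPLE, METRIC))
```

Summing across samples is one possible aggregation; depending on how the metric is labeled in a given deployment, inspecting each sample separately may be more useful.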
Full Changelog: v23.2.27...v23.2.28
v23.3.12
Bug Fixes
- Fix an issue where `rpk transform logs` waits for records without the `--follow` flag specified. by @rockwotj in #17837
- Fixes `rpk transform init --install-deps` so that an explicit true value is not needed. by @rockwotj in #17867
- Fixes a crash when data transforms error and restart by @rockwotj in [#17696](https://github.com/redpanda-data/redpanda/pull/17696)
- Reduces maximum log line size from `1MiB` to `128KiB` to reduce occurrences of memory allocation failures by @michael-redpanda in #17923
- #16612 fixes small inconsistency between Kafka and Redpanda when trying to query end_offset of an empty log by @mmaslankaprv in #17809
- #17718 Fix reported config source for cleanup.policy by reporting DEFAULT_CONFIG instead of DYNAMIC_TOPIC_CONFIG for the default value. by @pgellert in [#17719](https://github.com/redpanda-data/redpanda/pull/17719)
- #17791 Fix a bug that could lead to raft log inconsistencies when 2 out of 3 nodes in a configuration are changed. by @ztlpn in #17796
- #17817 Fix problem in Tiered-Storage that could potentially cause consumers to get stuck by @Lazin in #17818
- #17891 fix a race between eviction and producer registration that results in an invalid transaction state. by @bharathv in #17900
Improvements
- Handle missing data transform logs topic in `rpk transform logs` by @rockwotj in [#17835](https://github.com/redpanda-data/redpanda/pull/17835)
- #17197 more accurate node status reporting by @mmaslankaprv in #17698
- Skips the overhead of collecting a node health report for each node separately. by @mmaslankaprv in #17864
- PR #17756 [v23.3.x] kafka: chunked_vector for config responses by @pgellert
- PR #17792 [v23.3.x] CORE-1752: cst: improved logging by @abhijat
- PR #17825 [v23.3.x] CORE-1722: compression: Use preallocated decompression buffers for lz4 by @abhijat
- PR #17888 [v23.3.x] CORE-2365: storage: increase size of offset key map fragment size by @dotnwat
Full Changelog: v23.3.11...v23.3.12
v23.3.11
Features
- Introduce "trust_file_crc32c" metric to export a checksum for each trust file in the system. by @oleiman in #17587
Bug Fixes
- #16650 Fix oversized allocation in storage. by @Lazin in #17541
- #17459 Fixes a bug with TLS metrics where expiration timestamps would not advance on certificate reload by @oleiman in #17460
- rpk: prevent a segfault when creating a profile from a cloud that is not in ready state. by @r-vasquez in #17585
- PR #17435 [v23.3.x] c/frag_vector: added `get_allocator()` method to fragmented vector by @mmaslankaprv
- PR #17449 [v23.3.x] tx: fix param ordering in log statement by @nvartolomei
- PR #17572 [v23.3.x] Fixed `node_hash_map` caused oversized allocations in cluster module by @mmaslankaprv
- PR #17573 [v23.3.x] use chunked vector as batches cache in `raft::replicate_batcher` by @mmaslankaprv
- PR #17576 [v23.3.x] c/topics_frontend: break the loop when dispatching to current leader by @mmaslankaprv
- PR #17578 [v23.3.x] rm_stm: do not hold producer lock for the duration of the barrier by @bharathv
- PR #17584 [v23.3.x] k/group_manager: used chunked_vector when cleaning groups by @mmaslankaprv
Improvements
- #17428 Improves error feedback when Redpanda is given an invalid number of partitions during either topic creation or when the partition count for a topic is increased. by @michael-redpanda in #17431
- PR #17574 [v23.3.x] Improved validation of Fetch requests when reading from follower by @mmaslankaprv
Full Changelog: v23.3.10...v23.3.11
v23.3.10
Features
- #16075 Data Transform's Rust SDK now supports a Schema Registry Client. by @rockwotj in #17174
- #17235 rpk: support running `rpk transform` against Redpanda cloud clusters. by @r-vasquez in #17252
Bug Fixes
- Fix a bug where logging in a transform could cause the transform to not make progress. by @rockwotj in #17192
- Fix a race condition between suffix truncation / delete records and adjacent segment compaction that can lead to crashes and data loss. by @nvartolomei in #17253
- Fixes a bug where config_frontend methods were called on shards other than the controller shard. by @pgellert in #17184
- Fixes a bug that would prevent the transaction coordinator topic from reclaiming disk space. by @andrwng in #17380
- Fixes a concurrency issue in transform offset commits pertaining to taking/applying snapshots. by @bharathv in #17400
- #17237 Fixes a bug in CreateTopicsResponse to now return all the configs of the topic, not just the topic-specific override configs. by @pgellert in #17239
- #17419 fixed enabling cloud storage in existing clusters by @mmaslankaprv in #17420
Improvements
- Adds a new public metric redpanda_raft_recovery_partition_movement_consumed_bandwidth that tracks how much bandwidth is currently in use for raft recovery. This helps tune raft_learner_recovery_rate. by @bharathv in #17216
- less overhead of health report collection by @mmaslankaprv in #17360
Full Changelog: v23.3.9...v23.3.10
v23.3.9
Features
Full Changelog: v23.3.8...v23.3.9
v23.2.27
Bug Fixes
- Schema Registry: Deleted schemas no longer reappear after certain compaction patterns on the `_schemas` topic. by @BenPope in #17095
- #16129 Fixes a bug in SASL user deletion and update where usernames with a + symbol in the username were prevented from being deleted by @pgellert in #16813
- #16716 Fixed a few oversized allocations for some admin server endpoints. by @rockwotj in #16747
Improvements
- #16995 Added `EHOSTUNREACH` to retry-able error code list by @michael-redpanda in #16996
Full Changelog: v23.2.26...v23.2.27
v23.3.8
Features
- #16941 `rpk redpanda config bootstrap` now supports bootstrapping your advertised addresses configuration. by @r-vasquez in #16942
Bug Fixes
- Fix a crash that happened when a cluster that was partially in recovery mode tried to upload consumer offsets to cloud storage. by @ztlpn in #17022
- Return an HTTP 400 error code when deploying a transform to a topic that doesn't exist instead of a 500 by @rockwotj in #17018
- Schema Registry: Deleted schemas no longer reappear after certain compaction patterns on the `_schemas` topic. by @BenPope in #17094
- #16679 Retains control batches from transactions to preserve transaction boundaries. This prevents some (very unlikely) scenarios where aborted data is read. by @bharathv in #17100
- PR #17093 [v23.3.x] c/topic_table: replaced partition metadata map with chunked_vector by @mmaslankaprv
- PR #17099 [v23.3.x] storage: ensure monotonic stable offset updates by @nvartolomei
- PR #17111 [v23.3.x] cloud_storage_clients: classify request_timeout as retriable by @nvartolomei
Improvements
- #16815 Node-wide throughput throttling is now fair and responsive. by @BenPope in #16848
- #16993 cluster: Avoid oversize allocs for topic creation and configuration by @BenPope in #17012
- #17107 `rpk profile` has been reworked in an attempt to be simpler; see PR #17038 for more detail by @twmb in #17108
- PR #17115 [v23.3.x] Using `contiguous_range_map` in `partition_leaders_table` by @mmaslankaprv
- PR #17120 [v23.3.x] rpk profile: a few more fixes by @twmb
Full Changelog: v23.3.7...v23.3.8
v23.3.7
Features
- You can create namespaces in Redpanda cloud using rpk cloud namespace. by @r-vasquez in [#16777](https://github.com/redpanda-data/redpanda/pull/16777)
- #16570 [#16572](https://github.com/redpanda-data/redpanda/issues/16572) Publish log (i.e. stderr/stdout) output from data transforms exclusively to an internally managed Redpanda topic (`_redpanda.transform_logs`). Data transform logs will no longer appear in broker logs. by @oleiman in #16663
- #16895 Add Prometheus metrics for data transforms logging by @oleiman in #16913
Bug Fixes
- Fixes a plausible correctness issue with idempotent requests during replication failures. by @bharathv in #16749
- #16129 Fixes a bug in SASL user deletion and update where usernames with a + symbol in the username were prevented from being deleted by @pgellert in [#16811](https://github.com/redpanda-data/redpanda/pull/16811)
- #16659 Fixes a bug in the tiered storage time-based query implementation that could result in a consumer hang when consuming very old data. by @andrwng in #16660
- #16717 Fixed a few oversized allocations for some admin server endpoints. by @rockwotj in #16719
- #16884 Fixed deleting Data Transforms with names that had URL unsafe characters by @rockwotj in #16885
- #16937 Fixes a bug in windowed compaction that could cause Redpanda to crash when an error occurs while reading batches. by @andrwng in [#16940](https://github.com/redpanda-data/redpanda/pull/16940)
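
Two of the fixes above (#16129 and #16884) concern names containing characters that have special meaning in URLs. These notes do not describe the root cause, so the following is only a general sketch, using Python's standard library, of why a client that places such a name in a request path may want to percent-encode it. The example names and the helper `path_segment` are hypothetical.

```python
from urllib.parse import quote, unquote, unquote_plus


def path_segment(name: str) -> str:
    """Percent-encode a name so it can be used as a single URL path segment."""
    # safe="" also encodes "/", which would otherwise split the segment.
    return quote(name, safe="")


if __name__ == "__main__":
    username = "alice+admin"  # hypothetical SASL username
    encoded = path_segment(username)
    print(encoded)                  # alice%2Badmin
    print(unquote(encoded))         # alice+admin (round-trips intact)
    # Form-style decoding treats a raw "+" as a space, changing the name.
    print(unquote_plus(username))   # alice admin
    print(path_segment("my transform/v1"))  # my%20transform%2Fv1
```

Encoding with `safe=""` is a conservative choice: every reserved character is escaped, so the decoded value matches the original name regardless of which decoding convention the server applies to the path.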
Improvements
- Adds observability into producer evictions in each shard. by @bharathv in [#16839](https://github.com/redpanda-data/redpanda/pull/16839)
- Fix large wasm module deployments by @rockwotj in #16767
- Increase `data_transforms_logging_buffer_capacity_bytes` from 100KiB to 500KiB by @oleiman in #16977
- Large allocations are now logged by default (similar to reactor stalls) by @StephanDollberg in #16844
- #16795 Added ability to change transactional manage topic properties by @mmaslankaprv in #16968
- #16831 get_cluster_uuid returns a correctly formatted string by @andijcr in #16832
- #16888 Data Transform builds in rpk now use tinygo v0.31.1 by @rockwotj in #16889
- #16947 better control of memory usage in storage layer. by @mmaslankaprv in #16963
- #16997 Added `EHOSTUNREACH` to retry-able error code list by @michael-redpanda in #16998
- optimized updating leadership metadata with health reports by @mmaslankaprv in [#16709](https://github.com/redpanda-data/redpanda/pull/16709)
- preventing large allocation in partition balancer code by @mmaslankaprv in [#16939](https://github.com/redpanda-data/redpanda/pull/16939)
- rpk: Remove 10s timeout in `rpk profile create` by @r-vasquez in [#16852](https://github.com/redpanda-data/redpanda/pull/16852)
- PR #16682 [v23.3.x] Implement async_for_each by @travisdowns
- PR #16688 [v23.3.x] Add forward iterator to async_for_each by @travisdowns
- PR #16691 [v23.3.x] Rethrow on unknown exceptions in fetch handler by @ballard26
- PR #16784 [v23.3.x] c/leaders: trigger leadership notification when term changes by @mmaslankaprv
- PR #16801 [v23.3.x] c/topic_table_probe: use btree_map in topic table probe by @mmaslankaprv
- PR #16829 [v23.3.x] rpk: update help text of decommission-status by @daisukebe
- PR #16891 [v23.3.x] cmake: upgrade tinygo compiler by @rockwotj
- PR #16894 [v23.3.x] cloud_storage: Improve stale_reader test by @Lazin
- PR #16897 [v23.3.x] Fixed background apply fiber race condition in `raft::state_machine_manager` by @mmaslankaprv
- PR #16903 [v23.3.x] cloud_storage: various non-functional changes by @andrwng
- PR #16908 Revert "[v23.3.x] rm_stm/idempotency: fix the producer lock scope" by @bharathv
- PR #16935 [v23.3.x] fix for cluster_config_test.py::test_aliasing by @andijcr
- PR #16965 [v23.3.x] Ensure `fragment_vector` fragments are always <= 128KiB by @ballard26
Full Changelog: v23.3.6...v23.3.7
v23.3.6
Bug Fixes
- Fix a bug that caused Redpanda to ignore, until the next restart, config values that were reset to their defaults. by @ztlpn in #16638
- Prevent detecting leader epoch advancement when state is not up to date by @mmaslankaprv in [#16573](https://github.com/redpanda-data/redpanda/pull/16573)
- #16621 Avoid a large contiguous allocation when creating thousands of topics in a single CreateTopics request. by @travisdowns in #16622
- #16627 #16628 `rpk tune --output-script`: Add a missing new line in the ballast file tuner when using the `--output-script` flag by @r-vasquez in #16629
Improvements
- Validate transform code at deploy time to ensure the correct SDK is used. by @rockwotj in [#16498](https://github.com/redpanda-data/redpanda/pull/16498)
- #16627 #16628 `rpk tune --output-script`: rpk now creates a file for you if the provided file does not exist. by @r-vasquez in [#16629](https://github.com/redpanda-data/redpanda/pull/16629)
- PR #16546 [v23.3.x] rpc: Add config flag to enable/disable compression for replies by @StephanDollberg
- PR #16565 [v23.3.x] introduce chunked_vector by @rockwotj
- PR #16569 [v23.3.x] rpc: Disable compression for internal rpc replies by @StephanDollberg
Full Changelog: v23.3.5...v23.3.6
v23.2.26
Bug Fixes
- Fix a bug that caused Redpanda to ignore, until the next restart, config values that were reset to their defaults. by @ztlpn in #16641
- #16624 #16625 `rpk tune --output-script`: Add a missing new line in the ballast file tuner when using the `--output-script` flag by @r-vasquez in #16626
Improvements
- #16624 #16625 `rpk tune --output-script`: rpk now creates a file for you if the provided file does not exist. by @r-vasquez in [#16626](https://github.com/redpanda-data/redpanda/pull/16626)
- PR #16493 [v23.2.x] Fixed large allocation in `kafka::wait_for_leaders` by @mmaslankaprv
- PR #16550 [v23.2.x] rpc: Add config flag to enable/disable compression for replies by @StephanDollberg
- PR #16641 [v23.2.x] config: update bindings when properties are reset by @andijcr
Full Changelog: v23.2.25...v23.2.26