Skip to content

Apache Pinot Release 1.3.0-RC0

Pre-release
Pre-release
Compare
Choose a tag to compare
@abhioncbr abhioncbr released this 12 Jan 20:11
· 12 commits to master since this release

What Changed

This release brings significant improvements, including enhancements to the multistage query engine and the introduction of a time series query engine for efficient analysis. Key features include database query quotas, cursor-based pagination for large result sets, multi-stream ingestion, and new function support for URL and GeoJson. Security vulnerabilities have been addressed, along with several bug fixes and performance enhancements, ensuring a more robust and versatile platform.

Multistage Engine Improvements

Reuse common expressions in a query (spool) #14507, Design Doc

Refines query plan reuse in Apache Pinot by allowing reuse across stages instead of subtrees. Stages serve as natural boundaries in the query plan, which is divided into pull-based operators. To execute queries, Pinot introduces stages connected by MailboxSendOperator and MailboxReceiveOperator. The proposal modifies MailboxSendOperator to send data to multiple stages, transforming stage connections into a Directed Acyclic Graph (DAG) for greater efficiency and flexibility.

Segment Plan for MultiStage Queries #13733, #14212

It is focusing on providing comprehensive execution plans, including physical operator details. The new explain mode aligns with Calcite terminology and uses a broker-server communication flow to analyze and transform query plans into explained physical plans without executing them. A new ExplainedPlanNode is introduced to enrich query execution plans with physical details, ensuring better transparency and debugging capabilities for users.

DataBlock Serde Performance Improvements #13303, #13304

Improve the performance of DataBlock building, serialization, and deserialization by reducing memory allocation and copies without altering the binary format. Benchmarks show 1x to 3x throughput gains, with significant reductions in memory allocation, minimizing GC-related latency issues in production. The improvement is achieved by changes to the buffers and the addition of a couple of stream classes.

Notable Improvements and Bug Fixes

  • Allow adding and subtracting timestamp types. #14782
  • Remove PinotAggregateToSemiJoinRule to avoid mistakenly remove DISTINCT from IN clause. #14719
  • Support to use timestamp indexes. #14690
  • Support for polymorphic scalar comparison functions(=, !=, >, >=, <, <=). #13711
  • Optimized MergeEqInFilterOptimizer by reducing the hash computation of expression. #14732
  • Add support for is_enable_group_trim aggregate option. #14664
  • Add support for is_leaf_return_final_result aggregate option. #14645
  • Override return type for NOW to TIMESTAMP. #14614
  • Fix broken BIG_DECIMAL aggregations (MIN / MAX / SUM / AVG). #14689
  • Add cluster configuration to allow limiting the number of multi-stage queries running concurrently. #14574
  • Allow filter for lookup JOIN. #14523
  • Fix the bug of query option being completely overridden when generating leaf stage query. #14603
  • Fix timestamp literal handling in the multi-stage query engine. #14502
  • Add TLS support to mailboxes used in the multi-stage engine. #14476
  • Allow configuring TLS between brokers and servers. #14387
  • Add tablesQueried metadata to BrokerResponse for multi-stage queries. #14384
  • Apply filter reduce expressions Calcite rule at the end to prevent literal-only filter pushdown to leaf stage. #14448
  • Add support for all data types in return type inference from string literals for JSON extract functions. #14289
  • Add support for IGNORE NULLS option to FIRST_VALUE and LAST_VALUE window functions. #14264
  • Fix for window frame upper bound offset extraction in PinotWindowExchangeNodeInsertRule. #14304
  • Add support for defining custom window frame bounds for window functions. #14273
  • Improvements to allow using DISTINCTCOUNTTHETASKETCH with filter arguments. #14285
  • Fix for ROUND scalar function in the multi-stage query engine. #14284
  • Support for enabling LogicalProject pushdown optimizations to eliminate exchange of unused columns. #14198
  • Support for COALESCE as a variadic scalar function. #14195
  • Support for lookup join. #13966
  • Add NULLIF scalar function. #14203
  • Compute all groups for group by queries with only filtered aggregations. #14211
  • Add broker API to run a query on both query engines and compare results. #13746
  • Database handling improvement in multi-stage engine. #14040
  • Adds per-block row tracking for CROSS JOINs to prevent OOM while allowing large joins to function within memory limits. #13981
  • OOM Protection Support for Multi-Stage Queries. #13598, #13955
  • Refactor function registry for multi-stage engine. #13573
  • Enforce max rows in join limit on joined rows with left input. #13922
  • Argument type to lookup function for literal only query. #13673
  • Ensure broker queries fail when the multi-stage engine is disabled, aligning behavior with the controller to improve user experience. #13732

Timeseries Engine Support in Pinot Design Doc

Introduction of a Generic Time Series Query Engine in Apache Pinot, enabling native support for various time-series query languages (e.g., PromQL, M3QL) through a pluggable framework.
This enhancement addresses limitations in Pinot’s current SQL-based query engines for time-series analysis, providing optimized performance and usability for observability use cases, especially those requiring high-cardinality metrics.

Key Features

Pluggable Time-Series Query Language:

  • Pinot will support multiple time-series query languages, such as PromQL and Uber’s M3QL, via plugins like pinot-m3ql.
  • Example queries:
    • Plot hourly order counts for specific merchants.
    • Perform week-over-week analysis of order counts.
  • These plugins will leverage a new SPI module to enable seamless integration of custom query languages.

Pluggable Time-Series Operators:

  • Custom operators specific to each query language (e.g., nonNegativeDerivative or holt_winters) can be implemented within language-specific plugins without modifying Pinot’s core code.
  • Extensible operator abstractions will allow stakeholders to define unique time-series analysis functions.

Advantages of the New Engine:

  • Optimized for Time-Series Data: Processes data in series rather than rows, improving performance and simplifying the addition of complex analysis functions.
  • Reduced Complexity in Pinot Core: The engine reuses existing components like the Multi-Stage Engine (MSE) Query Scheduler, Query Dispatcher, and Mailbox, while language parsers and planners remain modular in plugins.
  • Improved Usability: Users can run concise and powerful time-series queries in their preferred language, avoiding the verbosity and limitations of SQL.

Impact on Observability Use Cases:

This new engine significantly enhances Pinot’s ability to handle complex time-series analyses efficiently, making it an ideal database for high-cardinality metrics and observability workloads.

The improvement a step forward in transforming Pinot into a robust and versatile platform for time-series analytics, enabling seamless integration of diverse query languages and custom operators.

Here are some of the key PRs that have been merged as part of this feature:

  • Pinot time series engine SPI. #13885
  • Add combine and segment level operators for time series. #13999
  • Working E2E quickstart for time series engine. #14048
  • Handling NULL cases in sum, min, max series builders. #14084
  • Remove unnecessary time series materialization and minor cleanups. #14092
  • Fix offset handling and effective time filter and enable Group-By expressions. #14104
  • Enabling jSON column for Group-By in time series. #14141
  • Fix bug in handling empty Filters in time series. #14192
  • Minor time series engine improvements. #14227
  • Fix time series query correctness issue. #14251
  • Define time series ID and broker response name tag semantics. #14286
  • Use num docs from value block in time series aggregation operator. #14331
  • Make time buckets half open on the left. #14413
  • Fix Server Selection Bug + Enforce Timeout. #14426
  • Response Size Limit, Metrics and Series Limit. #14501
  • Refactor to Enable Broker Reduce. #14582
  • Enable streaming response for time series. #14598
  • Add time series exchange operator, plan node and serde. #14611
  • Add support for partial aggregate and complex intermediate type. #14631
  • Complete support for multi-server queries. #14676

Database Query Quota #13544

Introduces the ability to impose query rate limits at the database level, covering all queries made to tables within a database. A database-level rate limiter is implemented, and a new method, acquireDatabase(databaseName), is added to the QueryQuotaManager interface to check database query quotas.

Database Query Quota Configuration

  • Query and storage quotas are now provisioned similarly to table quotas but managed separately in a DatabaseConfig znode.
  • Details about the DatabaseConfig znode:
    • It does not represent a logical database entity.
    • Its absence does not prevent table creation under a database.
    • Deletion does not remove tables within the database.

Default and Override Quotas

  • A default query quota (databaseMaxQueriesPerSecond: 1000) is provided in ClusterConfig.
  • Overrides for specific databases can be configured via znodes (e.g., PROPERTYSTORE/CONFIGS/DATABASE/).

APIs for Configuration

Method Path Description
POST /databases/{databaseName}/quotas?maxQueriesPerSecond= Sets the database query quota
GET /databases/{databaseName}/quotas Get the database query quota

Dynamic Quota Updates

  • Quotas are determined by a combination of default cluster-level quotas and database-specific overrides.
  • Per-broker quotas are adjusted dynamically based on the number of live brokers.
  • Updates are handled via:
    • A custom DatabaseConfigRefreshMessage sent to brokers upon database config changes.
    • A ClusterConfigChangeListener in ClusterChangeMediator to process updates in cluster configs.
    • Adjustments to per-broker quotas upon broker resource changes.
    • Creation of database rate limiters during the OFFLINE -> ONLINE state transition of tables in BrokerResourceOnlineOfflineStateModel.

This feature provides fine-grained control over query rate limits, ensuring scalability and efficient resource management for databases within Pinot.

Binary Workload Scheduler for Constrained Execution #13847

Introduction of the BinaryWorkloadScheduler, which categorizes queries into two distinct workloads to ensure cluster stability and prioritize critical operations:

Workload Categories:

1. Primary Workload:

  • Default category for all production traffic.
  • Queries are executed using an unbounded FCFS (First-Come, First-Served) scheduler.
  • Designed for high-priority, critical queries to maintain consistent availability and performance.

2. Secondary Workload:

  • Reserved for ad-hoc queries, debugging tools, dashboards/notebooks, development environments, and one-off tests.
  • Imposes several constraints to minimize impact on the primary workload:
    • Limited concurrent queries: Caps the number of in-progress queries, with excess queries queued.
    • Thread restrictions: Limits the number of worker threads per query and across all queries in the secondary workload.
    • Queue pruning: Queries stuck in the queue too long are pruned based on time or queue length.

Key Benefits:

  • Prioritization: Guarantees the primary workload remains unaffected by resource-intensive or long-running secondary queries.
  • Stability: Protects cluster availability by preventing incidents caused by poorly optimized or excessive ad-hoc queries.
  • Scalability: Efficiently manages traffic in multi-tenant clusters, maintaining service reliability across workloads.

Cursors Support #14110, Design Doc

Cursor support will allow Pinot clients to consume query results in smaller chunks. This feature allows clients to work with lesser resources esp. memory. Application logic is simpler with cursors. For example an app UI paginates through results in a table or a graph. Cursor support has been implemented using APIs.

API

Method Path Description
POST /query/sql New broker API parameter has been added to trigger pagination.
GET /resultStore/{requestId}/results Broker API that can be used to iterate over the result set of a query submitted using the above API.
GET /resultStore/{requestId}/ Returns the BrokerResponse metadata of the query.
GET /resultStore Lists all the requestIds of all the query results available in the response store.
DELETE /resultStore/{requestId}/ Delete the results of a query.

SPI

The feature provides two SPIs to extend the feature to support other implementations:

  • ResponseSerde: Serialize/Deserialize the response.
  • ResponseStore: Store responses in a storage system. Both SPIs use Java SPI and the default ServiceLoader to find implementation of the SPIs. All implementation should be annotated with AutoService to help generate files for discovering the implementations.

URL Functions Support #14646

Implemented various URL functions to handle various aspects of URL processing, including extraction, encoding/decoding, and manipulation, making them useful for tasks involving URL parsing and modification

URL Extraction Methods

  • urlProtocol(String url): Extracts the protocol (scheme) from the URL.
  • urlDomain(String url): Extracts the domain from the URL.
  • urlDomainWithoutWWW(String url): Extracts the domain without the leading "www." if present.
  • urlTopLevelDomain(String url): Extracts the top-level domain (TLD) from the URL.
  • urlFirstSignificantSubdomain(String url): Extracts the first significant subdomain from the URL.
  • cutToFirstSignificantSubdomain(String url): Extracts the first significant subdomain and the top-level domain from the URL.
  • cutToFirstSignificantSubdomainWithWWW(String url): Returns the part of the domain that includes top-level subdomains up to the "first significant subdomain", without stripping "www.".
  • urlPort(String url): Extracts the port from the URL.
  • urlPath(String url): Extracts the path from the URL without the query string.
  • urlPathWithQuery(String url): Extracts the path from the URL with the query string.
  • urlQuery(String url): Extracts the query string without the initial question mark (?) and excludes the fragment (#) and everything after it.
  • urlFragment(String url): Extracts the fragment identifier (without the hash symbol) from the URL.
  • urlQueryStringAndFragment(String url): Extracts the query string and fragment identifier from the URL.
  • extractURLParameter(String url, String name): Extracts the value of a specific query parameter from the URL.
  • extractURLParameters(String url): Extracts all query parameters from the URL as an array of name=value pairs.
  • extractURLParameterNames(String url): Extracts all parameter names from the URL query string.
  • urlHierarchy(String url): Generates a hierarchy of URLs truncated at path and query separators.
  • urlPathHierarchy(String url): Generates a hierarchy of path elements from the URL, excluding the protocol and host.

URL Manipulation Methods

  • urlEncode(String url): Encodes a string into a URL-safe format.
  • urlDecode(String url) Decodes a URL-encoded string.
  • urlEncodeFormComponent(String url): Encodes the URL string following RFC-1866 standards, with spaces encoded as +.
  • urlDecodeFormComponent(String url): Decodes the URL string following RFC-1866 standards, with + decoded as a space.
  • urlNetloc(String url): Extracts the network locality (username:password@host:port) from the URL.
  • cutWWW(String url): Removes the leading "www." from a URL’s domain.
  • cutQueryString(String url): Removes the query string, including the question mark.
  • cutFragment(String url): Removes the fragment identifier, including the number sign.
  • cutQueryStringAndFragment(String url): Removes both the query string and fragment identifier.
  • cutURLParameter(String url, String name): Removes a specific query parameter from a URL.
  • cutURLParameters(String url, String[] names): Removes multiple specific query parameters from a URL.

Multi Stream Ingestion Support #13790, Design Doc

  • Add support to ingest from multiple source by a single table
  • Use existing interface (TableConfig) to define multiple streams
  • Separate the partition id definition between Stream and Pinot segment
  • Compatible with existing stream partition auto expansion logics
    The feature does not change any existing interfaces. Users could define the table config in the same way and combine with any other transform functions or instance assignment strategies.

New Scalar Functions Support. #14671

  • intDiv and intDivOrZero: Perform integer division, with intDivOrZero returning zero for division by zero or when dividing a minimal negative number by minus one.
  • isFinite, isInfinite, and isNaN: Check if a double value is finite, infinite, or NaN, respectively.
  • ifNotFinite: Returns a default value if the given value is not finite.
  • moduloOrZero and positiveModulo: Variants of the modulo operation, with moduloOrZero returning zero for division by zero or when dividing a minimal negative number by minus one.
  • negate: Returns the negation of a double value.
  • gcd and lcm: Calculate the greatest common divisor and least common multiple of two long values, respectively.
  • hypot: Computes the hypotenuse of a right-angled triangle given the lengths of the other two sides.
  • byteswapInt and byteswapLong: Perform byte swapping on integer and long values.

GeoJSON Support #14405

Add support for GeoJSON Scalar functions:

ST_GeomFromGeoJson(string) -> binary
ST_GeogFromGeoJson(string) -> binary
ST_AsGeoJson(binary) -> string

Supported data types:

  • Point
  • LineString
  • Polygon
  • MultiPoint
  • MultiLineString
  • MultiPolygon
  • GeometryCollection
  • Feature
  • FeatureCollection

Improved Implementation of Distinct Operators. #14701

Main optimizations:

  • Add per data type DistinctTable and utilize primitive type if possible
  • Specialize single column case to reduce overhead
  • Allow processing null values with dictionary based operators
  • Specialize unlimited LIMIT case
  • Do not create priority queue before collecting LIMIT values
  • Add support for null ordering

Upsert Improvements

Features and Improvements

Track New Segments for Upsert Tables #13992

  • Improvement for addressing a race condition where newly uploaded segments may be processed by the server before brokers add them to the routing table, potentially causing queries to miss valid documents.
  • Introduce a configurable newSegmentTrackingTimeMs (default 10s) to track new segments on the server side, allowing them to be accessed as optional segments until brokers update their routing tables.

Ensure Upsert Deletion Consistency with Compaction Flow Enabled #13347

Enhancement addresses inconsistencies in upsert-compaction by introducing a mechanism to track the distinct segment count for primary keys. By ensuring a record exists in only one segment before compacting deleted records, it prevents older non-deleted records from being incorrectly revived during server restarts, ensuring consistent table state.

Consistent Segments Tracking for Consistent Upsert View #13677

This improves consistent upsert view handling by addressing segment tracking and query inconsistencies. Key changes include:

  • Complete and Consistent Segment Tracking: Introduced a new Set to track segments before registration to the table manager, ensuring synchronized segment membership and validDocIds access.
  • Improved Segment Replacement: Added DuoSegmentDataManager to register both mutable and immutable segments during replacement, allowing queries to access a complete data view without blocking ingestion.
  • Query Handling Enhancements: Queries now acquire the latest consuming segments to avoid missing newly ingested data if the broker's routing table isn't updated.
  • Misc Fixes: Addressed edge cases, such as updating _numDocsIndexed before metadata updates, returning empty bitmaps instead of null, and preventing bitmap re-acquisition outside locking logic.
    These changes, gated by the new feature flag upsertConfig.consistencyMode, are tested with unit and stress tests in a staging environment to ensure reliability.

Other Notable Improvements and Bug Fixes

  • Config for max output segment size in UpsertCompactMerge task. #14772
  • Add config for ignoreCrcMismatch for upsert-compaction task. #14668
  • Upsert small segment merger task in minions. #14477
  • Fix to acquire segmentLock before taking segment snapshot. #14179
  • Update upsert TTL watermark in replaceSegment. #14147
  • Fix checks on largest comparison value for upsert ttl and allow to add segments out of ttl. #14094
  • More observability and metrics to track upsert rate of deletion. #13838

Lucene and Text Search Improvements

  • Store index metadata file for Lucene text indexes. #13948
  • Runtime configurability for Lucene analyzers and query parsers, enabling dynamic text tokenization and advanced log search capabilities like case-sensitive/insensitive searches. #13003

Security Improvements and Vulnerability Fixes

  • Force SSL cert reload daily using the scheduled thread. #14535
  • Allow configuring TLS between brokers and servers for the multi-stage engine. #14387
  • Strip Matrix parameter from BasePath checking. #14383
  • Disable replacing environment variables and system properties in get table configs REST API. #14002
  • Dependencies upgrade for vulnerabilities. #13892
  • TLS Configuration Support for QueryServer and Dispatch Clients. #13645
  • Returning tables names failing authorization in Exception for Multi-State Engine Queries. #13195
  • TLS Port support for Minion. #12943
  • Upgrade the hadoop version to 3.3.6 to fix vulnerabilities. #12561)
  • Fix vulnerabilities for msopenjdk 11 pinot-base-runtime image. #14030

Miscellaneous Improvements

  • Allow setting ForwardIndexConfig default settings via cluster config. #14773
  • Extend Merge Rollup Capabilities for Datasketches. #14625
  • Skip task validation during table creation with schema. #14683
  • Add capability to configure sketch precision / accuracy for different rollup buckets. Helpful in a space-saving for use cases where historical data does not require high accuracy. #14373
  • Add support for application-level query quota. #14226
  • Improvement to allow setting ForwardIndexConfig default settings via cluster config. #14773
  • Enhanced mutable Index class to be as pluggable. #14609
  • Improvement to allow configurable initial capacity for IndexedTable. #14620
  • Add a new segment reload API for flexible control, allowing specific segments to be reloaded on designated servers and enabling workload management through batch processing and replica group targeting. #14544
  • Add a server API to list segments that need to be refreshed for a table. #14544
  • Introduced the ability to erase dimension values before rollup in merged segments, reducing cardinality and optimizing space for less critical historical data. #14355
  • Add support for immutable CLPForwardIndex creator and related classes. #14288
  • Add support for Minion Task to support automatic Segment Refresh. #14300
  • Add support for S3A Connector. #14474
  • Add support for hex decimal to long scalar functions. #14435
  • Remove emitting null value fields during data transformation for SchemaConformingTransformer. #14351
  • Improved CSV record reader to skip unparseable lines. #14396
  • Add the ability to specify a target instance for segment reloading and improve API response messages when segments are not found on the target instances. #14393
  • Add support for JSON Path Exists function. #14376
  • Improvement for MSQ explain and stageStats when dealing with empty tables. #14374
  • Improvement for dynamically adjusting GroupByResultHolder's initial capacity based on filter predicates to optimize resource allocation and improve performance for filtered group-by queries. #14001
  • Add support for isEqualSet Function. #14313
  • Improvement to ensure consistent index configuration by constructing IndexLoadingConfig and SegmentGeneratorConfig from table config and schema, fixing inconsistencies and honoring FieldConfig.EncodingType. #14258
  • Add usage of CLPMutableForwardIndexV2 by default to improve ingestion performance and efficiency. #14241
  • Add support for application-level query quota. #14226
  • Add null handling support for aggregations grouped by MV columns. #14071
  • Add support for enabling the capability to specify zstd and lz4 segment compression via config. #14008
  • Add support for map data type on UI. #14245
  • Add support for ComplexType in SchemaInfo to render Complex Column count in UI. #14254
  • Introduced raw fwd index version V5 containing implicit num doc length, improving space efficiency. #14105
  • Improvement for colocated Joins without hints. #13943
  • Enhanced optimizeDictionary to optionally optimize var-width type columns. #13994
  • Enhanced optimizeDictionary to optionally optimize var-width type columns. #13994
  • Add support for BETWEEN in NumericalFilterOptimizer. #14163
  • Add support for NULLIF scalar function. #14203
  • Improvement for allowing usage of star-tree index with null handling enabled when no null values in segment columns. #14177
  • Improvement Improvement for avoiding using setter in IndexLoadingConfig for consuming segment. #14190
  • Implement consistent data push for Spark3 segment generation and metadata push jobs. #14139
  • Improvement addressing ingestion delays in realtime tables with many partitions by mitigating simultaneous segment commits across consumers. #14170
  • Improve query options validation and error handling. #14158
  • Add support arbitrary number of WHEN THEN clauses in the scalar CASE function. #14125
  • Add support for configuring Theta and Tuple aggregation functions. #14167
  • Add support for Map type in complex schema. #13906)
  • Add TTL watermark storage/loading for the dedup feature to prevent stale metadata from being added to the store when loading segments. #14137
  • Polymorphic scalar function implementation for BETWEEN. #14113
  • Polymorphic binary arithmetic scalar functions. #14089
  • Improvement for Adaptive Server Selection to penalize servers returning server side exceptions. #14029
  • Add a server level config for segment server upload to deep store. #14093
  • Add support to upload segments in batch mode with METADATA upload type. #13646
  • Remove recreateDeletedConsumingSegment flag from RealtimeSegmentValidationManager. #14024
  • Kafka3 support for realtime ingestion. #13891
  • Allow to build index on preserved field in SchemaConformingTransformer. #13993
  • Add support to differentiate null and emptyLists for multi-value columns in avro decoder. #13572
  • Broker config to set default query null handling behavior. #13977
  • Moves the untarring method to BaseTaskExecutor to enable downloading and untarring from a peer server if deepstore untarring fails and allowDownloadFromServer is enabled. #13964
  • Optimize Adaptive Server Selection. #13952
  • New SPI to support custom executor services, providing default implementations for cached and fixed thread pools. #13921
  • Introduction of shared IdealStateUpdaterLock for PinotLLCRealtimeSegmentManager to prevent race conditions and timeouts during large segment updates. #13947
  • Support for configuring aggregation function parameters in the star-tree index. #13835
  • Write support for creating Pinot segments in Pinot Spark connector. #13748
  • Array flattening support in SchemaConformingTransformer. #13890
  • Allow table names in TableConfigs with or without database name when database context is passed. #13934
  • Improvement in null handling performance for nullable single input aggregation functions. #13791
  • Improvement in column-based null handling by refining method naming, adding documentation, and updating validation and constructor logic to support column-specific null strategies. #13839
  • UI load time improvements. #13296
  • Enhanced the noRawDataForTextIndex config to skip writing raw data when re-using the mutable index is enabled, fixing a global disable issue and improving ingestion performance. #13776
  • Improvements to polymorphic scalar comparison functions for better backward compatibility. #13870
  • Add TablePauseStatus to track the pause details. #13803
  • Check stale dedup metadata when adding new records/segments. #13848
  • Improve error messages with star-tree indexes creation. #13818
  • Adds support for ZStandard and LZ4 compression in tar archives, enhancing efficiency and reducing CPU bottlenecks for large-scale data operations. #13782
  • Support for IPv6 in Net Utils. #13805
  • Optimize NullableSingleInputAggregationFunction when the entire block is null based on the null bitmap’s cardinality. #13758
  • Supporting extra headers in the request for supporting database and for routing the requests. #13417
  • Adds routing policy details to query error messages for unavailable segments, providing context to ease confusion and expedite issue triage. #13706
  • Refactoring and cleanup for permissions and access. #13696, #13633
  • Prevent 500 error for non-existent tasktype in /tasks/{taskType}/tasks API. #13537
  • Changed STREAM_DATA_LOSS from a Meter to a Gauge to accurately reflect data loss detection and ensures proper cleanup. #13712

Bug Fixes

  • Fix typo in RefreshSegmentTaskExecutor logger. #14763
  • Fix to avoid handling JSON_ARRAY as multi-value JSON during transformation. #14738
  • Fix for partition-enabled instance assignment with minimized movement. #14726
  • Fix v1 query engine behaviour for aggregations without group by where the limit is zero. #13564
  • Fix metadata fetch by increasing timeout for the Kafka client connection. #14638
  • Fix integer overflow in GroupByUtils. #14610
  • Fix for using PropertiesWriter to escape index_map keys properly. #12018
  • Fix query option validation for group-by queries. #14618
  • Fix for making RecordExtractor preserve empty array/map and map entries with empty values. #14547
  • Fix CRC mismatch during deep store upload retry task. #14506
  • Fix for allowing reload for UploadedRealtimeSegmentName segments. #14494
  • Fix default value handling in REGEXP_EXTRACT transform function. #14489
  • Fix for Spark upsert table backfill support. #14443
  • Fix long value parsing in jsonextractscalar. #14337
  • Fix deep store upload retry for infinite retention tables. #14406
  • Fix to ensure deterministic index processing order across server replicas and runs to prevent inconsistent segment data file layouts and unnecessary synchronization. #14391
  • Fix for real-time validation NPE when stream partition is no longer available. #14392
  • Fix for handling NULL values encountered in CLPDecodeTransformFunction. #14364
  • Fix for TextMatchFilterOptimizer grouping for the inner compound query. #14299
  • Fix for removing redundant API calls on the home page. #14295
  • Fix the missing precondition check for the V5 writer version in BaseChunkForwardIndexWriter. #14265
  • Fix for computing all groups for group by queries with only filtered aggregations. #14211
  • Fix for race condition in IdealStateGroupCommit. #14237
  • Fix default column handling when the forward index is disabled. #14215
  • Fix bug with server return final aggregation result when null handling is enabled. #14181
  • Fix Kubernetes Routing Issue in Helm chart. #13450
  • Fix raw index conversion from v4. #14171
  • Fix for consuming segments cleanup on server startup. #14174
  • Fix for making S3PinotFS listFiles return directories when non-recursive. #14073
  • Fix for rebalancer EV converge check for low disk mode. #14178
  • Fix for copying native text index during format conversion. #14172
  • Fix for enforcing removeSegment flow with _enableDeletedKeysCompactionConsistency. #13914
  • Fix for Init BrokerQueryEventListener. #13995
  • Fix for supporting ComplexFieldSpec in Schema and column metadata. #13905
  • Fix race condition in shared literal transform functions. #13916
  • Fix for honoring the column max length property while populating min/max values for column metadata. #13897
  • Fix for skipping encoding the path URL for the azure deep store. #13850
  • Fix for handling DUAL SQL queries in Into JDBC client. #13846
  • Fix TLS configuration for HTTP clients. #13477
  • Fix bugs in DynamicBrokerSelection. #13816
  • Fix literal type handling in LiteralValueExtractor. #13715
  • Fix for handling NULL values appropriately during segment reload for newly derived columns. #13212
  • Fix filtered aggregate with ordering. #13784
  • Fix implementing a table-level lock to prevent parallel updates to the SegmentLineage ZK record and align real-time table ideal state updates with minion task locking for consistency. #13735
  • Fix INT overflow issue for FixedByteSVMutableForwardIndex with large segment size. #13717
  • Fix preload enablement checks to consider the preload executor and refine numMissedSegments logging to exclude unchanged segments, preventing incorrect missing segment reports. #13747
  • Fix a bug in resource status evaluation during service startup, ensuring resources return GOOD when servers have no assigned segments, addressing issues with small tables and segment redistribution. #13541
  • Fix RealtimeProvisioningHelperCommand to allow using just schemaFile along with sampleCompletedSegmentDir. #13727