Skip to content

Commit

Permalink
Add monthly update for February 2024 (#9650)
Browse files Browse the repository at this point in the history
Summary:
Resolves: #9299

Pull Request resolved: #9650

Reviewed By: pedroerp

Differential Revision: D56910802

Pulled By: bikramSingh91

fbshipit-source-id: ba2a0adf324acfd80d924269ef7ee711bae46feb
  • Loading branch information
majetideepak authored and facebook-github-bot committed May 3, 2024
1 parent 2e61956 commit 9fbc544
Show file tree
Hide file tree
Showing 3 changed files with 73 additions and 2 deletions.
2 changes: 2 additions & 0 deletions velox/docs/develop/testing/fuzzer.rst
Original file line number Diff line number Diff line change
Expand Up @@ -141,6 +141,8 @@ tested:
Total aggregations verified against DuckDB: 2537 (44.63%)
Total failed aggregations: 1061 (18.67%)
.. _window-fuzzer:

Window Fuzzer
-------------

Expand Down
5 changes: 3 additions & 2 deletions velox/docs/monthly-updates.rst
Original file line number Diff line number Diff line change
Expand Up @@ -5,9 +5,10 @@ Monthly Updates
.. toctree::
:maxdepth: 1

monthly-updates/january-2024
monthly-updates/march-2024
monthly-updates/april-2024
monthly-updates/march-2024
monthly-updates/february-2024
monthly-updates/january-2024
monthly-updates/2023/index
monthly-updates/2022/index
monthly-updates/2021/index
68 changes: 68 additions & 0 deletions velox/docs/monthly-updates/february-2024.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,68 @@
********************
February 2024 Update
********************

Core Library
============

* Add support for aggregations over distinct inputs to StreamingAggregation.
* Add support for deserializing a single column in Presto page format.
* Add support for deserializing an all-null column serialized as UNKNOWN type in Presto page format.
* Add stats for null skew in join operator.
* Convert TIMESTAMP_WITH_TIME_ZONE type to a primitive type.
* Add background profiler that starts Linux perf on the Velox process.
* Fix ``out of range in dynamic array`` error in Task::toJson.
* Delete unused ``max_arbitrary_buffer_size`` config.

Presto Functions
================

* Add :func:`typeof`, :func:`from_iso8601_date` scalar functions.
* Add support for DECIMAL input type to :func:`set_agg` and :func:`set_union` aggregate functions.
* Add support for UNKNOWN input type to :func:`checksum` aggregate function.
* Add support for DATE +/- INTERVAL YEAR MONTH functions.
* Add support for ``UCT|UCT|GMT|GMT0`` as ``Z`` to :func:`parse_datetime` scalar function.

Spark Functions
===============

* Add :spark:func:`array_repeat`, :spark:func:`date_from_unix_date`, :spark:func:`weekday`, :spark:func:`minute`, :spark:func:`second` scalar functions.
* Add :spark:func:`ntile` window function.

Hive Connector
==============

* Add ``ignore_missing_files`` config.
* Add write support to ABFS file system.
* Add support for proxy to S3 file system.

Arrow
=====

* Add support to export UNKNOWN type to Arrow array.
* Add support to convert Arrow REE arrays to Velox Vectors.

Performance and Correctness
===========================

* Add FieldReference benchmark.
* Add :ref:`Window fuzzer <window-fuzzer>`.
* Fix ``Too many open files`` error in Join fuzzer.

Build System
============

* Add ``VELOX_BUILD_MINIMAL_WITH_DWIO`` CMake option.
* Move documentation, header and format check to Github Action.

Credits
=======

Aaron Feldman, Ankita Victor, Bikramjeet Vig, Christian Zentgraf, Daniel Munoz,
David McKnight, Deepak Majeti, Ge Gao, Hongze Zhang, Jacob Wujciak-Jens, Jia Ke,
Jialiang Tan, Jimmy Lu, Kevin Wilfong, Krishna Pai, Lu Niu, Masha Basmanova,
Nick Terrell, Orri Erling, PHILO-HE, Pedro Pedreira, Pramod, Pranjal Shankhdhar,
Richard Barnes, Schierbeck, Cody, Sergey Pershin, Wei He, Yedidya Feldblum,
Zac Wen, Zhenyuan Zhao, aditi-pandit, duanmeng, gayangya, hengjiang.ly, hitarth,
lingbin, mwish, rrando901, rui-mo, xiaodou, xiaoxmeng, xumingming, yingsu00,
zhli1142015, 高阳阳

0 comments on commit 9fbc544

Please sign in to comment.