Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GH-39227: [C++][Parquet] new Boolean RecordReader geting raw LSB bitmap #40885

Draft
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

mapleFU
Copy link
Member

@mapleFU mapleFU commented Mar 29, 2024

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

@mapleFU
Copy link
Member Author

mapleFU commented Mar 29, 2024

@wgtmac @pitrou This is still wip. Do you think this interface is ok? If yes I'll continue it

@pitrou
Copy link
Member

pitrou commented Mar 29, 2024

It's not very pretty, but I think it's ok.

static std::shared_ptr<RecordReader> Make(
const ColumnDescriptor* descr, LevelInfo leaf_info,
::arrow::MemoryPool* pool = ::arrow::default_memory_pool(),
bool read_dictionary = false, bool read_dense_for_nullable = false);
bool read_dictionary = false, bool read_dense_for_nullable = false,
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These parameters are enough to have a new RecordReaderOptions, IMHO.

@github-actions github-actions bot added awaiting committer review Awaiting committer review and removed awaiting review Awaiting review labels Mar 29, 2024
@@ -487,6 +494,11 @@ class DictionaryRecordReader : virtual public RecordReader {
virtual std::shared_ptr<::arrow::ChunkedArray> GetResult() = 0;
};

class BitmapBooleanRecordReader : virtual public RecordReader {
public:
virtual std::shared_ptr<::arrow::Array> GetChunkedData() = 0;
Copy link
Member

@wgtmac wgtmac Mar 29, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not sure if making it an explicit BooleanArray is a good idea. In this case, we can simply call it BooleanRecordReader

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants