Partitioned scan for RegionEngine #3886

evenyag · 2024-05-08T12:38:33Z

What type of enhancement is this?

API improvement

What does the enhancement do?

The RegionEngine trait provides a handle_query() method to scan a region and returns a stream of RecordBatch.

greptimedb/src/store-api/src/region_engine.rs

Lines 136 to 140 in d997463

    
               async fn handle_query( 
        
                   &self, 
        
                   region_id: RegionId, 
        
                   request: ScanRequest, 
        
               ) -> Result<SendableRecordBatchStream, BoxedError>;

This method is easy to use but has some limitations:

The output concurrency is always 1
The engine can't return more information about the query to callers

To maximize parallelism in #2806, the engine should provide a way to return multiple streams to scan different partitions of a region concurrently.

Implementation challenges

This issue proposes to add a new method to the region engine which supports partitioned scan. The method returns a trait object that can create a stream according to a partition index.

pub struct ScannerProperties {
    // Properties of the scanner
    // e.g. number of partitions, range of partitions
}

pub trait RegionScanner {
    fn properties(&self) -> &ScannerProperties;

    fn scan_partition(&self, partition: usize) -> Result<SendableRecordBatchStream, BoxedError>;
}

pub type RegionScannerRef = Arc<dyn RegionScanner>;

pub trait RegionEngine {
    async fn handle_partitioned_query(
        &self,
        region_id: RegionId,
        request: ScanRequest,
    ) -> Result<RegionScannerRef, BoxedError>;
}

We could then use the scanner to implement a PhysicalPlan and let the query engine process multiple partitions. We might need to refactor the StreamScanAdapter as it assumes there is only one partition.

greptimedb/src/table/src/table/scan.rs

Lines 103 to 119 in a6a702d

    
           fn execute( 
        
               &self, 
        
               partition: usize, 
        
               context: Arc<TaskContext>, 
        
           ) -> QueryResult<SendableRecordBatchStream> { 
        
               let tracing_context = TracingContext::from_json(context.session_id().as_str()); 
        
               let span = tracing_context.attach(common_telemetry::tracing::info_span!("stream_adapter")); 
        
               let mut stream = self.stream.lock().unwrap(); 
        
               let stream = stream.take().context(query_error::ExecuteRepeatedlySnafu)?; 
        
               let mem_usage_metrics = MemoryUsageMetrics::new(&self.metric, partition); 
        
               Ok(Box::pin(StreamWithMetricWrapper { 
        
                   stream, 
        
                   metric: mem_usage_metrics, 
        
                   span, 
        
               })) 
        
           }

The text was updated successfully, but these errors were encountered:

evenyag · 2024-05-20T13:12:53Z

closed by #3948

evenyag added the C-performance Category Performance label May 8, 2024

evenyag self-assigned this May 8, 2024

evenyag added this to mito2 May 8, 2024

This was referenced May 8, 2024

Support parallel scan in mito engine #2806

Closed

refactor: Remove PhysicalPlan trait and use ExecutionPlan directly #3894

Merged

evenyag mentioned this issue May 15, 2024

feat: Adds RegionScanner trait #3948

Merged

3 tasks

evenyag closed this as completed May 20, 2024

github-project-automation bot moved this to Done in mito2 May 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Partitioned scan for RegionEngine #3886

Partitioned scan for RegionEngine #3886

evenyag commented May 8, 2024 •

edited

Loading

evenyag commented May 20, 2024

Partitioned scan for RegionEngine #3886

Partitioned scan for RegionEngine #3886

Comments

evenyag commented May 8, 2024 • edited Loading

What type of enhancement is this?

What does the enhancement do?

Implementation challenges

evenyag commented May 20, 2024

evenyag commented May 8, 2024 •

edited

Loading