Struct re_dataframe::query::QueryHandle
source · pub struct QueryHandle<E: StorageEngineLike> {
pub(crate) engine: E,
pub(crate) query: QueryExpression,
state: OnceLock<QueryHandleState>,
}
Expand description
A handle to a dataframe query, ready to be executed.
Cheaply created via QueryEngine::query
.
Fields§
§engine: E
Handle to the QueryEngine
.
query: QueryExpression
The original query expression used to instantiate this handle.
state: OnceLock<QueryHandleState>
Internal private state. Lazily computed.
It is important that handles stay cheap to create.
Implementations§
source§impl<E: StorageEngineLike> QueryHandle<E>
impl<E: StorageEngineLike> QueryHandle<E>
pub(crate) fn new(engine: E, query: QueryExpression) -> Self
source§impl<E: StorageEngineLike> QueryHandle<E>
impl<E: StorageEngineLike> QueryHandle<E>
sourcefn init(&self) -> &QueryHandleState
fn init(&self) -> &QueryHandleState
Lazily initialize internal private state.
It is important that query handles stay cheap to create.
fn init_(&self, store: &ChunkStore, cache: &QueryCache) -> QueryHandleState
fn compute_user_selection( &self, view_contents: &[ColumnDescriptor], selection: &[ColumnSelector], ) -> Vec<(usize, ColumnDescriptor)>
fn fetch_view_chunks( &self, store: &ChunkStore, cache: &QueryCache, query: &RangeQuery, view_contents: &[ColumnDescriptor], ) -> (Option<usize>, Vec<Vec<(AtomicU64, Chunk)>>)
sourcefn fetch_clear_chunks(
&self,
store: &ChunkStore,
cache: &QueryCache,
query: &RangeQuery,
view_contents: &[ColumnDescriptor],
) -> IntMap<EntityPath, Vec<Chunk>>
fn fetch_clear_chunks( &self, store: &ChunkStore, cache: &QueryCache, query: &RangeQuery, view_contents: &[ColumnDescriptor], ) -> IntMap<EntityPath, Vec<Chunk>>
Returns all potentially relevant clear Chunk
s for each unique entity path in the view contents.
These chunks take recursive clear semantics into account and are guaranteed to be properly densified. The component data is stripped out, only the indices are left.
fn fetch_chunks<'a>( &self, _store: &ChunkStore, cache: &QueryCache, query: &RangeQuery, entity_path: &EntityPath, component_descrs: impl IntoIterator<Item = &'a ComponentDescriptor>, ) -> Option<Vec<(AtomicU64, Chunk)>>
sourcepub fn query(&self) -> &QueryExpression
pub fn query(&self) -> &QueryExpression
The query used to instantiate this handle.
sourcepub fn view_contents(&self) -> &[ColumnDescriptor]
pub fn view_contents(&self) -> &[ColumnDescriptor]
Describes the columns that make up this view.
sourcepub fn selected_contents(&self) -> &[(usize, ColumnDescriptor)]
pub fn selected_contents(&self) -> &[(usize, ColumnDescriptor)]
Describes the columns that make up this selection.
The extra usize
is the index in Self::view_contents
that this selection points to.
sourcepub fn schema(&self) -> &Arrow2Schema
pub fn schema(&self) -> &Arrow2Schema
All results returned by this handle will strictly follow this Arrow schema.
Columns that do not yield any data will still be present in the results, filled with null values.
sourcepub fn seek_to_row(&self, row_idx: usize)
pub fn seek_to_row(&self, row_idx: usize)
Advance all internal cursors so that the next row yielded will correspond to row_idx
.
Does nothing if row_idx
is out of bounds.
§Concurrency
Cursors are implemented using atomic variables, which means calling any of the seek_*
while iteration is concurrently ongoing is memory-safe but logically undefined racy
behavior. Be careful.
§Performance
This requires going through every chunk once, and for each chunk running a binary search if
the chunk’s time range contains the index_value
.
I.e.: it’s pretty cheap already.
sourcefn seek_to_index_value(&self, index_value: IndexValue)
fn seek_to_index_value(&self, index_value: IndexValue)
Advance all internal cursors so that the next row yielded will correspond to index_value
.
If index_value
isn’t present in the dataset, this seeks to the first index value
available past that point, if any.
§Concurrency
Cursors are implemented using atomic variables, which means calling any of the seek_*
while iteration is concurrently ongoing is memory-safe but logically undefined racy
behavior. Be careful.
§Performance
This requires going through every chunk once, and for each chunk running a binary search if
the chunk’s time range contains the index_value
.
I.e.: it’s pretty cheap already.
sourcepub fn num_rows(&self) -> u64
pub fn num_rows(&self) -> u64
How many rows of data will be returned?
The number of rows depends and only depends on the view contents. The selected contents has no influence on this value.
sourcepub fn next_row(&self) -> Option<Vec<Box<dyn Arrow2Array>>>
pub fn next_row(&self) -> Option<Vec<Box<dyn Arrow2Array>>>
Returns the next row’s worth of data.
The returned vector of Arrow arrays strictly follows the schema specified by Self::schema
.
Columns that do not yield any data will still be present in the results, filled with null values.
Each cell in the result corresponds to the latest locally known value at that particular point in
the index, for each respective ColumnDescriptor
.
See QueryExpression::sparse_fill_strategy
to go beyond local resolution.
Example:
while let Some(row) = query_handle.next_row() {
// …
}
§Pagination
Use Self::seek_to_row
:
query_handle.seek_to_row(42);
for row in query_handle.into_iter().take(len) {
// …
}
sourcepub fn next_row_async(
&self,
) -> impl Future<Output = Option<Vec<Box<dyn Arrow2Array>>>>
pub fn next_row_async( &self, ) -> impl Future<Output = Option<Vec<Box<dyn Arrow2Array>>>>
Asynchronously returns the next row’s worth of data.
The returned vector of Arrow arrays strictly follows the schema specified by Self::schema
.
Columns that do not yield any data will still be present in the results, filled with null values.
Each cell in the result corresponds to the latest locally known value at that particular point in
the index, for each respective ColumnDescriptor
.
See QueryExpression::sparse_fill_strategy
to go beyond local resolution.
Example:
while let Some(row) = query_handle.next_row_async().await {
// …
}
pub fn _next_row( &self, store: &ChunkStore, cache: &QueryCache, ) -> Option<Vec<Box<dyn Arrow2Array>>>
sourcepub fn next_row_batch(&self) -> Option<RecordBatch>
pub fn next_row_batch(&self) -> Option<RecordBatch>
Calls Self::next_row
and wraps the result in a RecordBatch
.
Only use this if you absolutely need a RecordBatch
as this adds a lot of allocation
overhead.
See Self::next_row
for more information.
pub async fn next_row_batch_async(&self) -> Option<RecordBatch>
source§impl<E: StorageEngineLike> QueryHandle<E>
impl<E: StorageEngineLike> QueryHandle<E>
sourcepub fn iter(&self) -> impl Iterator<Item = Vec<Box<dyn Arrow2Array>>> + '_
pub fn iter(&self) -> impl Iterator<Item = Vec<Box<dyn Arrow2Array>>> + '_
Returns an iterator backed by Self::next_row
.
sourcepub fn into_iter(self) -> impl Iterator<Item = Vec<Box<dyn Arrow2Array>>>
pub fn into_iter(self) -> impl Iterator<Item = Vec<Box<dyn Arrow2Array>>>
Returns an iterator backed by Self::next_row
.
sourcepub fn batch_iter(&self) -> impl Iterator<Item = RecordBatch> + '_
pub fn batch_iter(&self) -> impl Iterator<Item = RecordBatch> + '_
Returns an iterator backed by Self::next_row_batch
.
sourcepub fn into_batch_iter(self) -> impl Iterator<Item = RecordBatch>
pub fn into_batch_iter(self) -> impl Iterator<Item = RecordBatch>
Returns an iterator backed by Self::next_row_batch
.
Auto Trait Implementations§
impl<E> !Freeze for QueryHandle<E>
impl<E> !RefUnwindSafe for QueryHandle<E>
impl<E> Send for QueryHandle<E>where
E: Send,
impl<E> Sync for QueryHandle<E>where
E: Sync,
impl<E> Unpin for QueryHandle<E>where
E: Unpin,
impl<E> !UnwindSafe for QueryHandle<E>
Blanket Implementations§
source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
source§impl<T> CheckedAs for T
impl<T> CheckedAs for T
source§fn checked_as<Dst>(self) -> Option<Dst>where
T: CheckedCast<Dst>,
fn checked_as<Dst>(self) -> Option<Dst>where
T: CheckedCast<Dst>,
source§impl<Src, Dst> CheckedCastFrom<Src> for Dstwhere
Src: CheckedCast<Dst>,
impl<Src, Dst> CheckedCastFrom<Src> for Dstwhere
Src: CheckedCast<Dst>,
source§fn checked_cast_from(src: Src) -> Option<Dst>
fn checked_cast_from(src: Src) -> Option<Dst>
§impl<T> Downcast for Twhere
T: Any,
impl<T> Downcast for Twhere
T: Any,
§fn into_any(self: Box<T>) -> Box<dyn Any>
fn into_any(self: Box<T>) -> Box<dyn Any>
Box<dyn Trait>
(where Trait: Downcast
) to Box<dyn Any>
. Box<dyn Any>
can
then be further downcast
into Box<ConcreteType>
where ConcreteType
implements Trait
.§fn into_any_rc(self: Rc<T>) -> Rc<dyn Any>
fn into_any_rc(self: Rc<T>) -> Rc<dyn Any>
Rc<Trait>
(where Trait: Downcast
) to Rc<Any>
. Rc<Any>
can then be
further downcast
into Rc<ConcreteType>
where ConcreteType
implements Trait
.§fn as_any(&self) -> &(dyn Any + 'static)
fn as_any(&self) -> &(dyn Any + 'static)
&Trait
(where Trait: Downcast
) to &Any
. This is needed since Rust cannot
generate &Any
’s vtable from &Trait
’s.§fn as_any_mut(&mut self) -> &mut (dyn Any + 'static)
fn as_any_mut(&mut self) -> &mut (dyn Any + 'static)
&mut Trait
(where Trait: Downcast
) to &Any
. This is needed since Rust cannot
generate &mut Any
’s vtable from &mut Trait
’s.§impl<T> DowncastSync for T
impl<T> DowncastSync for T
§impl<T> Instrument for T
impl<T> Instrument for T
§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
source§impl<T> IntoEither for T
impl<T> IntoEither for T
source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
self
into a Left
variant of Either<Self, Self>
if into_left
is true
.
Converts self
into a Right
variant of Either<Self, Self>
otherwise. Read moresource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
self
into a Left
variant of Either<Self, Self>
if into_left(&self)
returns true
.
Converts self
into a Right
variant of Either<Self, Self>
otherwise. Read moresource§impl<T> IntoRequest<T> for T
impl<T> IntoRequest<T> for T
source§fn into_request(self) -> Request<T>
fn into_request(self) -> Request<T>
T
in a tonic::Request