A platform for high-performance distributed tool and library development written in C++. It can be deployed in two different cluster modes: standalone or distributed. API for v0.5.0, released on June 13, 2018.
#include <HashPartitionSink.h>
Public Member Functions
HashPartitionSink (int numPartitions, int numNodes, TupleSpec &inputSchema, TupleSpec &attsToOperateOn)
Handle< Object > | createNewOutputContainer () override
void | writeOut (TupleSetPtr input, Handle< Object > &writeToMe) override
~HashPartitionSink ()
Public Member Functions inherited from pdb::ComputeSink
virtual | ~ComputeSink ()
Private Attributes
int | whichAttToHash
int | whichAttToStore
int | numPartitions
int | numNodes
Definition at line 34 of file HashPartitionSink.h.
HashPartitionSink (int numPartitions, int numNodes, TupleSpec &inputSchema, TupleSpec &attsToOperateOn)
inline
Constructor.
numPartitions | number of partitions in the cluster
inputSchema | the schema of the input tuple set
attsToOperateOn | the column that we want to partition and keep
Definition at line 45 of file HashPartitionSink.h.
~HashPartitionSink ()
inline
Definition at line 126 of file HashPartitionSink.h.
Handle< Object > createNewOutputContainer () override
inline override virtual
Creates the container for the output.
Implements pdb::ComputeSink.
Definition at line 67 of file HashPartitionSink.h.
void writeOut (TupleSetPtr input, Handle< Object > &writeToMe) override
inline override virtual
Partitions the input tuple set and stores the partitioned result in the output container.
input | the input tuple set
writeToMe | the output container
Implements pdb::ComputeSink.
Definition at line 87 of file HashPartitionSink.h.
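The general idea of hash partitioning into an output container can be sketched in standalone C++. This is an illustration only, not the code in HashPartitionSink.h: it does not use TupleSetPtr, Handle< Object >, or TupleSpec. The names Record, PartitionedOutput, partitionOf, makeBuckets, and partitionInto are hypothetical, and choosing the bucket as std::hash of the key modulo the partition count is an assumed scheme.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a tuple: a key column to hash and a value column to keep.
using Record = std::pair<std::string, int>;

// Hypothetical output container: one bucket of records per partition.
using PartitionedOutput = std::vector<std::vector<Record>>;

// Map a key to a partition index in [0, numPartitions).
// Assumption: std::hash modulo the partition count selects the bucket.
std::size_t partitionOf(const std::string& key, std::size_t numPartitions) {
    assert(numPartitions > 0);
    return std::hash<std::string>{}(key) % numPartitions;
}

// Create an empty output container with one bucket per partition.
PartitionedOutput makeBuckets(std::size_t numPartitions) {
    return PartitionedOutput(numPartitions);
}

// Append each input record to the bucket selected by the hash of its key.
void partitionInto(const std::vector<Record>& input, PartitionedOutput& writeToMe) {
    for (const Record& r : input) {
        writeToMe[partitionOf(r.first, writeToMe.size())].push_back(r);
    }
}
```

In this sketch, a caller would build the container once with makeBuckets(n) and then call partitionInto for each batch of input. Records that share a key always land in the same bucket, which is the property a partitioning sink relies on.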
int numNodes
private
Definition at line 140 of file HashPartitionSink.h.
int numPartitions
private
Definition at line 137 of file HashPartitionSink.h.
int whichAttToHash
private
Definition at line 131 of file HashPartitionSink.h.
int whichAttToStore
private
Definition at line 134 of file HashPartitionSink.h.