Problem Based Benchmark Suite (2020)

addDataSeq Data Generator:

addDataSeq [-t {int,double}] [-r <m>] <infile> <outfile>

This generator takes a sequence of simple type <S> (Int, Double, String) in the sequence file format and creates a sequence of pairs of type <S><T>Pair by adding to each element a value of type <T>. The type <S> is determined from the input file and the type <T> is specified by the -t argument (int by default). The added values are generated uniformly at random. For doudble they are generated in the range [0,1) and for type int they are generated in the range [0:m), where m is specified by the -r argument (m = n by default).

last modified 17:46, 20 Sep 2020

This project has been funded by the following sources:
Intel Labs Academic Research Office for the Parallel Algorithms for Non-Numeric Computing Program,
National Science Foundation, and
IBM Research.