Replies: 3 comments 9 replies
-
Hi, Yes, we follow the TPC-C specification for random string generation, but if I understand the question you are saying that the default random strings from the specification do not compress very well when using solutions to compress the table data because they are well ... random and in some cases non-uniform. So the question is, is it possible to make the generated data less random, so it will compress better. It is interesting that sysbench does this. It is not something we plan to do, however it is all open source, so you can change the generation of data as you wish. If you look in the modules directory you can find the common data generation functions in Using the example below for generating the address as you use, here is the function that does that:
and make address gets called here in the build script:
So for a similar effect you could either change Make Address or the build functions to prepend the random data with name-, street1- etc. Or for a really simple case of making things non-random change the character array (globArray) to have less options to select random data from, e.g. instead of all the alphabet, reduce the available options to fewer letters. |
Beta Was this translation helpful? Give feedback.
-
Anything you change in |
Beta Was this translation helpful? Give feedback.
-
Hi @sm-shaw ! I finished with first version. I changed scripts and now they generate compressible data like in the sysbench-tpcc ( Compression factor near 3.5 ) for MySQL. Next step it will be change tcl script for Oracle and Postgres. Can you please check my git-repository ( https://github.com/vbienf/TPC-C-compressions-for-HammerDB ) and review this changes? I need your approve that I changed only data generation and no more. |
Beta Was this translation helpful? Give feedback.
-
Hi all!
I didn't find any information how to generate compressible data with HammerDB. F.e. for sysbench I using for each string specific mask:
link to source - https://github.com/Percona-Lab/sysbench-tpcc/blob/master/tpcc_common.lua
This mask helps to generate compressible data for my test.
How I can add a specific mask to string to generate data with HammerDB?
Beta Was this translation helpful? Give feedback.
All reactions