================================================================================================
Benchmark for performance of JSON parsing
================================================================================================

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
JSON schema inferring:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        2084           2134          46          2.4         416.8       1.0X
UTF-8 is set                                       3077           3093          14          1.6         615.3       0.7X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
count a short column:                     Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        2854           2863           8          1.8         570.8       1.0X
UTF-8 is set                                       4066           4066           1          1.2         813.1       0.7X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
count a wide column:                      Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        3348           3368          26          0.3        3347.8       1.0X
UTF-8 is set                                       5215           5239          22          0.2        5214.7       0.6X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
select wide row:                          Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                       11046          11102          54          0.0      220928.4       1.0X
UTF-8 is set                                      12135          12181          54          0.0      242697.4       0.9X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
Select a subset of 10 columns:            Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Select 10 columns                                  2486           2488           2          0.4        2486.5       1.0X
Select 1 column                                    1505           1506           2          0.7        1504.6       1.7X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
creation of JSON parser per line:         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Short column without encoding                       888            889           3          1.1         887.6       1.0X
Short column with UTF-8                            1134           1136           2          0.9        1134.3       0.8X
Wide column without encoding                       8012           8056          51          0.1        8012.4       0.1X
Wide column with UTF-8                             9830           9844          22          0.1        9829.7       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
JSON functions:                           Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                            85             87           2         11.7          85.4       1.0X
from_json                                          1706           1711           4          0.6        1706.4       0.1X
json_tuple                                         1528           1534           7          0.7        1528.2       0.1X
get_json_object                                    1275           1286          17          0.8        1275.0       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
Dataset of json strings:                  Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                           369            370           1         13.6          73.8       1.0X
schema inferring                                   1880           1883           4          2.7         376.0       0.2X
parsing                                            3731           3737           8          1.3         746.1       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
Json files in the per-line mode:          Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                           553            579          32          9.0         110.6       1.0X
Schema inferring                                   2195           2196           2          2.3         439.0       0.3X
Parsing without charset                            4272           4274           3          1.2         854.3       0.1X
Parsing with UTF-8                                 5459           5464           6          0.9        1091.7       0.1X

OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
Write dates and timestamps:               Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Create a dataset of timestamps                      102            112          13          9.8         101.9       1.0X
to_json(timestamp)                                  840            841           1          1.2         839.6       0.1X
write timestamps to files                           692            696           4          1.4         692.0       0.1X
Create a dataset of dates                           120            121           1          8.4         119.7       0.9X
to_json(date)                                       589            591           2          1.7         589.3       0.2X
write dates to files                                442            445           2          2.3         442.3       0.2X

OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
Read dates and timestamps:                                             Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
-----------------------------------------------------------------------------------------------------------------------------------------------------
read timestamp text from files                                                   143            145           4          7.0         142.6       1.0X
read timestamps from files                                                      2449           2469          17          0.4        2448.6       0.1X
infer timestamps from files                                                     5579           5596          15          0.2        5578.8       0.0X
read date text from files                                                        132            139           7          7.6         131.7       1.1X
read date from files                                                            1017           1020           2          1.0        1017.5       0.1X
timestamp strings                                                                227            230           3          4.4         227.2       0.6X
parse timestamps from Dataset[String]                                           2827           2830           3          0.4        2826.5       0.1X
infer timestamps from Dataset[String]                                           6001           6008           6          0.2        6001.2       0.0X
date strings                                                                     259            261           2          3.9         259.0       0.6X
parse dates from Dataset[String]                                                1382           1387           4          0.7        1382.3       0.1X
from_json(timestamp)                                                            3557           3561           7          0.3        3556.8       0.0X
from_json(date)                                                                 2146           2148           2          0.5        2146.4       0.1X
infer error timestamps from Dataset[String] with default format                 1989           1993           4          0.5        1989.3       0.1X
infer error timestamps from Dataset[String] with user-provided format           1922           1925           3          0.5        1922.1       0.1X
infer error timestamps from Dataset[String] with legacy format                  1919           1923           4          0.5        1919.1       0.1X

OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
Filters pushdown:                         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
w/o filters                                       14387          14399          12          0.0      143871.9       1.0X
pushdown disabled                                 13891          13899           7          0.0      138912.3       1.0X
w/ filters                                          782            784           2          0.1        7820.5      18.4X

OpenJDK 64-Bit Server VM 1.8.0_292-8u292-b10-0ubuntu1~18.04-b10 on Linux 5.4.0-1045-aws
Intel(R) Xeon(R) Platinum 8375C CPU @ 2.90GHz
Partial JSON results:                     Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
parse invalid JSON                                 3100           3106           6          0.0      309967.6       1.0X
