Addr of buf1 = 0x7facb9c7d010
Offs of buf1 = 0x7facb9c7d180
Addr of buf2 = 0x7facb7c7c010
Offs of buf2 = 0x7facb7c7c1c0
Addr of buf3 = 0x7facb5c7b010
Offs of buf3 = 0x7facb5c7b100
Addr of buf4 = 0x7facb3c7a010
Offs of buf4 = 0x7facb3c7a140
Threads #: 16 Pthreads
Matrix size: 2048
Using multiply kernel: multiply1
Execution time = 6.889 seconds
