Commit
[lhe] ** COMPLETE LHE PART 4 (RANDOM COLOR) ** rerun 15 tmad and 60 tput alltees - finally all ok

This completes the random color choice #402

This took around 8 hours from 1h to 9h
STARTED  AT Sun Dec 18 07:32:03 CET 2022
./tput/teeThroughputX.sh -flt -hrd -makej -eemumu -ggtt -ggttg -ggttgg -ggttggg -makeclean
ENDED(1) AT Sun Dec 18 08:39:14 CET 2022 [Status=0]
./tput/teeThroughputX.sh -flt -hrd -makej -eemumu -ggtt -ggttgg -inlonly -makeclean
ENDED(2) AT Sun Dec 18 09:00:20 CET 2022 [Status=0]
./tput/teeThroughputX.sh -makej -eemumu -ggtt -ggttg -ggttgg -ggttggg -flt -bridge -makeclean
ENDED(3) AT Sun Dec 18 09:09:51 CET 2022 [Status=0]
./tput/teeThroughputX.sh -eemumu -ggtt -ggttgg -flt -rmbhst
ENDED(4) AT Sun Dec 18 09:13:28 CET 2022 [Status=0]
./tput/teeThroughputX.sh -eemumu -ggtt -ggttgg -flt -curhst
ENDED(5) AT Sun Dec 18 09:17:01 CET 2022 [Status=0]
valassi committed Dec 18, 2022
1 parent df6d852 commit 7b306eb
Showing 75 changed files with 5,159 additions and 5,072 deletions.
132 changes: 66 additions & 66 deletions epochX/cudacpp/tmad/logs_eemumu_mad/log_eemumu_mad_d_inl0_hrd0.txt
130 changes: 65 additions & 65 deletions epochX/cudacpp/tmad/logs_eemumu_mad/log_eemumu_mad_f_inl0_hrd0.txt
134 changes: 67 additions & 67 deletions epochX/cudacpp/tmad/logs_eemumu_mad/log_eemumu_mad_m_inl0_hrd0.txt
130 changes: 65 additions & 65 deletions epochX/cudacpp/tmad/logs_ggtt_mad/log_ggtt_mad_d_inl0_hrd0.txt
132 changes: 66 additions & 66 deletions epochX/cudacpp/tmad/logs_ggtt_mad/log_ggtt_mad_f_inl0_hrd0.txt
134 changes: 67 additions & 67 deletions epochX/cudacpp/tmad/logs_ggtt_mad/log_ggtt_mad_m_inl0_hrd0.txt
136 changes: 68 additions & 68 deletions epochX/cudacpp/tmad/logs_ggttg_mad/log_ggttg_mad_d_inl0_hrd0.txt
128 changes: 64 additions & 64 deletions epochX/cudacpp/tmad/logs_ggttg_mad/log_ggttg_mad_f_inl0_hrd0.txt
130 changes: 65 additions & 65 deletions epochX/cudacpp/tmad/logs_ggttg_mad/log_ggttg_mad_m_inl0_hrd0.txt
899 changes: 464 additions & 435 deletions epochX/cudacpp/tmad/logs_ggttgg_mad/log_ggttgg_mad_d_inl0_hrd0.txt
899 changes: 464 additions & 435 deletions epochX/cudacpp/tmad/logs_ggttgg_mad/log_ggttgg_mad_f_inl0_hrd0.txt
897 changes: 463 additions & 434 deletions epochX/cudacpp/tmad/logs_ggttgg_mad/log_ggttgg_mad_m_inl0_hrd0.txt
136 changes: 68 additions & 68 deletions epochX/cudacpp/tmad/logs_ggttggg_mad/log_ggttggg_mad_d_inl0_hrd0.txt
136 changes: 68 additions & 68 deletions epochX/cudacpp/tmad/logs_ggttggg_mad/log_ggttggg_mad_f_inl0_hrd0.txt
134 changes: 67 additions & 67 deletions epochX/cudacpp/tmad/logs_ggttggg_mad/log_ggttggg_mad_m_inl0_hrd0.txt

Large diffs are not rendered by default.

86 changes: 43 additions & 43 deletions epochX/cudacpp/tput/logs_eemumu_mad/log_eemumu_mad_d_inl0_hrd0.txt
@@ -35,22 +35,22 @@ make[1]: Entering directory `/data/avalassi/GPU2020/madgraph4gpuX/epochX/cudacpp
make[1]: Nothing to be done for `all'.
make[1]: Leaving directory `/data/avalassi/GPU2020/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_ll_ll'

DATE: 2022-12-17_04:13:02
DATE: 2022-12-18_08:25:10

On itscrd70.cern.ch [CPU: Intel(R) Xeon(R) Silver 4216 CPU] [GPU: 1x Tesla V100S-PCIE-32GB]:
=========================================================================
runExe /data/avalassi/GPU2020/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_ll_ll/build.none_d_inl0_hrd0/gcheck.exe -p 2048 256 12 OMP=
Process = SIGMA_SM_EPEM_MUPMUM_CUDA [nvcc 11.7.99 (gcc 11.2.0)] [inlineHel=0] [hardcodePARAM=0]
Workflow summary = CUD:DBL+THX:CURDEV+RMBDEV+MESDEV/none+NAVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
EvtsPerSec[Rmb+ME] (23) = ( 5.535517e+07 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 5.159528e+08 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 8.722821e+08 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 5.542585e+07 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 5.015904e+08 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 8.831455e+08 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 1.027292 sec
471,359,919 cycles:u # 0.458 GHz
806,095,810 instructions:u # 1.71 insn per cycle
1.507308389 seconds time elapsed
TOTAL : 0.755650 sec
468,579,154 cycles:u # 0.473 GHz
809,589,748 instructions:u # 1.73 insn per cycle
1.112841852 seconds time elapsed
==PROF== Profiling "sigmaKin": launch__registers_per_thread 148
==PROF== Profiling "sigmaKin": sm__sass_average_branch_targets_threads_uniform.pct 100%
-------------------------------------------------------------------------
@@ -66,14 +66,14 @@ Process = SIGMA_SM_EPEM_MUPMUM_CPP [gcc 11.2.0] [inlineHel=0
Workflow summary = CPP:DBL+CXS:CURHST+RMBHST+MESHST/none+NAVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
Internal loops fptype_sv = SCALAR ('none': ~vector[1], no SIMD)
EvtsPerSec[Rmb+ME] (23) = ( 9.271626e+05 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 1.299344e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 1.299344e+06 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 9.248174e+05 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 1.292975e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 1.292975e+06 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 7.284241 sec
19,282,455,241 cycles:u # 2.641 GHz
42,965,017,429 instructions:u # 2.23 insn per cycle
7.555554229 seconds time elapsed
TOTAL : 7.303382 sec
19,341,935,467 cycles:u # 2.642 GHz
42,965,017,260 instructions:u # 2.22 insn per cycle
7.350387114 seconds time elapsed
=Symbols in CPPProcess.o= (~sse4: 362) (avx2: 0) (512y: 0) (512z: 0)
-------------------------------------------------------------------------
runExe /data/avalassi/GPU2020/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_ll_ll/build.none_d_inl0_hrd0/runTest.exe
@@ -91,14 +91,14 @@ Process = SIGMA_SM_EPEM_MUPMUM_CPP [gcc 11.2.0] [inlineHel=0
Workflow summary = CPP:DBL+CXS:CURHST+RMBHST+MESHST/sse4+CXVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
Internal loops fptype_sv = VECTOR[2] ('sse4': SSE4.2, 128bit) [cxtype_ref=YES]
EvtsPerSec[Rmb+ME] (23) = ( 1.490052e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 2.819962e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 2.819962e+06 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 1.483867e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 2.821019e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 2.821019e+06 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 4.717002 sec
12,429,056,804 cycles:u # 2.626 GHz
27,349,468,889 instructions:u # 2.20 insn per cycle
4.842404733 seconds time elapsed
TOTAL : 4.740640 sec
12,479,822,002 cycles:u # 2.626 GHz
27,349,470,345 instructions:u # 2.19 insn per cycle
4.776880584 seconds time elapsed
=Symbols in CPPProcess.o= (~sse4: 1271) (avx2: 0) (512y: 0) (512z: 0)
-------------------------------------------------------------------------
runExe /data/avalassi/GPU2020/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_ll_ll/build.sse4_d_inl0_hrd0/runTest.exe
@@ -116,14 +116,14 @@ Process = SIGMA_SM_EPEM_MUPMUM_CPP [gcc 11.2.0] [inlineHel=0
Workflow summary = CPP:DBL+CXS:CURHST+RMBHST+MESHST/avx2+CXVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
Internal loops fptype_sv = VECTOR[4] ('avx2': AVX2, 256bit) [cxtype_ref=YES]
EvtsPerSec[Rmb+ME] (23) = ( 1.947324e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 4.939320e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 4.939320e+06 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 1.937918e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 4.938666e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 4.938666e+06 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 3.716934 sec
9,252,033,814 cycles:u # 2.479 GHz
16,334,068,187 instructions:u # 1.77 insn per cycle
3.783164369 seconds time elapsed
TOTAL : 3.751869 sec
9,307,395,922 cycles:u # 2.479 GHz
16,334,069,525 instructions:u # 1.75 insn per cycle
3.790887207 seconds time elapsed
=Symbols in CPPProcess.o= (~sse4: 0) (avx2: 1046) (512y: 0) (512z: 0)
-------------------------------------------------------------------------
runExe /data/avalassi/GPU2020/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_ll_ll/build.avx2_d_inl0_hrd0/runTest.exe
@@ -141,14 +141,14 @@ Process = SIGMA_SM_EPEM_MUPMUM_CPP [gcc 11.2.0] [inlineHel=0
Workflow summary = CPP:DBL+CXS:CURHST+RMBHST+MESHST/512y+CXVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
Internal loops fptype_sv = VECTOR[4] ('512y': AVX512, 256bit) [cxtype_ref=YES]
EvtsPerSec[Rmb+ME] (23) = ( 1.987289e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 5.212046e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 5.212046e+06 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 1.990092e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 5.253769e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 5.253769e+06 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 3.652304 sec
9,103,884,093 cycles:u # 2.482 GHz
15,962,872,487 instructions:u # 1.75 insn per cycle
3.752595567 seconds time elapsed
TOTAL : 3.662366 sec
9,109,409,090 cycles:u # 2.483 GHz
15,962,872,836 instructions:u # 1.75 insn per cycle
3.715378969 seconds time elapsed
=Symbols in CPPProcess.o= (~sse4: 0) (avx2: 1001) (512y: 0) (512z: 0)
-------------------------------------------------------------------------
runExe /data/avalassi/GPU2020/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_ll_ll/build.512y_d_inl0_hrd0/runTest.exe
@@ -166,14 +166,14 @@ Process = SIGMA_SM_EPEM_MUPMUM_CPP [gcc 11.2.0] [inlineHel=0
Workflow summary = CPP:DBL+CXS:CURHST+RMBHST+MESHST/512z+CXVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
Internal loops fptype_sv = VECTOR[8] ('512z': AVX512, 512bit) [cxtype_ref=YES]
EvtsPerSec[Rmb+ME] (23) = ( 1.930997e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 4.832456e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 4.832456e+06 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 1.921632e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 4.786086e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 4.786086e+06 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 3.749659 sec
8,421,322,282 cycles:u # 2.237 GHz
12,773,070,331 instructions:u # 1.52 insn per cycle
3.865634440 seconds time elapsed
TOTAL : 3.786244 sec
8,460,862,856 cycles:u # 2.235 GHz
12,773,070,715 instructions:u # 1.51 insn per cycle
3.825116150 seconds time elapsed
=Symbols in CPPProcess.o= (~sse4: 0) (avx2: 266) (512y: 0) (512z: 683)
-------------------------------------------------------------------------
runExe /data/avalassi/GPU2020/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_ll_ll/build.512z_d_inl0_hrd0/runTest.exe
@@ -35,7 +35,7 @@ make[1]: Entering directory `/data/avalassi/GPU2020/madgraph4gpuX/epochX/cudacpp
make[1]: Nothing to be done for `all'.
make[1]: Leaving directory `/data/avalassi/GPU2020/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_ll_ll'

DATE: 2022-12-17_04:51:03
DATE: 2022-12-18_09:02:41

On itscrd70.cern.ch [CPU: Intel(R) Xeon(R) Silver 4216 CPU] [GPU: 1x Tesla V100S-PCIE-32GB]:
=========================================================================
@@ -47,14 +47,14 @@ WARNING! Set grid in Bridge (nevt=524288, gpublocks=2048, gputhreads=256, gpublo
Process = SIGMA_SM_EPEM_MUPMUM_CUDA [nvcc 11.7.99 (gcc 11.2.0)] [inlineHel=0] [hardcodePARAM=0]
Workflow summary = CUD:DBL+THX:CURHST+RMBHST+BRDDEV/none+NAVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
EvtsPerSec[Rmb+ME] (23) = ( 2.579687e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 3.473373e+07 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 3.473373e+07 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 2.562154e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 3.332593e+07 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 3.332593e+07 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 3.390014 sec
7,340,447,866 cycles:u # 2.012 GHz
11,742,011,278 instructions:u # 1.60 insn per cycle
3.705930636 seconds time elapsed
TOTAL : 3.408698 sec
7,348,723,205 cycles:u # 2.005 GHz
11,759,458,594 instructions:u # 1.60 insn per cycle
3.725813147 seconds time elapsed
==PROF== Profiling "sigmaKin": launch__registers_per_thread 148
==PROF== Profiling "sigmaKin": sm__sass_average_branch_targets_threads_uniform.pct 100%
-------------------------------------------------------------------------
@@ -71,14 +71,14 @@ Process = SIGMA_SM_EPEM_MUPMUM_CPP [gcc 11.2.0] [inlineHel=0
Workflow summary = CPP:DBL+CXS:CURHST+RMBHST+BRDHST/none+NAVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
Internal loops fptype_sv = SCALAR ('none': ~vector[1], no SIMD)
EvtsPerSec[Rmb+ME] (23) = ( 8.999193e+05 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 1.241932e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 1.241932e+06 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 8.994121e+05 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 1.241030e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 1.241030e+06 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 7.739105 sec
20,326,597,375 cycles:u # 2.619 GHz
43,370,678,181 instructions:u # 2.13 insn per cycle
7.765892080 seconds time elapsed
TOTAL : 7.748524 sec
20,353,390,532 cycles:u # 2.618 GHz
43,370,678,896 instructions:u # 2.13 insn per cycle
7.776587549 seconds time elapsed
=Symbols in CPPProcess.o= (~sse4: 362) (avx2: 0) (512y: 0) (512z: 0)
-------------------------------------------------------------------------
runExe /data/avalassi/GPU2020/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_ll_ll/build.none_d_inl0_hrd0/runTest.exe
@@ -97,14 +97,14 @@ Process = SIGMA_SM_EPEM_MUPMUM_CPP [gcc 11.2.0] [inlineHel=0
Workflow summary = CPP:DBL+CXS:CURHST+RMBHST+BRDHST/sse4+CXVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
Internal loops fptype_sv = VECTOR[2] ('sse4': SSE4.2, 128bit) [cxtype_ref=YES]
EvtsPerSec[Rmb+ME] (23) = ( 1.411287e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 2.551591e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 2.551591e+06 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 1.411244e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 2.550241e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 2.550241e+06 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 5.218729 sec
13,560,426,162 cycles:u # 2.590 GHz
28,213,884,053 instructions:u # 2.08 insn per cycle
5.245394860 seconds time elapsed
TOTAL : 5.211440 sec
13,569,330,178 cycles:u # 2.592 GHz
28,213,884,980 instructions:u # 2.08 insn per cycle
5.239336730 seconds time elapsed
=Symbols in CPPProcess.o= (~sse4: 1271) (avx2: 0) (512y: 0) (512z: 0)
-------------------------------------------------------------------------
runExe /data/avalassi/GPU2020/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_ll_ll/build.sse4_d_inl0_hrd0/runTest.exe
@@ -123,14 +123,14 @@ Process = SIGMA_SM_EPEM_MUPMUM_CPP [gcc 11.2.0] [inlineHel=0
Workflow summary = CPP:DBL+CXS:CURHST+RMBHST+BRDHST/avx2+CXVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
Internal loops fptype_sv = VECTOR[4] ('avx2': AVX2, 256bit) [cxtype_ref=YES]
EvtsPerSec[Rmb+ME] (23) = ( 1.807565e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 4.162125e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 4.162125e+06 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 1.809485e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 4.149221e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 4.149221e+06 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 4.238088 sec
10,440,172,734 cycles:u # 2.450 GHz
17,716,217,161 instructions:u # 1.70 insn per cycle
4.265166886 seconds time elapsed
TOTAL : 4.237645 sec
10,452,537,891 cycles:u # 2.453 GHz
17,716,217,606 instructions:u # 1.69 insn per cycle
4.266231955 seconds time elapsed
=Symbols in CPPProcess.o= (~sse4: 0) (avx2: 1046) (512y: 0) (512z: 0)
-------------------------------------------------------------------------
runExe /data/avalassi/GPU2020/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_ll_ll/build.avx2_d_inl0_hrd0/runTest.exe
@@ -149,14 +149,14 @@ Process = SIGMA_SM_EPEM_MUPMUM_CPP [gcc 11.2.0] [inlineHel=0
Workflow summary = CPP:DBL+CXS:CURHST+RMBHST+BRDHST/512y+CXVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
Internal loops fptype_sv = VECTOR[4] ('512y': AVX512, 256bit) [cxtype_ref=YES]
EvtsPerSec[Rmb+ME] (23) = ( 1.859258e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 4.382820e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 4.382820e+06 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 1.852234e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 4.368594e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 4.368594e+06 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 4.128689 sec
10,181,770,916 cycles:u # 2.452 GHz
17,345,021,131 instructions:u # 1.70 insn per cycle
4.155206123 seconds time elapsed
TOTAL : 4.152946 sec
10,256,752,366 cycles:u # 2.455 GHz
17,345,021,840 instructions:u # 1.69 insn per cycle
4.181108075 seconds time elapsed
=Symbols in CPPProcess.o= (~sse4: 0) (avx2: 1001) (512y: 0) (512z: 0)
-------------------------------------------------------------------------
runExe /data/avalassi/GPU2020/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_ll_ll/build.512y_d_inl0_hrd0/runTest.exe
@@ -175,14 +175,14 @@ Process = SIGMA_SM_EPEM_MUPMUM_CPP [gcc 11.2.0] [inlineHel=0
Workflow summary = CPP:DBL+CXS:CURHST+RMBHST+BRDHST/512z+CXVBRK
FP precision = DOUBLE (NaN/abnormal=0, zero=0)
Internal loops fptype_sv = VECTOR[8] ('512z': AVX512, 512bit) [cxtype_ref=YES]
EvtsPerSec[Rmb+ME] (23) = ( 1.804957e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 4.101295e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 4.101295e+06 ) sec^-1
EvtsPerSec[Rmb+ME] (23) = ( 1.800537e+06 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 4.092863e+06 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 4.092863e+06 ) sec^-1
MeanMatrixElemValue = ( 1.371706e-02 +- 3.270315e-06 ) GeV^0
TOTAL : 4.240726 sec
9,561,952,890 cycles:u # 2.242 GHz
13,940,588,312 instructions:u # 1.46 insn per cycle
4.267935638 seconds time elapsed
TOTAL : 4.253992 sec
9,583,657,299 cycles:u # 2.240 GHz
13,940,588,419 instructions:u # 1.45 insn per cycle
4.282017435 seconds time elapsed
=Symbols in CPPProcess.o= (~sse4: 0) (avx2: 266) (512y: 0) (512z: 683)
-------------------------------------------------------------------------
runExe /data/avalassi/GPU2020/madgraph4gpuX/epochX/cudacpp/ee_mumu.mad/SubProcesses/P1_ll_ll/build.512z_d_inl0_hrd0/runTest.exe
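
The old and new throughput lines in the hunks above can be compared numerically rather than by eye. The following is a minimal sketch, assuming only the `EvtsPerSec[...] (...) = ( value ) sec^-1` line format visible in these logs; the helper names `parse_throughputs` and `relative_change` are hypothetical and are not part of the repository. The sample values are the CUDA double-precision eemumu figures from the first hunk (before and after this commit).

```python
import re

# Matches lines such as:
#   EvtsPerSec[Rmb+ME] (23) = ( 5.535517e+07 ) sec^-1
# The line format is taken from the logs above; nothing else is assumed.
EVTS_RE = re.compile(
    r"EvtsPerSec\[(?P<name>[^\]]+)\]\s*\(\w+\)\s*=\s*"
    r"\(\s*(?P<value>[0-9.eE+-]+)\s*\)\s*sec\^-1"
)


def parse_throughputs(text):
    """Return {metric name: events per second} for one log section."""
    return {m.group("name"): float(m.group("value"))
            for m in EVTS_RE.finditer(text)}


def relative_change(old, new):
    """Return {metric name: (new - old) / old} for metrics present in both."""
    return {name: (new[name] - old[name]) / old[name]
            for name in old if name in new}


# Values copied from the first hunk of the eemumu double-precision log.
OLD = """\
EvtsPerSec[Rmb+ME] (23) = ( 5.535517e+07 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 5.159528e+08 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 8.722821e+08 ) sec^-1
"""
NEW = """\
EvtsPerSec[Rmb+ME] (23) = ( 5.542585e+07 ) sec^-1
EvtsPerSec[MatrixElems] (3) = ( 5.015904e+08 ) sec^-1
EvtsPerSec[MECalcOnly] (3a) = ( 8.831455e+08 ) sec^-1
"""

changes = relative_change(parse_throughputs(OLD), parse_throughputs(NEW))
for name, frac in changes.items():
    print(f"{name:12s} {100.0 * frac:+.2f}%")
```

For these sample values all three metrics move by less than 3% in either direction, which may well be ordinary run-to-run variation; the sketch makes no attempt to judge significance.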