INFO: No output directory specified (--dir) INFO: Using directory: "Unittest_SANS_benchmark2_20251212_135234" INFO: Regenerating c-file: Unittest_SANS_benchmark2.c WARNING: Full-path code-generator "/u/data/pkwi/McStas/mcstas/3.x-dev/bin/mcstas-antlr -I$MCSTAS" not found!! WARNING: Attempting replacement by "mcstas-antlr -I$MCSTAS" DEBUG: CMD: mcstas-antlr -I$MCSTAS -t -o ./Unittest_SANS_benchmark2.c Unittest_SANS_benchmark2.instr No initialization present? ----------------------------------------------------------- Generating single GPU kernel or single CPU section layout: -> SPLIT 10 at component sample ----------------------------------------------------------- Generating GPU/CPU -DFUNNEL layout: -> GPU kernel from component a1 -> GPU kernel from component arm -> GPU kernel from component source -> GPU kernel from component coll1 -> GPU kernel from component coll2 -> GPU kernel from component sample_position -> GPU kernel from component Sample_in -> SPLIT within buffer at component sample -> GPU kernel from component detector -> GPU kernel from component detectorSANS -> GPU kernel from component detectorDB -> GPU kernel from component Ldetector -> GPU kernel from component PSDrad -> GPU kernel from component PSDrad_full ----------------------------------------------------------- DEBUG: CMD: mcstas-antlr -I$MCSTAS finished INFO: Recompiling: ./Unittest_SANS_benchmark2.out DEBUG: CMD: nvc -o ./Unittest_SANS_benchmark2.out ./Unittest_SANS_benchmark2.c -lm -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/u/data/pkwi/McStas/mcstas/3.x-dev/miniconda3/lib -Wl,-rpath-link,/u/data/pkwi/McStas/mcstas/3.x-dev/miniconda3/lib -L/u/data/pkwi/McStas/mcstas/3.x-dev/miniconda3/lib -mtune=haswell -fPIC -O2 -isystem /u/data/pkwi/McStas/mcstas/3.x-dev/miniconda3/include -DUSE_MPI -lmpi -Wl,-rpath,/opt/nvidia/hpc_sdk/Linux_x86_64/25.9/comm_libs/12.9/openmpi4/openmpi-4.1.5/lib -L,/opt/nvidia/hpc_sdk/Linux_x86_64/25.9/comm_libs/12.9/openmpi4/openmpi-4.1.5/lib -I/opt/nvidia/hpc_sdk/Linux_x86_64/25.9/comm_libs/12.9/openmpi4/openmpi-4.1.5/include -fast -Minfo=accel -acc=gpu -gpu=mem:managed -DOPENACC "./Unittest_SANS_benchmark2.c", line 94: warning: Routine "mcgenstate" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] _class_particle mcgenstate(void); ^ Remark: individual warnings can be suppressed with "--diag_suppress " "./Unittest_SANS_benchmark2.c", line 96: warning: Routine "mcsetstate" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] _class_particle mcsetstate(double x, double y, double z, double vx, double vy, double vz, double t, double sx, double sy, double sz, double p, int mcgravitation, void * mcMagnet, int allow_backprop); ^ "./Unittest_SANS_benchmark2.c", line 98: warning: Routine "mcgetstate" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] _class_particle mcgetstate(_class_particle mcparticle, double * x, double * y, double * z, double * vx, double * vy, double * vz, double * t, double * sx, double * sy, double * sz, double * p); ^ "./Unittest_SANS_benchmark2.c", line 112: warning: Routine "particle_getvar" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] double particle_getvar(_class_particle *p, char *name, int *suc); ^ "./Unittest_SANS_benchmark2.c", line 116: warning: Routine "str_comp" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] int str_comp(char *str1, char *str2); ^ "./Unittest_SANS_benchmark2.c", line 145: warning: Routine "particle_getvar_void" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] void* particle_getvar_void(_class_particle *p, char *name, int *suc); ^ "./Unittest_SANS_benchmark2.c", line 116: warning: Routine "str_comp" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] int str_comp(char *str1, char *str2); ^ "./Unittest_SANS_benchmark2.c", line 178: warning: Routine "particle_setvar_void" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] int particle_setvar_void(_class_particle *, char *, void*); ^ "./Unittest_SANS_benchmark2.c", line 204: warning: Routine "particle_setvar_void_array" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] int particle_setvar_void_array(_class_particle *, char *, void*, int); ^ "./Unittest_SANS_benchmark2.c", line 217: warning: Routine "particle_restore" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] void particle_restore(_class_particle *p, _class_particle *p0); ^ "./Unittest_SANS_benchmark2.c", line 234: warning: Routine "particle_getuservar_byid" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] double particle_getuservar_byid(_class_particle *p, int id, int *suc){ ^ "./Unittest_SANS_benchmark2.c", line 245: warning: Routine "particle_uservar_init" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] void particle_uservar_init(_class_particle *p){ ^ "./Unittest_SANS_benchmark2.c", line 313: warning: Routine "noprintf" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] int noprintf(); ^ "./Unittest_SANS_benchmark2.c", line 315: warning: Routine "str_len" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] size_t str_len(const char *s); ^ "./Unittest_SANS_benchmark2.c", line 666: warning: Routine "mcget_ncount" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] unsigned long long int mcget_ncount(void); /* wrapper to set mcncount */ ^ "./Unittest_SANS_benchmark2.c", line 887: warning: Routine "_randnorm2" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] double _randnorm2(randstate_t* state); ^ "./Unittest_SANS_benchmark2.c", line 904: warning: Routine "kiss_srandom" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] uint64_t *kiss_srandom(uint64_t state[7], uint64_t seed); ^ "./Unittest_SANS_benchmark2.c", line 906: warning: Routine "kiss_random" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] uint64_t kiss_random(uint64_t state[7]); ^ "./Unittest_SANS_benchmark2.c", line 914: warning: Routine "_rand01" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] double _rand01(randstate_t* state); ^ "./Unittest_SANS_benchmark2.c", line 916: warning: Routine "_randpm1" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] double _randpm1(randstate_t* state); ^ "./Unittest_SANS_benchmark2.c", line 918: warning: Routine "_rand0max" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] double _rand0max(double max, randstate_t* state); ^ "./Unittest_SANS_benchmark2.c", line 920: warning: Routine "_randminmax" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] double _randminmax(double min, double max, randstate_t* state); ^ "./Unittest_SANS_benchmark2.c", line 922: warning: Routine "_randtriangle" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] double _randtriangle(randstate_t* state); ^ "./Unittest_SANS_benchmark2.c", line 1018: warning: Routine "coords_set" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] Coords coords_set(MCNUM x, MCNUM y, MCNUM z); ^ "./Unittest_SANS_benchmark2.c", line 1020: warning: Routine "coords_get" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] Coords coords_get(Coords a, MCNUM *x, MCNUM *y, MCNUM *z); ^ "./Unittest_SANS_benchmark2.c", line 1022: warning: Routine "coords_add" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] Coords coords_add(Coords a, Coords b); ^ "./Unittest_SANS_benchmark2.c", line 1024: warning: Routine "coords_sub" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] Coords coords_sub(Coords a, Coords b); ^ "./Unittest_SANS_benchmark2.c", line 1026: warning: Routine "coords_neg" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] Coords coords_neg(Coords a); ^ "./Unittest_SANS_benchmark2.c", line 1028: warning: Routine "coords_scale" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] Coords coords_scale(Coords b, double scale); ^ "./Unittest_SANS_benchmark2.c", line 1030: warning: Routine "coords_sp" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] double coords_sp(Coords a, Coords b); ^ "./Unittest_SANS_benchmark2.c", line 1032: warning: Routine "coords_xp" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] Coords coords_xp(Coords b, Coords c); ^ "./Unittest_SANS_benchmark2.c", line 1034: warning: Routine "coords_len" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] double coords_len(Coords a); ^ "./Unittest_SANS_benchmark2.c", line 4645: warning: variable "num" was declared but never referenced [declared_but_not_referenced] int num = 3; ^ "./Unittest_SANS_benchmark2.c", line 6346: warning: variable "status" was set but never used [set_but_not_used] int status; ^ "/usr/include/math.h", line 30: warning: #warning directive: A macro called log was already defined when was included. [warning_directive] # warning A macro called log was already defined when was included. ^ "/usr/include/math.h", line 31: warning: #warning directive: This will cause compilation problems. [warning_directive] # warning This will cause compilation problems. ^ "./Unittest_SANS_benchmark2.c", line 7164: warning: variable "BBB" was declared but never referenced [declared_but_not_referenced] double G,P,gam,B,k,AAA,BBB,erf,QQ; ^ "./Unittest_SANS_benchmark2.c", line 7821: warning: variable "tc2" was set but never used [set_but_not_used] Coords tc1, tc2; ^ "./Unittest_SANS_benchmark2.c", line 7864: warning: variable "tc2" was set but never used [set_but_not_used] Coords tc1, tc2; ^ "./Unittest_SANS_benchmark2.c", line 9160: warning: incompatible redefinition of macro "exit" (declared at line 645) [bad_macro_redef] #define exit(...) noprintf() ^ "./Unittest_SANS_benchmark2.c", line 9172: warning: Routine "class_Progress_bar_trace" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] _class_Progress_bar *class_Progress_bar_trace(_class_Progress_bar *_comp, _class_particle *_particle) { ^ "./Unittest_SANS_benchmark2.c", line 9263: warning: Routine "class_Source_simple_trace" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] _class_Source_simple *class_Source_simple_trace(_class_Source_simple *_comp, _class_particle *_particle) { ^ "./Unittest_SANS_benchmark2.c", line 9366: warning: Routine "class_Slit_trace" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] _class_Slit *class_Slit_trace(_class_Slit *_comp, _class_particle *_particle) { ^ "./Unittest_SANS_benchmark2.c", line 9410: warning: Routine "class_PSD_monitor_trace" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] _class_PSD_monitor *class_PSD_monitor_trace(_class_PSD_monitor *_comp, _class_particle *_particle) { ^ "./Unittest_SANS_benchmark2.c", line 9481: warning: Routine "class_SANS_benchmark2_trace" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] _class_SANS_benchmark2 *class_SANS_benchmark2_trace(_class_SANS_benchmark2 *_comp, _class_particle *_particle) { ^ "./Unittest_SANS_benchmark2.c", line 9508: warning: variable "thmax" was set but never used [set_but_not_used] double qmax,qmaxl,Ymax,Xmax,thmax; ^ "./Unittest_SANS_benchmark2.c", line 9763: warning: Routine "class_L_monitor_trace" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] _class_L_monitor *class_L_monitor_trace(_class_L_monitor *_comp, _class_particle *_particle) { ^ "./Unittest_SANS_benchmark2.c", line 9836: warning: Routine "class_PSD_monitor_rad_trace" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] _class_PSD_monitor_rad *class_PSD_monitor_rad_trace(_class_PSD_monitor_rad *_comp, _class_particle *_particle) { ^ "./Unittest_SANS_benchmark2.c", line 9901: warning: Routine "raytrace" will be treated as sequential because its "routine" directive doesn't specify a level of parallelism [no_routine_type] int raytrace(_class_particle* _particle) { /* single event propagation, called by mccode_main for Unittest_SANS_benchmark2:TRACE */ ^ "./Unittest_SANS_benchmark2.c", line 11207: warning: variable "radius" was declared but never referenced [declared_but_not_referenced] double radius = 0; ^ "./Unittest_SANS_benchmark2.c", line 11208: warning: variable "h" was declared but never referenced [declared_but_not_referenced] double h = 0; ^ "./Unittest_SANS_benchmark2.c", line 11610: warning: variable "t" was declared but never referenced [declared_but_not_referenced] time_t t; ^ "./Unittest_SANS_benchmark2.c", line 11611: warning: variable "ct" was set but never used [set_but_not_used] clock_t ct; ^ "./Unittest_SANS_benchmark2.c", line 1522: warning: variable "mcstartdate" was set but never used [set_but_not_used] static long mcstartdate = 0; /* start simulation time */ ^ "./Unittest_SANS_benchmark2.c", line 2991: warning: function "strcpy_valid" was declared but never referenced [declared_but_not_referenced] static char *strcpy_valid(char *valid, char *original) ^ mcgenstate: 105, Generating acc routine seq Generating NVIDIA GPU code particle_getvar: 119, Generating acc routine seq Generating NVIDIA GPU code particle_getvar_void: 152, Generating acc routine seq Generating NVIDIA GPU code particle_setvar_void: 180, Generating acc routine seq Generating NVIDIA GPU code particle_setvar_void_array: 206, Generating acc routine seq Generating NVIDIA GPU code particle_restore: 218, Generating acc routine seq Generating NVIDIA GPU code particle_getuservar_byid: 234, Generating acc routine seq Generating NVIDIA GPU code particle_uservar_init: 245, Generating acc routine seq Generating NVIDIA GPU code noprintf: 1570, Generating acc routine seq Generating NVIDIA GPU code str_comp: 1574, Generating acc routine seq Generating NVIDIA GPU code str_len: 1583, Generating acc routine seq Generating NVIDIA GPU code mcget_ncount: 4263, Generating acc routine seq Generating NVIDIA GPU code coords_set: 4697, Generating acc routine seq Generating NVIDIA GPU code coords_get: 4708, Generating acc routine seq Generating NVIDIA GPU code coords_add: 4717, Generating acc routine seq Generating NVIDIA GPU code coords_sub: 4729, Generating acc routine seq Generating NVIDIA GPU code coords_neg: 4741, Generating acc routine seq Generating NVIDIA GPU code coords_scale: 4751, Generating acc routine seq Generating NVIDIA GPU code coords_sp: 4761, Generating acc routine seq Generating NVIDIA GPU code coords_xp: 4769, Generating acc routine seq Generating NVIDIA GPU code coords_len: 4779, Generating acc routine seq Generating NVIDIA GPU code coords_print: 4801, Generating acc routine seq Generating NVIDIA GPU code coords_norm: 4808, Generating acc routine seq Generating NVIDIA GPU code rot_set_rotation: 4854, Generating acc routine seq Generating NVIDIA GPU code rot_test_identity: 4889, Generating acc routine seq Generating NVIDIA GPU code rot_mul: 4900, Generating acc routine seq Generating NVIDIA GPU code rot_copy: 4917, Generating acc routine seq Generating NVIDIA GPU code rot_transpose: 4928, Generating acc routine seq Generating NVIDIA GPU code rot_apply: 4944, Generating acc routine seq Generating NVIDIA GPU code vec_prod_func: 4973, Generating acc routine seq Generating NVIDIA GPU code scalar_prod: 4984, Generating acc routine seq Generating NVIDIA GPU code norm_func: 4988, Generating acc routine seq Generating NVIDIA GPU code mccoordschange: 5165, Generating acc routine seq Generating NVIDIA GPU code mccoordschange_polarisation: 5196, Generating acc routine seq Generating NVIDIA GPU code normal_vec: 5213, Generating acc routine seq Generating NVIDIA GPU code solve_2nd_order: 5312, Generating acc routine seq Generating NVIDIA GPU code _randvec_target_circle: 5396, Generating acc routine seq Generating NVIDIA GPU code _randvec_target_rect_angular: 5462, Generating acc routine seq Generating NVIDIA GPU code _randvec_target_rect_real: 5539, Generating acc routine seq Generating NVIDIA GPU code kiss_srandom: 5811, Generating acc routine seq Generating NVIDIA GPU code kiss_random: 5823, Generating acc routine seq Generating NVIDIA GPU code _hash: 5855, Generating acc routine seq Generating NVIDIA GPU code _randnorm2: 5896, Generating acc routine seq Generating NVIDIA GPU code _randtriangle: 5907, Generating acc routine seq Generating NVIDIA GPU code _rand01: 5912, Generating acc routine seq Generating NVIDIA GPU code _randpm1: 5920, Generating acc routine seq Generating NVIDIA GPU code _rand0max: 5928, Generating acc routine seq Generating NVIDIA GPU code _randminmax: 5935, Generating acc routine seq Generating NVIDIA GPU code mcsetstate: 6613, Generating acc routine seq Generating NVIDIA GPU code mcgetstate: 6653, Generating acc routine seq Generating NVIDIA GPU code inside_rectangle: 6713, Generating acc routine seq Generating NVIDIA GPU code box_intersect: 6730, Generating acc routine seq Generating NVIDIA GPU code cylinder_intersect: 6845, Generating acc routine seq Generating NVIDIA GPU code sphere_intersect: 6900, Generating acc routine seq Generating NVIDIA GPU code sb_min: 7027, Generating implicit acc routine seq Generating implicit acc routine seq Generating NVIDIA GPU code sb_max: 7031, Generating implicit acc routine seq Generating implicit acc routine seq Generating NVIDIA GPU code sb_imin: 7035, Generating implicit acc routine seq Generating implicit acc routine seq Generating NVIDIA GPU code sb_imax: 7039, Generating implicit acc routine seq Generating implicit acc routine seq Generating NVIDIA GPU code gamma: 7044, Generating implicit acc routine seq Generating implicit acc routine seq Generating NVIDIA GPU code errf: 7092, Generating implicit acc routine seq Generating implicit acc routine seq Generating NVIDIA GPU code J1: 7124, Generating implicit acc routine seq Generating implicit acc routine seq Generating NVIDIA GPU code dSigdW: 7153, Generating implicit acc routine seq Generating implicit acc routine seq Generating NVIDIA GPU code init: 9131, Generating update device(_Sample_in_var,_a1_var,_source_var,_sample_var,_arm_var,_coll1_var,_instrument_var,_PSDrad_var,_sample_position_var,_detectorSANS_var,_detectorDB_var,_coll2_var,_PSDrad_full_var,_Ldetector_var,_detector_var) class_Progress_bar_trace: 9172, Generating acc routine seq Generating NVIDIA GPU code class_Source_simple_trace: 9263, Generating acc routine seq Generating NVIDIA GPU code class_Slit_trace: 9366, Generating acc routine seq Generating NVIDIA GPU code class_PSD_monitor_trace: 9410, Generating acc routine seq Generating NVIDIA GPU code class_SANS_benchmark2_trace: 9481, Generating acc routine seq Generating NVIDIA GPU code class_L_monitor_trace: 9763, Generating acc routine seq Generating NVIDIA GPU code class_PSD_monitor_rad_trace: 9836, Generating acc routine seq Generating NVIDIA GPU code raytrace: 9901, Generating acc routine seq Generating NVIDIA GPU code raytrace_all: 10237, Generating implicit firstprivate(gpu_innerloop) Generating NVIDIA GPU code 10251, #pragma acc loop gang(numgangs), vector(vecsize) /* blockIdx.x threadIdx.x */ 10237, Local memory used for particleN 10251, Generating implicit firstprivate(mpi_node_rank,_particle,seed,ncount) finally: 10990, Generating update self(_Sample_in_var,_a1_var,_sample_var,_source_var,_arm_var,_coll1_var,_instrument_var,_PSDrad_var,_sample_position_var,_detectorSANS_var,_detectorDB_var,_coll2_var,_PSDrad_full_var,_Ldetector_var,_detector_var) mcenabletrace: 6055, Generating update device(mcdotrace) DEBUG: CMD: nvc finished INFO: ===