2025-12-10 13:40:56

1. Benchmark Execution Summary

1.1. Session

  • Hostname: gaya

  • User: gaya

  • Time Start: 20251210T134100+0100

  • Time End: 20251211T001706+0100

1.2. Cases

  • Total: 58

  • Failures: 7

  • Runs: 1

1.3. Parametrization

Status | Hash     | resources.tasks | memory | mesh | discretization | solver | Total Time (s)
🟒    | 167af2e7 | 32  | 50   | M1 | P1 | gamg | 5603.943
🟒    | 176d3fed | 4   | 200  | M1 | P3 | gamg | 3206.041
🟒    | 17975aa7 | 64  | 400  | M3 | P1 | gamg | 6032.827
🟒    | 23cd5463 | 128 | 200  | M1 | P2 | gamg | 6090.893
🟒    | 244b6f4f | 16  | 50   | M2 | P1 | gamg | 5572.734
🟒    | 27195a9a | 2   | 50   | M2 | P1 | gamg | 1941.177
πŸ”΄    | 2e057b04 | 1   | 1200 | M2 | P3 | gamg | 1753.755
🟒    | 2fee09d5 | 64  | 50   | M2 | P1 | gamg | 5911.237
🟒    | 32beb7e3 | 1   | 200  | M2 | P2 | gamg | 1501.813
🟒    | 345604ad | 8   | 200  | M1 | P3 | gamg | 4693.128
🟒    | 39768c0a | 64  | 1200 | M2 | P3 | gamg | 37472.491
🟒    | 3f3de4ff | 8   | 50   | M1 | P1 | gamg | 3295.840
🟒    | 3fb44e10 | 4   | 200  | M1 | P2 | gamg | 3122.187
🟒    | 45824f95 | 32  | 200  | M2 | P2 | gamg | 5780.014
🟒    | 4d73eb1b | 128 | 200  | M1 | P3 | gamg | 6127.138
🟒    | 4f0f94c0 | 16  | 200  | M1 | P3 | gamg | 5213.542
🟒    | 5349842d | 384 | 400  | M3 | P1 | gamg | 38052.912
🟒    | 5604fc20 | 256 | 400  | M3 | P1 | gamg | 37157.406
🟒    | 5a008286 | 32  | 1200 | M2 | P3 | gamg | 36970.633
🟒    | 5c01ce2b | 8   | 200  | M1 | P2 | gamg | 4629.449
🟒    | 62f13b88 | 128 | 50   | M1 | P1 | gamg | 6063.780
🟒    | 6311f485 | 32  | 400  | M3 | P1 | gamg | 5899.037
🟒    | 688bd837 | 4   | 50   | M2 | P1 | gamg | 3254.202
🟒    | 6892f9ed | 16  | 50   | M1 | P1 | gamg | 4765.445
🟒    | 68d6fab9 | 64  | 1200 | M3 | P2 | gamg | 37716.317
🟒    | 693cb2e9 | 128 | 400  | M3 | P1 | gamg | 36574.423
🟒    | 69498aae | 4   | 200  | M2 | P2 | gamg | 3539.596
🟒    | 6de50075 | 8   | 200  | M2 | P2 | gamg | 4898.058
🟒    | 6f8fe105 | 2   | 50   | M1 | P1 | gamg | 1528.769
🟒    | 7785a035 | 128 | 1200 | M2 | P3 | gamg | 37757.479
🟒    | 7cd50510 | 1   | 200  | M1 | P3 | gamg | 405.949
🟒    | 850da4f0 | 1   | 50   | M1 | P1 | gamg | 55.464
🟒    | 8531d310 | 1   | 50   | M2 | P1 | gamg | 531.810
🟒    | 956d2d83 | 64  | 200  | M1 | P2 | gamg | 5847.395
πŸ”΄    | 9656e45f | 2   | 200  | M2 | P2 | gamg | 2154.733
πŸ”΄    | 98df175a | 4   | 1200 | M2 | P3 | gamg | 36465.514
🟒    | 9c0c73a6 | 128 | 200  | M2 | P2 | gamg | 6264.085
🟒    | a2abc5fb | 32  | 50   | M2 | P1 | gamg | 5684.020
🟒    | a78765f5 | 16  | 200  | M2 | P2 | gamg | 5631.239
🟒    | a9eabded | 4   | 50   | M1 | P1 | gamg | 3102.085
🟒    | af27eff5 | 256 | 1200 | M3 | P2 | gamg | 37997.657
πŸ”΄    | b02bfe9c | 8   | 1200 | M2 | P3 | gamg | 36701.205
🟒    | b4ac4784 | 1   | 200  | M1 | P2 | gamg | 128.562
🟒    | ba32da44 | 64  | 50   | M1 | P1 | gamg | 5816.028
πŸ”΄    | c19ec7ad | 2   | 1200 | M2 | P3 | gamg | 3084.710
🟒    | c8addc44 | 16  | 200  | M1 | P2 | gamg | 4801.674
🟒    | c91ae1d8 | 32  | 1200 | M3 | P2 | gamg | 37400.100
🟒    | ccdebbea | 128 | 50   | M2 | P1 | gamg | 6190.901
🟒    | d0631d6d | 64  | 200  | M1 | P3 | gamg | 5883.660
🟒    | d917cf05 | 128 | 1200 | M3 | P2 | gamg | 37901.608
🟒    | de8f4f1b | 32  | 200  | M1 | P2 | gamg | 5635.676
πŸ”΄    | e853ec5e | 2   | 200  | M1 | P3 | gamg | 1711.795
🟒    | f1d5f83f | 384 | 1200 | M3 | P2 | gamg | 38163.210
🟒    | f29397bf | 2   | 200  | M1 | P2 | gamg | 1601.889
🟒    | fa4f1061 | 64  | 200  | M2 | P2 | gamg | 5959.450
🟒    | fc15b253 | 8   | 50   | M2 | P1 | gamg | 4729.373
πŸ”΄    | fd0b630b | 16  | 1200 | M2 | P3 | gamg | 36826.805
🟒    | ff585e9d | 32  | 200  | M1 | P3 | gamg | 5683.963

2. Benchmark: Elliptic linear PDE: Thermal Bridges

2.1. Description

The "thermal bridges" benchmark is an example of an application used to validate numerical simulation tools built with Feel++. We have developed tests based on the ISO 10211:2017 standard (ISO 10211:2017 - Thermal bridges in building construction β€” Heat flows and surface temperatures β€” Detailed calculations, 2017), which provides methodologies for evaluating thermal bridges in building construction.

Thermal bridges are areas of a building envelope where the heat flow differs from that of adjacent areas, often resulting in increased heat loss or unwanted condensation. The standard is intended to ensure that thermal bridge simulations are computed accurately: it provides reference values (with tolerances) for the temperature and the heat flux at several locations of the geometry.

At the mathematical level, this application requires the numerical solution of an elliptic linear PDE (the steady heat equation). We employ a finite element method based on continuous Lagrange finite elements of order 1, 2 and 3 (denoted P1, P2 and P3), and we analyze the execution time of the main components of the simulation.
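
A minimal sketch of the underlying problem, in our own notation (boundary conditions are only indicated schematically):

  \[
    -\nabla\cdot\big(k(x)\,\nabla T\big) = 0 \quad \text{in } \Omega,
  \]

where \(k\) is the piecewise-constant thermal conductivity of the materials. The continuous \(P_k\) Lagrange discretization solves the corresponding weak form: find \(T_h \in V_h^k\) such that

  \[
    \int_\Omega k\,\nabla T_h\cdot\nabla v_h\,dx \;+\; \text{(boundary terms)} \;=\; 0
    \qquad \forall\, v_h \in V_h^k .
  \]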

Figure 4.3 shows the geometry of this benchmark and the decomposition of the domain by material.

Figure 4.3: Thermal bridges benchmark - geometry and materials

2.2. Benchmarking Tools Used

The benchmark was performed on the gaya supercomputer (see Section 10.1). The performance tools integrated into the Feel++ toolboxes framework were used to measure the execution time. Note that we used Apptainer with a Feel++ SIF image based on the Ubuntu noble OS.

This benchmark was run with feelpp.benchmarking, version 4.0.0.

The metrics measured are the execution times of the main components of the simulation, listed below; a small post-processing sketch follows the list.

  • Init: load mesh from filesystem and initialize heat toolbox (finite element context and algebraic data structure)

  • Assembly: calculate and assemble the matrix and rhs values obtained using the finite element method

  • Solve: the linear system by using a preconditioned GMRES.

  • PostProcess: compute validation measures (temperature at points and heat flux) and export on the filesystem a visualization format (EnsighGold) of the solution.

2.3. Input/Output Dataset Description

2.3.1. Input Data

  • Meshes: We have generated three levels of mesh called M1, M2 and M3. These meshes are stored in GMSH format. The statistics can be found in Table 4.6. We have also prepared for each mesh level a collection of partitioned mesh. The format used is an in-house mesh format of Feel based on JSON+HDF5 file type. The Gmsh meshes and the partitioned meshes can be found on our Girder database management, in the Feel collections.

  • Setup: Use standard setup of Feel++ toolboxes. It corresponds to a cfg file and JSON file. These config files are present in the Github of feelpp.

  • Sif image: feelpp:v0.111.0-preview.10-noble-sif (stored in the Github registry of Feel++)

Table 4.6: Mesh statistics; the P1, P2 and P3 columns give the corresponding number of degrees of freedom.

Tag | # points  | # edges  | # faces  | # elements | P1       | P2       | P3
M1  | 1.94E+05  | 1.30E+06 | 2.46E+06 | 1.06E+06   | 1.94E+05 | 1.49E+06 | 4.96E+06
M2  | 1.40E+06  | 9.78E+06 | 1.66E+07 | 1.66E+07   | 1.40E+06 | 1.12E+07 | 3.75E+07
M3  | 1.06E+07  | 7.53E+07 | 1.29E+08 | 1.29E+08   | 1.06E+07 | 8.59E+07 | 2.90E+08

2.3.2. Output Data

The output includes the computed validation measures in CSV format, exported visualization files (mesh, partitioning, temperature), and the time taken by each simulation step.
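
Because the ISO 10211 reference values come with tolerances, the validation step reduces to comparing the exported measures against those references. A minimal sketch of such a check is shown below; the CSV layout and the reference dictionary are hypothetical (they do not reproduce the ISO 10211 numbers) and only illustrate the idea.

  import csv

  # Hypothetical reference values and absolute tolerances (illustrative only).
  REFERENCES = {
      "heat_flux_alpha": (60.0, 0.6),
      "temperature_point_1": (11.1, 0.1),
  }

  def validate(path="measures.csv"):
      """Flag every exported measure that falls outside its reference tolerance."""
      failures = []
      with open(path, newline="") as f:
          for row in csv.DictReader(f):
              name, value = row["name"], float(row["value"])
              if name in REFERENCES:
                  ref, tol = REFERENCES[name]
                  if abs(value - ref) > tol:
                      failures.append((name, value, ref, tol))
      return failures

  if __name__ == "__main__":
      for name, value, ref, tol in validate():
          print(f"FAIL {name}: {value} not within {tol} of {ref}")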

2.4. Results Summary

We start by showing in Figure 4.4 an example of the numerical solution and of the mesh partitioning obtained in the simulation pipeline. Partitioning is treated here as an offline step, but it still requires noticeable time and memory; this will be described explicitly in future work. Figure 4.5 validates the simulation runs by comparing the computed measures with the reference values.

Figure 4.4: Thermal bridges benchmark - temperature solution
Figure 4.4: Thermal bridges benchmark - partitioning example

The benchmark performance results are summarized in Figure 4.6, Figure 4.7 and Figure 4.8, which correspond respectively to the meshes M1, M2 and M3. For each mesh we experimented with several finite element discretizations (P1, P2 and P3), and for each polynomial order we selected a set of CPU core counts. For the mesh M1, which is a coarse mesh, the scalability is poor, especially at low order: the problem is simply too small for that many HPC resources, and MPI communication and I/O effects are no longer negligible. For the meshes M2 and M3 the results are better (although not ideal), and the limit of the strong-scaling test is reached quickly. The finest mesh, M3, shows the best scalability in this experiment: the computational cost decreases as resources are added, although, once the execution times become short, the scaling quickly reaches its limit.
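
To make the scalability statements above precise, the usual strong-scaling metrics can be used (our notation, not taken from the report): if \(T(p)\) is the execution time on \(p\) cores and \(p_{\min}\) the smallest core count run for a given mesh and order,

  \[
    S(p) = \frac{T(p_{\min})}{T(p)}, \qquad
    E(p) = \frac{p_{\min}\,S(p)}{p},
  \]

where \(S\) is the speedup and \(E\) the parallel efficiency; \(E(p)\) close to 1 indicates good strong scaling, while the plateaus observed for M1 correspond to \(E(p)\) dropping once communication and I/O costs dominate.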

These benchmarking experiments also revealed some variability in the performance measurements. Factors such as filesystem and network load are not under our control and can explain part of this variability, especially when the local computational time becomes small.

2.5. Challenges Identified

Several challenges were encountered during the benchmarking process:

  • Memory Usage: Reduce the memory footprint

  • Parallelization Inefficiencies: Understand and improve performance when MPI communication and filesystem IO will be dominant

To conclude, we have carried out HPC performance tests of the thermal bridges benchmark. We successfully executed several simulations on significant resources and validated the Feel++ framework in the elliptic PDE context. We also validated the deployment of Feel++ with container support. The next steps are to provide more refined measurements so as to detect and analyze the causes of performance degradation, and to compare with other software installations, such as one based on Spack.

2.6. Results

2.6.1. Convergence of validation measures

Heat Flux Convergence

The validation plot for heat flux demonstrates the convergence of the solution across increasing mesh levels (refinement) for the different finite element discretizations (P1, P2, P3). The consistency of the heat flux values across discretizations validates the numerical model.
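
The behaviour expected behind such a plot is the standard a priori estimate for continuous \(P_k\) Lagrange elements, recalled here only as a reminder (it assumes a sufficiently regular solution, which material interfaces may limit in practice):

  \[
    \| T - T_h \|_{L^2(\Omega)} \le C\, h^{k+1}, \qquad
    \| T - T_h \|_{H^1(\Omega)} \le C\, h^{k},
  \]

so refining the mesh (M1 to M3) and/or raising the order (P1 to P3) should drive the computed heat fluxes and point temperatures toward the reference values, which is exactly what the validation plot checks.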

Temperature Validation

2.6.2. P1 Discretization Performance

2.6.3. P2 Discretization Performance

2.6.4. P3 Discretization Performance

2.6.5. Solver Metrics

To understand the parallel scalability, two key solver metrics are presented: the absolute execution time for the algebraic solve step and the number of iterations required by the GMRES solver. Stable or decreasing iteration counts with mesh refinement and strong scaling in solve time are essential indicators of an efficient preconditioner and solver configuration.
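
A small sketch of how these two metrics can be examined together is given below; the sample data points are placeholders, not values from this campaign.

  # Hypothetical (cores, solve_time_s, gmres_iterations) samples for one mesh/order.
  SAMPLES = [
      (1, 400.0, 35),
      (2, 210.0, 36),
      (4, 115.0, 37),
      (8, 70.0, 38),
  ]

  def solver_scaling(samples):
      """Report solve-time speedup vs. the smallest run and GMRES iteration drift."""
      base_cores, base_time, base_iters = samples[0]
      for cores, time_s, iters in samples:
          speedup = base_time / time_s
          efficiency = speedup * base_cores / cores
          print(f"{cores:>4} cores: solve={time_s:8.1f}s "
                f"speedup={speedup:5.2f} efficiency={efficiency:5.2f} "
                f"iterations={iters} (drift {iters - base_iters:+d})")

  if __name__ == "__main__":
      solver_scaling(SAMPLES)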