Prof. Dr. Gerhard Wellein

Professorship for High Performance Computing

Professors

Address

Martensstraße 391058 Erlangen

Contact

Email: gerhard.wellein@fau.de

Posts

Publications

Afzal A., Hager G., Wellein G.:
Exploring metrics for analyzing dynamic behavior in MPI programs via a coupled-oscillator model
In: Parallel Computing (2026)
ISSN: 0167-8191
DOI: 10.1016/j.parco.2026.103184
URL: https://www.sciencedirect.com/science/article/abs/pii/S0167819126000025

Afzal A., Hager G., Wellein G.:
GROMACS Unplugged: How Power Capping and Frequency Shapes Performance on GPUs
31st International European Conference on Parallel and Distributed Computing (Euro-Par 2025) (Dresden, Germany, 25. August 2025 – 29. August 2025)
In: Euro-Par 2025: Parallel Processing Workshops Volume in the Springer Lecture Notes in Computer Science (LNCS) series. 2025
DOI: 10.48550/arXiv.2510.06902
Lacey D., Alappat C., Lange F., Hager G., Fehske H., Wellein G.:
Cache blocking of distributed-memory parallel matrix power kernels
In: International Journal of High Performance Computing Applications 39 (2025), p. 385-404
ISSN: 1094-3420
DOI: 10.1177/10943420251319332
Wind S., Sopa J., Truhn D., Lotfinia M., Nguyen TT., Bressem K., Adams L., Rusu M., Köstler H., Wellein G., Maier A., Tayebi Arasteh S.:
Multi-step retrieval and reasoning improves radiology question answering with large language models
In: npj Digital Medicine 8 (2025), Article No.: 790
ISSN: 2398-6352
DOI: 10.1038/s41746-025-02250-5
URL: https://www.nature.com/articles/s41746-025-02250-5

Alappat C., Thies J., Hager G., Fehske H., Wellein G.:
Algebraic temporal blocking for sparse iterative solvers on multi-core CPUs
In: International Journal of High Performance Computing Applications (2024)
ISSN: 1094-3420
DOI: 10.1177/10943420241283828
Lange F., Wellein G., Fehske H.:
Charge-order melting in the one-dimensional Edwards model
In: Physical Review Research 6 (2024), Article No.: L022007
ISSN: 2643-1564
DOI: 10.1103/PhysRevResearch.6.L022007
Laukemann J., Gruber T., Hager G., Oryspayev D., Wellein G.:
CloverLeaf on Intel Multi-Core CPUs: A Case Study in Write-Allocate Evasion
38th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2024 (San Francisco, CA, 27. May 2024 – 31. May 2024)
In: 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2024
DOI: 10.1109/IPDPS57955.2024.00038
Laukemann J., Hager G., Wellein G.:
Microarchitectural comparison and in-core modeling of state-of-the-art CPUs: Grace, Sapphire Rapids, and Genoa
SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (Atlanta, 17. November 2024 – 22. November 2024)
In: SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis, New York City: 2024
DOI: 10.1109/SCW63240.2024.00181
Owen H., Ernst D., Gruber T., Lemkuhl O., Houzeaux G., Gasparino L., Wellein G.:
Alya towards Exascale: Optimal OpenACC Performance of the Navier-Stokes Finite Element Assembly on GPUs
38th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2024 (San Francisco, 27. May 2024 – 31. May 2024)
In: 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2024
DOI: 10.1109/IPDPS57955.2024.00043

Afzal A., Hager G., Markidis S., Wellein G.:
Making applications faster by asynchronous execution: Slowing down processes or relaxing MPI collectives
In: Future Generation Computer Systems-The International Journal of Grid Computing Theory Methods and Applications (2023)
ISSN: 0167-739X
DOI: 10.1016/j.future.2023.06.017
Afzal A., Hager G., Wellein G.:
Physical Oscillator Model for Supercomputing
14th IEEE/ACM Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS23) (Denver, CO, USA, 12. November 2023 – 17. November 2023)
In: 14th IEEE/ACM Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS23) 2023
DOI: 10.1145/3624062.3625535
Afzal A., Hager G., Wellein G.:
SPEChpc 2021 Benchmarks on Ice Lake and Sapphire Rapids Infiniband Clusters: A Performance and Energy Case Study
14th IEEE/ACM Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS23) (Denver, CO, USA, 12. November 2023 – 17. November 2023)
In: 14th IEEE/ACM Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS23) 2023
DOI: 10.1145/3624062.3624197
Afzal A., Hager G., Wellein G., Markidis S.:
Exploring Techniques for the Analysis of Spontaneous Asynchronicity in MPI-Parallel Applications
14th International Conference on Parallel Processing and Applied Mathematics, PPAM 2022 (Gdansk, Poland, 11. September 2022 – 14. June 2023)
In: Wyrzykowski, R., Dongarra, J., Deelman, E., Karczewski, K. (ed.): Lecture Notes in Computer Science 2023
DOI: 10.1007/978-3-031-30442-2_12
Ernst D., Holzer M., Hager G., Knorr M., Wellein G.:
Analytical performance estimation during code generation on modern GPUs
In: Journal of Parallel and Distributed Computing 173 (2023), p. 152-167
ISSN: 0743-7315
DOI: 10.1016/j.jpdc.2022.11.003
Oikonomou E., Gruber T., Achanta RC., Höller S., Alzheimer C., Wellein G., Huth T.:
2D-dwell-time analysis with simulations of ion-channel gating using high-performance computing.
In: Biophysical Journal (2023)
ISSN: 0006-3495
DOI: 10.1016/j.bpj.2023.02.023
Ravedutti Lucio Machado R., Eitzinger J., Köstler H., Wellein G.:
MD-Bench: A Generic Proxy-App Toolbox for State-of-the-Art Molecular Dynamics Algorithms
In: Parallel Processing and Applied Mathematics. PPAM 2022., Springer, Cham, 2023, p. 321-332 (Lecture Notes in Computer Science (LNCS), Vol.13826)
ISBN: 978-3-031-30441-5
DOI: 10.1007/978-3-031-30442-2_24
Ravedutti Lucio Machado R., Eitzinger J., Laukemann J., Hager G., Köstler H., Wellein G.:
MD-Bench: A performance-focused prototyping harness for state-of-the-art short-range molecular dynamics algorithms
In: Future Generation Computer Systems-The International Journal of Grid Computing Theory Methods and Applications (2023)
ISSN: 0167-739X
DOI: 10.1016/j.future.2023.06.023
Ravedutti Lucio Machado R., Eitzinger J., Laukemann J., Hager G., Köstler H., Wellein G.:
MD-Bench: A performance-focused prototyping harness for state-of-the-art short-range molecular dynamics algorithms
In: Future Generation Computer Systems-The International Journal of Grid Computing Theory Methods and Applications 149 (2023), p. 25-38
ISSN: 0167-739X
DOI: 10.1016/j.future.2023.06.023

Afzal A., Hager G., Wellein G.:
Addressing White-box Modeling and Simulation Challenges in Parallel Computing
ACM SIGSIM-PADS ’22 (GA, Atlanta, USA, 8. June 2022 – 10. June 2022)
In: SIGSIM-PADS ’22: SIGSIM Conference on Principles of Advanced Discrete Simulation 2022
DOI: 10.1145/3518997.3534986
Afzal A., Hager G., Wellein G.:
Analytic performance model for parallel overlapping memory-bound kernels
In: Concurrency and Computation-Practice & Experience (2022)
ISSN: 1532-0626
DOI: 10.1002/cpe.6816
URL: https://onlinelibrary.wiley.com/doi/10.1002/cpe.6816
Afzal A., Hager G., Wellein G.:
The Role of Idle Waves, Desynchronization, and Bottleneck Evasion in the Performance of Parallel Programs
In: IEEE Transactions on Parallel and Distributed Systems (2022), p. 1-16
ISSN: 1045-9219
DOI: 10.1109/TPDS.2022.3221085
Alappat C., Hager G., Schenk O., Wellein G.:
Level-based Blocking for Sparse Matrices: Sparse Matrix-Power-Vector Multiplication
In: IEEE Transactions on Parallel and Distributed Systems (2022), p. 1-18
ISSN: 1045-9219
DOI: 10.1109/TPDS.2022.3223512

Address

Contact

Posts

Publications

2026

2025

2024

2023

2022