
Prof. Dr. Gerhard Wellein
Professorship for High Performance Computing
Professors
Address
Martensstraße 391058 Erlangen
Contact
Posts
FAU LMQ Research Spotlight: Faster Simulations with Smarter Data Use

Since the advent of multicore CPUs in the mid-2000s, the ability of computers to perform calculations has improved much faster…
LMQ prominently represented in the new FAU Magazine “#FAUmenschen”

The Profile Center Light.Matter.QuantumTechnologies is represented with two articles in the new magazine “#FAUmenschen” featuring Vojislav Krstić and Gerhard Wellein.…
High-performance computer: Supercomputer Helma inaugurated

Fastest AI computer at German universities Cutting-edge technology for research: The Erlangen National High-Performance Computing Center at Friedrich-Alexander-Universität Erlangen-Nürnberg (NHR@FAU)…
New Academic Year 2025/2026

We are looking forward to a new academic year filled with research, collaborations, and scientific exchange. To get the winter…
Starting signal for top AI supercomputer at FAU

Science minister Blume visits the National High-Performance Computing Center Erlangen New milestone in AI infrastructure for Bavarian researchers: At the…
Pioneering AI and HPC with a brand-new high-performance computing center at FAU

Good news at Erlangen Regional Computing Center (RRZE) and Erlangen National High-Performance Computing Center (NHR@FAU): The Budget Committee of the…
Massive expansion of high-tech infrastructure for universities in Northern Bavaria

The budget committee of the Bavarian state parliament has granted approval for the construction of the high-performance computing center at…
NHR@FAU expands its resources for AI and Deep Learning

Artificial Intelligence (AI) is currently booming as a new and exciting component in research, which makes an adequate compute infrastructure…
Installation of new supercomputers for simulation and AI

The Center for National High-Performance Computing at Friedrich-Alexander-Universität Erlangen-Nürnberg (NHR@FAU) has ordered the installation of two supercomputers. With the new…
Publications
- , , :
GROMACS Unplugged: How Power Capping and Frequency Shapes Performance on GPUs
31st International European Conference on Parallel and Distributed Computing (Euro-Par 2025) (Dresden, Germany, 25. August 2025 – 29. August 2025)
In: Euro-Par 2025: Parallel Processing Workshops Volume in the Springer Lecture Notes in Computer Science (LNCS) series. 2025
DOI: 10.48550/arXiv.2412.08792 - , , , , , :
Cache blocking of distributed-memory parallel matrix power kernels
In: International Journal of High Performance Computing Applications 39 (2025), p. 385-404
ISSN: 1094-3420
DOI: 10.1177/10943420251319332
- , , , , :
Algebraic temporal blocking for sparse iterative solvers on multi-core CPUs
In: International Journal of High Performance Computing Applications (2024)
ISSN: 1094-3420
DOI: 10.1177/10943420241283828 - , , :
Charge-order melting in the one-dimensional Edwards model
In: Physical Review Research 6 (2024), Article No.: L022007
ISSN: 2643-1564
DOI: 10.1103/PhysRevResearch.6.L022007 - , , , , :
CloverLeaf on Intel Multi-Core CPUs: A Case Study in Write-Allocate Evasion
38th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2024 (San Francisco, CA, 27. May 2024 – 31. May 2024)
In: 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2024
DOI: 10.1109/IPDPS57955.2024.00038 - , , :
Microarchitectural comparison and in-core modeling of state-of-the-art CPUs: Grace, Sapphire Rapids, and Genoa
SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (Atlanta, 17. November 2024 – 22. November 2024)
In: SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis, New York City: 2024
DOI: 10.1109/SCW63240.2024.00181 - , , , , , , :
Alya towards Exascale: Optimal OpenACC Performance of the Navier-Stokes Finite Element Assembly on GPUs
38th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2024 (San Francisco, 27. May 2024 – 31. May 2024)
In: 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2024
DOI: 10.1109/IPDPS57955.2024.00043
- , , , :
Making applications faster by asynchronous execution: Slowing down processes or relaxing MPI collectives
In: Future Generation Computer Systems-The International Journal of Grid Computing Theory Methods and Applications (2023)
ISSN: 0167-739X
DOI: 10.1016/j.future.2023.06.017 - , , :
Physical Oscillator Model for Supercomputing
14th IEEE/ACM Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS23) (Denver, CO, USA, 12. November 2023 – 17. November 2023)
In: 14th IEEE/ACM Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS23) 2023
DOI: 10.1145/3624062.3625535 - , , :
SPEChpc 2021 Benchmarks on Ice Lake and Sapphire Rapids Infiniband Clusters: A Performance and Energy Case Study
14th IEEE/ACM Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS23) (Denver, CO, USA, 12. November 2023 – 17. November 2023)
In: 14th IEEE/ACM Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS23) 2023
DOI: 10.1145/3624062.3624197 - , , , :
Exploring Techniques for the Analysis of Spontaneous Asynchronicity in MPI-Parallel Applications
14th International Conference on Parallel Processing and Applied Mathematics, PPAM 2022 (Gdansk, Poland, 11. September 2022 – 14. June 2023)
In: Wyrzykowski, R., Dongarra, J., Deelman, E., Karczewski, K. (ed.): Lecture Notes in Computer Science 2023
DOI: 10.1007/978-3-031-30442-2_12 - , , , , :
Analytical performance estimation during code generation on modern GPUs
In: Journal of Parallel and Distributed Computing 173 (2023), p. 152-167
ISSN: 0743-7315
DOI: 10.1016/j.jpdc.2022.11.003 - , , , , , , :
2D-dwell-time analysis with simulations of ion-channel gating using high-performance computing.
In: Biophysical Journal (2023)
ISSN: 0006-3495
DOI: 10.1016/j.bpj.2023.02.023 - , , , :
MD-Bench: A Generic Proxy-App Toolbox for State-of-the-Art Molecular Dynamics Algorithms
In: Parallel Processing and Applied Mathematics. PPAM 2022., Springer, Cham, 2023, p. 321-332 (Lecture Notes in Computer Science (LNCS), Vol.13826)
ISBN: 978-3-031-30441-5
DOI: 10.1007/978-3-031-30442-2_24 - , , , , , :
MD-Bench: A performance-focused prototyping harness for state-of-the-art short-range molecular dynamics algorithms
In: Future Generation Computer Systems-The International Journal of Grid Computing Theory Methods and Applications (2023)
ISSN: 0167-739X
DOI: 10.1016/j.future.2023.06.023 - , , , , , :
MD-Bench: A performance-focused prototyping harness for state-of-the-art short-range molecular dynamics algorithms
In: Future Generation Computer Systems-The International Journal of Grid Computing Theory Methods and Applications 149 (2023), p. 25-38
ISSN: 0167-739X
DOI: 10.1016/j.future.2023.06.023
- , , :
Addressing White-box Modeling and Simulation Challenges in Parallel Computing
ACM SIGSIM-PADS ’22 (GA, Atlanta, USA, 8. June 2022 – 10. June 2022)
In: SIGSIM-PADS ’22: SIGSIM Conference on Principles of Advanced Discrete Simulation 2022
DOI: 10.1145/3518997.3534986 - , , :
Analytic performance model for parallel overlapping memory-bound kernels
In: Concurrency and Computation-Practice & Experience (2022)
ISSN: 1532-0626
DOI: 10.1002/cpe.6816
URL: https://onlinelibrary.wiley.com/doi/10.1002/cpe.6816 - , , :
The Role of Idle Waves, Desynchronization, and Bottleneck Evasion in the Performance of Parallel Programs
In: IEEE Transactions on Parallel and Distributed Systems (2022), p. 1-16
ISSN: 1045-9219
DOI: 10.1109/TPDS.2022.3221085 - , , , :
Level-based Blocking for Sparse Matrices: Sparse Matrix-Power-Vector Multiplication
In: IEEE Transactions on Parallel and Distributed Systems (2022), p. 1-18
ISSN: 1045-9219
DOI: 10.1109/TPDS.2022.3223512
