Publications
Conference Papers
- [Cluster'24] Xiang Fu, Shiman Meng, Weiping Zhang, Luanzheng Guo, Kento Sato, Dong H. Ahn, Ignacio Laguna, Gregory L. Lee, Martin Schulz. Distributed Order Recording Techniques for Efficient Record-and-Replay of Multi-threaded Programs. IEEE International Conference on Cluster Computing (Cluster'24), Sep 24-27, Kobe, Japan (Nominated for Best Paper Award).
- [Cluster'24] Bo Fang, Xinyi Li, Harvey Dam, Cheng Tan, Siva Kumar Sastry Hari, Timothy Tsai, Ignacio Laguna, Dingwen Tao, Ganesh Gopalakrishnan, Prashant J. Nair, Kevin Barker, Ang Li. Understanding Mixed Precision GEMM with MPGemmFI: Insights into Fault Resilience. IEEE International Conference on Cluster Computing (Cluster'24), Sep 24-27, Kobe, Japan.
- [HPDC'24] Anh Tran, Ignacio Laguna, Ganesh Gopalakrishnan. FPBOXer: Efficient Input-Generation for Targeting Floating-Point Exceptions in GPU Programs. 33rd International Symposium on High-Performance Parallel and Distributed Computing (HPDC'24), 2024, Pisa, Italy.
- [ICS'24] Dolores Miao, Ignacio Laguna, Cindy Rubio-González. Input Range Generation for Compiler-Induced Numerical Inconsistencies. ICS 2024: International Conference on Supercomputing, June 4-7, 2024, Kyoto, Japan.
- [CCGrid'24] Xinyi Li, Ang Li, Bo Fang, Katarzyna Swirydowicz, Ignacio Laguna, Ganesh Gopalakrishnan. FTTN: Feature-Targeted Testing for Numerical Properties of NVIDIA & AMD Matrix Accelerators. The 24th IEEE/ACM international Symposium on Cluster, Cloud and Internet Computing.
- [CGO'24] Alnis Murtovi, Giorgis Georgakoudis, Konstantinos Parasyris, Chunhua Liao, Ignacio Laguna, and Bernhard Steffen. Enhancing Performance through Control-Flow Unmerging and Loop Unrolling on GPUs. 2024 IEEE/ACM International Symposium on Code Generation and Optimization (CGO).
- [SC'23] Konstantinos Parasyris, Giorgis Georgakoudis, Esteban Rangel, Ignacio Laguna, Johannes Doerfert. Scalable Tuning of (OpenMP) GPU Applications via Kernel Record and Replay. International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Denver, 2023 (Nominated for Best Paper Award).
- [ISC'23] Dolores Miao, Ignacio Laguna, and Cindy Rubio-González. Expression Isolation of Compiler-Induced Numerical Inconsistencies in Heterogeneous Code. In International Conference on High Performance Computing (ISC), Germany, May 22, 2023 (Best Paper Award).
- [HPDC'23] Xinyi Li, Ignacio Laguna, Bo Fang, Katarzyna Swirydowicz, Ang Li, Ganesh Gopalakrishnan. Design and Evaluation of GPU-FPX: A Low-Overhead tool for Floating-Point Exception Detection in NVIDIA GPUs. The 32nd International Symposium on High-Performance Parallel and Distributed Computing. Orlando, Florida, United States, June 20 - 23, 2023.
- [SC'22] Ignacio Laguna, Ganesh Gopalakrishnan. Finding Inputs that Trigger Floating-Point Exceptions in GPUs via Bayesian Optimization. The International Conference for High Performance Computing, Networking, Storage and Analysis (SC22), Dallas, TX, USA, Nov 13-18, 2022.
- [SC'22] Konstantinos Parasyris, James Diffenderfer, Harshitha Menon, Ignacio Laguna, Jackson Vanover, Ryan Vogt, Daniel Osei-Kuffuor. Approximate Computing Through the Lens of Uncertainty Quantification. The International Conference for High Performance Computing, Networking, Storage and Analysis (SC22), Dallas, TX, USA, Nov 13-18, 2022.
- [IISWC'22] Ignacio Laguna, Tanmay Tirpankar, Xinyi Li, Ganesh Gopalakrishnan. FPChecker: Floating-Point Exception Detection Tool and Benchmark for Parallel and Distributed HPC. 2022 IEEE International Symposium on Workload Characterization, November 6 - November 8, 2022, Austin, Texas, USA.
- [SC'21] Konstantinos Parasyris, Giorgis Georgakoudis, Harshitha Menon, James Diffenderfer, Ignacio Laguna, Daniel Osei-Kuffuor, Markus Schordan. HPAC: evaluating approximate computing techniques on HPC OpenMP applications. The International Conference for High Performance Computing, Networking, Storage and Analysis (SC21), St Louis, MO, USA, Nov 14-19, 2021 (Best Reproducibility Paper Award).
- [DSN'21] Amir Taherin, Tirthak Patel, Devesh Tiwari, Giorgis Georgakoudis, Ignacio Laguna. Examining Failures and Repairs on Supercomputers with Multi-GPU Compute Nodes. The IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), virtual event, 2021.
- [CCGrid'21] Konstantinos Parasyris, Giorgis Georgakoudis, Leonardo Bautista-Gomez, Ignacio Laguna. Co-Designing Multi-Level Checkpoint Restart for MPI Applications. The 21th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, May 10-13, 2021, Melbourne, Victoria, Australia.
- [SC'20] Bradley Swain, Yanze Li, Peiming Liu, Ignacio Laguna, Giorgis Georgakoudis, Jeff Huang. OMPRacer: Fast, Precise, and Scalable Static Race Detection for OpenMP Programs. ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Virtual, Nov 16-19, 2020.
- [SC'20] Hui Guo, Ignacio Laguna, Cindy Rubio-González. pLiner: Isolating Lines of Floating-Point Code for Compiler Induced Variability. ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Virtual, Nov 16-19, 2020.
- [IISWC'20] Konstantinos Parasyris, Ignacio Laguna, Harshitha Menon, Markus Schordan, Daniel Osei-Kuffuor, Giorgis Georgakoudis, Mike Lam, Tristan Vanderbruggen. HPC-MixPBench: An HPC Benchmark Suite for Mixed Precision Analysis. 2020 IEEE International Symposium on Workload Characterization.
- [IISWC'20] Luanzheng Guo, Giorgis Georgakoudis, Konstantinos Parasyris, Ignacio Laguna, Dong Li. MATCH: An MPI Fault Tolerance Benchmark Suite. 2020 IEEE International Symposium on Workload Characterization.
- [ISC'20] Giorgis Georgakoudis, Luanzheng Guo, Ignacio Laguna. Reinit++: Evaluating the Performance of Global-Restart Recovery Methods For MPI Fault Tolerance. ISC High Performance, Frankfurt, Germany, Jun 22-24, 2020.
- [IPDPS'20] Ignacio Laguna. Varity: Quantifying Floating-Point Variations in HPC Systems Through Randomized Testing. 34th IEEE International Parallel & Distributed Processing Symposium (IPDPS), New Orleans, May 18-22, 2020.
- [PPoPP'20] Daniel DeFreez, Antara Bhowmick, Ignacio Laguna, Cindy Rubio-González. Detecting and Reproducing Error-Code Propagation Bugs in MPI Implementations. ACM Principles and Practice of Parallel Programming (PPoPP), San Diego, Feb 22-26, 2020.
- [SC'19] Ignacio Laguna, Ryan Marshall, Kathryn Mohror, Martin Ruefenacht, Anthony Skjellum, Nawrin Sultana. A Large-Scale Study of MPI Usage in Open-source HPC Applications. ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Denver, Colorado, Nov 17-19, 2019.
- [ISC'19] Ignacio Laguna, Paul C. Wood, Ranvijay Singh, Saurabh Bagchi. GPUMixer: Performance-Driven Floating-Point Tuning for GPU Scientific Applications. ISC High Performance, Frankfurt, Germany, Jun 16-20, 2019 (Best Paper Award).
- [ICS'19] Pradeep Kotipalli, Ranvijay Singh, Paul Wood, Ignacio Laguna, and Saurabh Bagchi. AMPT-GA: Automatic Mixed Precision Floating Point Tuning for GPU Applications. 33rd ACM International Conference on Supercomputing (ICS), pp. 1-11, Jun 26-28, Phoenix, AZ.
- [HPDC'19] Michael Bentley, Ian Briggs, Ganesh Gopalakrishnan,Dong H. Ahn, Ignacio Laguna, Gregory L. Lee, Holger E. Jones. Multi-level Analysis of Compiler-Induced Variability and Performance Tradeoffs. The 28th International Symposium on High-Performance Parallel and Distributed Computing, Phoenix, Arizona, USA - June 24-28, 2019.
- [IPDPS'19] Giorgis Georgakoudis, Ignacio Laguna, Hans Vandierendonck, Dimitrios S. Nikolopoulos, Martin Schulz. SAFIRE: Scalable and Accurate Fault Injection For Parallel Multithreaded Applications. 33rd IEEE International Parallel & Distributed Processing Symposium (IPDPS), Rio de Janeiro, Brazil, May 20-24, 2019.
- [ASE'19] Ignacio Laguna. FPChecker: Detecting Floating-Point Exceptions in GPU Applications. 34th IEEE/ACM International Conference on Automated Software Engineering (ASE), San Diego, CA, 2019.
- [CARLA'19] Anthony Skjellum, Martin Rüfenacht, Nawrin Sultana, Derek Schafer, Ignacio Laguna, Kathryn Mohror. ExaMPI: A Modern Design and Implementation to Accelerate Message Passing Interface Innovation. 6th Latin American Conference on High Performance Computing (CARLA), Costa Rica, Sep 25–27, 2019.
- [SC'18] Luanzheng Guo, Dong Li, Ignacio Laguna, Martin Schulz. FlipTracker: Understanding Natural Error Resilience in HPC Applications. ACM/IEEE Conference for High Performance Computing, Networking, Storage and Analysis (SC), Dallas, TX, 2018.
- [EuroMPI'18] Nawrin Sultana, Anthony Skjellum, Ignacio Laguna, Matthew Shane Farmer, Kathryn Mohror and Murali Emani. MPI Stages: Checkpointing MPI State for Bulk Synchronous Applications. In Proceedings of the 25th European MPI Users Group Meeting (EuroMPI), Barcelona, Spain, Sep. 23-26, 2018.
- [IPDPS'18] Simone Atzeni, Ganesh Gopalakrishnan, Zvonimir Rakamaric, Ignacio Laguna, Gregory L Lee, Dong H Ahn. SWORD: A Bounded Memory-Overhead Detector of OpenMP Data Races in Production Runs. The The 32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS), May, Vancouver, Canada, 2018.
- [SC'17] Giorgis Georgakoudis, Ignacio Laguna, Dimitrios S. Nikolopoulos, Martin Schulz. REFINE: Realistic Fault Injection via Compiler-Based Instrumentation for Accuracy, Portability and Speed. ACM/IEEE Conference for High Performance Computing, Networking, Storage and Analysis (SC), Denver, CO, 2017.
- [PPoPP'17] Sato, Kento, Dong H. Ahn, Ignacio Laguna, Gregory L. Lee, Martin Schulz, and Christopher M Chambreau. Noise Injection Techniques to Expose Subtle and Unintended Message Races. In Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), Austin, Texas, USA, Feb, 2017.
- [IPDPS'17] David Beckingsale, Olga Pearce, Ignacio Laguna, and Todd Gamblin. Apollo: Reusable Models for Fast, Dynamic Tuning of Input-Dependent Code. In The 31th IEEE International Parallel and Distributed Processing Symposium (IPDPS), May, Orlando, Florida, USA, 2017.
- [SC'16] Ignacio Laguna, Martin Schulz. Pinpointing Scale-Dependent Integer Overflow Bugs in Large-Scale Parallel Applications. ACM/IEEE Conference for High Performance Computing, Networking, Storage and Analysis (SC), Salt Lake City, 2016.
- [IPDPS'16] Simone Atzeni, Ganesh Gopalakrishnan, Zvonimir Rakamaric, Dong H. Ahn, Ignacio Laguna, Martin Schulz, Gregory L. Lee, Joachim Protze, Matthias S. Muller. ARCHER: Effectively Spotting Data Races in Large OpenMP Applications. In 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Chicago, May 23-27, 2016.
- [CGO'16] Ignacio Laguna, Martin Schulz, David F. Richards, Jon Calhoun, Luke Olson. IPAS: Intelligent Protection Against Silent Output Corruption in Scientific Applications. In the 14th IEEE/ACM International Symposium on Code Generation and Optimization (CGO), Barcelona, March 12-18, 2016.
- [SC'15] Kento Sato, Dong H. Ahn, Ignacio Laguna, Gregory L. Lee, Martin Schulz. Clock Delta Compression for Scalable Order-Replay of Non-Deterministic Parallel Applications. In the ACM/IEEE Conference for High Performance Computing, Networking, Storage and Analysis (SC), Austin, Texas, Nov, 2015.
- [ICCS'15] A. Chien, P. Balaji, P. Beckman, N. Dun, A. Fang, H. Fujita, K. Iskra, Z. Rubenstein, Z. Zheng, R. Schreiber, J. Hammond, J. Dinan, I. Laguna, D. Richards, A. Dubey, B. van Straalen, M. Hoemmen, M. Heroux, K. Teranishi, A. Siegel. Versioned Distributed Arrays for Resilience in Scientific Applications: Global View Resilience. In the International Conference On Computational Science (ICCS), Reykjavik, Iceland, June 1-3, 2015.
- [EuroMPI'14] Ignacio Laguna, David F. Richards, Todd Gamblin, Martin Schulz, Bronis R. de Supinski. Evaluating User-Level Fault Tolerance for MPI Applications. In EuroMPI/ASIA, Kyoto, Japan, Sep 9-12, 2014.
- [PLDI'14] Subrata Mitra, Ignacio Laguna, Dong H. Ahn, Saurabh Bagchi, Martin Schulz, and Todd Gamblin. Accurate Application Progress Analysis for Large-Scale Parallel Debugging. In ACM International Symposium on Programming Language Design and Implementation (PLDI), Edinburgh, UK, June 9-11, 2014.
- [SRDS'13] Ignacio Laguna, Subrata Mitra, Fahad A Arshad, Nawanol Theera-Ampornpunt, Zongyang Zhu, Saurabh Bagchi, Samuel P Midkiff, Mike Kistler, Ahmed Gheith. Automatic Problem Localization via Multidimensional Metric Profiling. In IEEE 32nd International Symposium on Reliable Distributed Systems (SRDS), Braga, Portugal, Sep-Oct, 2013.
- [PACT'12] Ignacio Laguna, Dong H. Ahn, Bronis R. de Supinski, Saurabh Bagchi, Todd Gamblin. Probabilistic Diagnosis of Performance Faults in Large-Scale Parallel Applications. In International Conference on Parallel Architectures and Compilation Techniques (PACT), Minneapolis, MN, Sep, 2012.
- [DSN'12] Greg Bronevetsky, Ignacio Laguna, Saurabh Bagchi and Bronis R. de Supinski. Characterization via Abnormality-Enhanced Classification. In IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), Boston, Massachusetts, Jun, 2012.
- [SC'11] Ignacio Laguna, Todd Gamblin, Bronis R. de Supinski, Saurabh Bagchi, Greg Bronevetsky, Dong H. Ahn, Martin Schulz, Barry Rountree. Large Scale Debugging of Parallel Tasks with AutomaDeD. In ACM/IEEE Supercomputing (SC), Seattle, WA, Nov 2011.
- [DSN'10] Greg Bronevetsky, Ignacio Laguna, Surabh Bagchi, Bronis R. de Supinski, Dong H. Ahn, Martin Schulz. AutomaDeD: Automata-Based Debugging for Dissimilar Parallel Tasks. In IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), Chicago Illinois, Jun-Jul, 2010.
- [Middleware'09] Ignacio Laguna, Fahad A. Arshad, David M. Grothe, Saurabh Bagchi. How To Keep Your Head Above Water While Detecting Errors. In ACM/IFIP/USENIX 10th International Middleware Conference (Middleware), UIUC Illinois, Nov-Dec 2009.
- [SC'09] Dong H. Ahn, Bronis R. de Supinski, Ignacio. Laguna, Greg L. Lee, Ben Liblit, Barton P. Miller, and Martin Schulz. Scalable Temporal Order Analysis for Large Scale Debugging. In ACM/IEEE Supercomputing (SC), Portland, OR, Nov 2009.
- [SRDS'07] Gunjan Khanna, Ignacio Laguna, Fahad A. Arshad and Saurabh Bagchi. Distributed Diagnosis of Failures in a Three Tier E-Commerce System. In IEEE Symposium on Reliable Distributed Systems (SRDS), Beijing, China, Oct 2007.
- [SRDS'07] Gunjan Khanna, Ignacio Laguna, Fahad A. Arshad and Saurabh Bagchi. Stateful Detection in High Throughput Distributed Systems. In IEEE Symposium on Reliable Distributed Systems (SRDS), Beijing, China, Oct 2007.
Journal Papers
- [PARCO] D. Miao, I. Laguna, G. Georgakoudis, K. Parasyris, and C. Rubio-González. An Automated OpenMP Mutation Testing Framework for Performance Optimization. Journal of Parallel Computing (PARCO), 2024.
- [IJHPCA] David E Bernholdt, George Bosilca, Aurelien Bouteiller, Ron Brightwell, Jan Ciesko, Matthew GF Dosanjh, Giorgis Georgakoudis, Ignacio Laguna, Scott Levy, Thomas Naughton, Stephen L Olivier, Howard P Pritchard, Whit Schonbein, Joseph Schuchart, Amir Shehata. Taking the MPI standard and the open MPI library to exascale. The International Journal of High Performance Computing Applications, 2024.
- [PARCO] Ignacio Laguna, Anh Tran, Ganesh Gopalakrishnan. Finding Inputs that Trigger Floating-Point Exceptions in Heterogeneous Computing via Bayesian Optimization. Parallel Computing, Volume 117, September 2023.
- [ITP] Harshitha Menon, James Diffenderfer, Giorgis Georgakoudis, Ignacio Laguna, Michael O. Lam, Daniel Osei-Kuffuor, Konstantinos Parasyris, and Jackson Vanover. Approximate High-Performance Computing: A Fast and Energy-Efficient Computing Paradigm in the Post-Moore Era. IT Professional (Volume: 25, Issue: 2, March-April 2023).
- [CSE] William Godoy, Ritu Arora, Keith Beattie, David E. Bernholdt, Sarah E. Bratt, Daniel S. Katz, Ignacio Laguna et al.. Giving RSEs a Larger Stage through the Better Scientific Software Fellowship. Computing in Science & Engineering, 2023..
- [CACM] Dong H. Ahn, Allison H. Baker, Michael Bentley, Ian Briggs, Ganesh Gopalakrishnan, Dorit M. Hammerling, Ignacio Laguna, Gregory L. Lee, Daniel J. Milroy, Mariana Vertenstein. Keeping Science on Keel When Software Moves. Communications of the ACM, no. 2, 2021, 66-74.
- [JPDC] Luanzheng Guo, Ignacio Laguna, Dong Li. PARIS: Predicting Application Resilience Using Machine Learning. Journal of Parallel and Distributed Computing, Volume 152, June 2021, Pages 111-124.
- [CCPE] Nawrin Sultana, Martin Rüfenacht, Anthony Skjellum, Purushotham Bangalore, Ignacio Laguna, and Kathryn Mohror. Understanding the Use of MPI in Exascale Proxy Applications. Concurrency and Computation: Practice and Experience, Wiley.
- [TPDS] Shinobu Miwa, Ignacio Laguna, Martin Schulz. PredCom: A Predictive Approach to Collecting Approximated Communication Traces. IEEE Transactions on Parallel & Distributed Systems.
- [ParCo] Nawrin Sultana, Martin Rüfenacht, Anthony Skjellum, Ignacio Laguna, and Kathryn Mohror. Failure recovery for bulk synchronous applications with MPI stages. Parallel Computing, Volume 84, May 2019, Pages 1-14.
- [IJHPCA] Sato, Kento, Ignacio Laguna, Gregory L Lee, Martin Schulz, Christopher M Chambreau, Simone Atzeni, Michael Bentley, et al.. Pruners: Providing reproducibility for uncovering non-deterministic errors in runs on supercomputers. The International Journal of High Performance Computing Applications, Vol 33, Issue 5, 2019.
- [CCPE] Sourav Chakraborty, Ignacio Laguna, Murali Emani, Kathryn Mohror, Dhabaleswar K. Panda, Martin Schulz, Hari Subramoni. EReinit: Scalable and Efficient Fault Tolerance for Bulk-Synchronous MPI Applications. Concurrency and Computation: Practice and Experience, Wiley, Volume 32, Issue 3, 2020.
- [IJHPCA] A. Chien, P. Balaji, N. Dun, A. Fang, H. Fujita, K. Iskra, Z. Rubenstein, Z. Zheng, J. Hammond, I. Laguna, D. Richards, A. Dubey, B. van Straalen, M. Hoemmen, M. Heroux, K. Teranishi, A. Siegel. Exploring versioned distributed arrays for resilience in scientific applications: global view resilience. The International Journal of High Performance Computing Applications (IJHPCA), 31, no. 6 (2017): 564-590..
- [IJHPCA] Ignacio Laguna, David F. Richards, Todd Gamblin, Martin Schulz, Bronis R. de Supinski, Kathryn Mohror, and Howard Pritchard. Evaluating and Extending User-Level Fault Tolerance in MPI. The International Journal of High Performance Computing Applications (IJHPCA), vol. 30, num. 3, pp. 305-319, Sep, 2016.
- [CACM] Ignacio Laguna, Dong H. Ahn, Bronis R. de Supinski, Todd Gamblin, Gregory L. Lee, Martin Schulz, Saurabh Bagchi, Milind Kulkarni, Bowen Zhou, Zhezhe Chen, and Feng Qin. Debugging high-performance computing applications at massive scales. In Communications of the ACM, September, 2015.
- [TPDS] Ignacio Laguna, Dong Ahn, Bronis de Supinski, Saurabh Bagchi, and Todd Gamblin. Diagnosis of Performance Faults in Large Scale MPI Applications via Probabilistic Progress-Dependence Inference. IEEE Transactions on Parallel and Distributed Systems (TPDS), vol. 26, no. 5, pp. 1280-1289, May 2015.
- [CSE] Martin Schulz, Jim Belak, Abhinav Bhatele, Peer-Timo Bremer, Greg Bronevetsky, Marc Casas, Todd Gamblin, Katherine E. Isaacs, Ignacio Laguna, Joshua Levine, Valerio Pascucci, David Richards, Barry Rountree. Performance analysis techniques for the exascale co-design process. Parallel Computing: Accelerating Computational Science and Engineering (CSE), vol. 25, pag. 19, 2014, IOS Press.
Workshop Papers
- [HPCTEST] Ignacio Laguna, Patrick Chapman, Konstantinos Parasyris, Giorgis Georgakoudis, Cindy Rubio-González. Testing the Unknown: A Framework for OpenMP Testing via Random Program Generation. Second International Workshop on HPC Testing and Evaluation of Systems, Tools, and Software @ SC24, November 2024..
- [HPCTEST] Anwar Hossain Zahid, Ignacio Laguna, Wei Le. Testing GPU Numerics: Finding Numerical Differences Between NVIDIA and AMD GPUs. Second International Workshop on HPC Testing and Evaluation of Systems, Tools, and Software @ SC24, November 2024..
- [PMAM] Dolores Miao, Ignacio Laguna, Giorgis Georgakoudis, Konstantinos Parasyris, Cindy Rubio-Gonzalez. MUPPET: Optimizing Performance in OpenMP via Mutation Testing. PMAM '24: Proceedings of the 15th International Workshop on Programming Models and Applications for Multicores and Manycores, March 2024..
- [WOCC] Luanzheng Guo, Jay Lofstead, Jie Ren, Ignacio Laguna, Gokcen Kestor, Line Pouchard, Dossay Oryspayev, Hyeran Jeon. Understanding System Resilience for Converged Computing of Cloud, Edge, and HPC. International Workshop on Converged Computing (WOCC), @ISC, Germany, May 25, 2023..
- [XLOOP] Dorra Ben Khalifa, Xinyi Li, Ignacio Laguna, Matthieu Martel, Ganesh Gopalakrishnan. Toward Increasing Trust in Exascale Simulations. The 4th Annual Workshop on Extreme-Scale Experiment-in-the-Loop Computing, at SC22, Dallas, TX, Nov 13-18, 2022.
- [P3HPC] Konstantinos Parasyris, Giorgis Georgakoudis, Johannes Doerfert, Ignacio Laguna, Tom Scogland. Piper: Pipelining OpenMP Offloading Execution through Compiler Optimization for Performance. 2022 International Workshop on Performance Portability and Productivity (P3HPC), at SC22, Dallas, TX, Nov 13-18, 2022.
- [FTXS] Bo Fang, Siva Hari, Timothy Tsai, Xinyi Li, Ganesh Gopalakrishnan, Ignacio Laguna, Kevin Barker, Ang Li. Toward Precision-Aware Fault Tolerance Approaches for Mixed-Precision Applications. 12th Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS 2022), at SC22, Dallas, TX, Nov 13-18, 2022.
- [IWOMP] Giorgis Georgakoudis, Johannes Doerfert, Ignacio Laguna and Tom Scogland. FAROS: A Framework To Analyze OpenMP Compilation Through Benchmarking and Compiler Optimization Analysis. In the International Workshop on OpenMP (IWOMP), Sep 21-24, 2020 (Best Paper Award).
- [SOAP] Ignacio Laguna, Xinyi Li, Ganesh Gopalakrishnan. BinFPE: accurate floating-point exception detection for GPU applications. SOAP '22: 11th ACM SIGPLAN International Workshop on the State Of the Art in Program Analysis, San Diego, CA, USA, 14 June 2022.
- [ROSS] Stephen Herbein, David Domyancic, Paul Minner, Ignacio Laguna, Rafael Ferreira da Silva, and Dong H. Ahn. MCEM: Multi-Level Cooperative Exception Model for HPC Workflows. In Proceedings of the 9th International Workshop on Runtime and Operating Systems for Supercomputers (ROSS '19). ACM, New York, NY, USA, 27-32.
- [ScalA] Ranvijay Singh, Paul Wood, Ravi Gupta, Saurabh Bagchi, and Ignacio Laguna. Snowpack: efficient parameter choice for GPU kernels via static analysis and statistical prediction. In Proceedings of the 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems (ScalA ’17), @SC17, Denver, CO, 2017.
- [FTXS] Ayush Patwari, Ignacio Laguna, Martin Schulz, Saurabh Bagchi. Understanding the Spatial Characteristics of DRAM Errors in HPC Clusters. The 7th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS) @HPDC, Washington, D.C., USA, Jun, 2017.
- [IWOMP] Joachim Protze, Dong H. Ahn, Ignacio Laguna, Martin Schulz, and Matthias S. Muller. Testing Infrastructure for OpenMP Debugging Interface Implementations. In the International Workshop on OpenMP (IWOMP), Oct 5, 2016.
- [IWOMP] Joachim Protze, Ignacio Laguna, Dong H. Ahn, John DelSignore, Ariel Burton, Martin Schulz, and Matthias S. Muller. Lessons Learned from Implementing OMPD: a Debugging Interface for OpenMP. In the 11th International Workshop on OpenMP (IWOMP), Aachen, Germany, October 1-2, 2015.
- [LLVM-HPC] Joachim Protze, Simone Atzeni, Dong H Ahn, Martin Schulz, Ganesh Gopalakrishnan, Matthias S Muller, Ignacio Laguna, Zvonimir Rakamaric, Greg L Lee. Towards providing low-overhead data race detection for large OpenMP applications. In Workshop on LLVM Compiler Infrastructure in HPC, held in conjunction with SC’14, New Orleans, Louisiana, Nov, 2014.
- [ScalA] Ignacio Laguna, Edgar A Leon, Martin Schulz, Mark Stephenson. A study of application-level recovery methods for transient network faults. In Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems (ScalA’13), held in conjunction with SC’13, Denver, Colorado, Nov, 2013.
- [SEHPCCSE] Dong H Ahn, Gregory L Lee, Ganesh Gopalakrishnan, Zvonimir Rakamaric, Martin Schulz, Ignacio Laguna. Overcoming extreme-scale reproducibility challenges through a unified, targeted, and multilevel toolset. In 1st International Workshop on Software Engineering for High Performance Computing in Computational Science and Engineering (SEHPCCSE’13), held in conjunction with SC’13, Denver, Colorado, Nov, 2013.
- [SELSE] Greg Bronevetsky, Ignacio Laguna, Saurabh Bagchi, Bronis R. de Supinski, Dong H. Ahn, and Martin Schulz. Statistical Fault Detection for Parallel Applications with AutomaDeD. In 6th IEEE Workshop on Silicon Errors in Logic - System Effects (SELSE’10), Stanford, CA, Mar 23-24, 2010.