Integrando checkpointing e compressão de dados de um Solver FWI com OpenMP em multi-GPUS

Carregando...
Imagem de Miniatura

Título da Revista

ISSN da Revista

Título de Volume

Editor

Universidade Federal de São Carlos

Resumo

High-performance computing (HPC) plays a crucial role in various scientific fields, and the use of GPUs for parallel computing has proven to be an efficient solution for large-scale problems, such as Full Waveform Inversion (FWI) in seismic simulations. This work proposes the implementation of domain decomposition, checkpointing, and data compression techniques in an FWI application executed on multiple GPUs, aiming to optimize computational performance and reduce memory usage. The research was conducted in three main stages: familiarization with the existing codebase, study of checkpointing and data compression techniques, and implementing domain decomposition in a multi-GPU environment. The domain decomposition strategy was applied by subdividing the three-dimensional grid into "slices," allowing distributed execution across multiple devices. Explicit control over data allocation and transfer between the CPU and GPUs, using OpenMP and nvcomp, enabled greater efficiency in memory management and data transfers. Experimental results, performed on a node with 4 NVidia V100 GPUs connected via NVLink, showed that the use of multiple GPUs offers significant performance gains for large-scale problems. For smaller problems, the overhead associated with data transfer between devices neutralizes the benefits of parallelism. The combination of checkpointing and data compression resulted in improvements in memory usage and communication between devices, particularly in larger domains and with more iterations. This work contributes to the advancement of FWI applications in distributed computing environments, providing a scalable and efficient solution for large-scale scientific problems.

Descrição

Citação

FREIRE, Yuri Nicolau. Integrando checkpointing e compressão de dados de um Solver FWI com OpenMP em multi-GPUS. 2025. Trabalho de Conclusão de Curso (Graduação em Engenharia de Computação) – Universidade Federal de São Carlos, São Carlos, 2025. Disponível em: https://repositorio.ufscar.br/handle/20.500.14289/21616.

Coleções

item.page.endorsement

item.page.review

item.page.supplemented

item.page.referenced

Licença Creative Commons

Exceto quando indicado de outra forma, a licença deste item é descrita como Attribution 3.0 Brazil