Strange MPI_AllReduce result for a 16-byte real

Question

Strange MPI_AllReduce result for a 16-byte real

Compiler: gfortran-4.8.5

MPI library: OpenMPI-1.7.2 (pre-installed OpenSuSE 13.2)

This program:

use mpi implicit none real*16 :: x integer :: ierr, irank, type16 call MPI_Init(ierr) call MPI_Comm_Rank(MPI_Comm_World, irank, ierr) if (irank+1==1) x = 2.1 if (irank+1==8) x = 2.8 if (irank+1==7) x = 5.2 if (irank+1==4) x = 6.7 if (irank+1==6) x = 6.5 if (irank+1==3) x = 5.7 if (irank+1==2) x = 4.0 if (irank+1==5) x = 6.8 print '(a,i0,a,f3.1)', "rank+1: ",irank+1," x: ",x call MPI_AllReduce(MPI_IN_PLACE, x, 1, MPI_REAL16, MPI_MAX, MPI_Comm_World, ierr) if (irank==0) print '(i0,a,f3.1)', irank+1," max x: ", x call MPI_Finalize(ierr) end

I also tried real(16) , real(kind(1.q0)) . real(real128) is actually equivalent to real*10 for this compiler.

Result:

 > mpif90 reduce16.f90 > mpirun -n 8 ./a.out rank+1: 1 x: 2.1 rank+1: 2 x: 4.0 rank+1: 3 x: 5.7 rank+1: 4 x: 6.7 rank+1: 5 x: 6.8 rank+1: 6 x: 6.5 rank+1: 7 x: 5.2 rank+1: 8 x: 2.8 1 max x: 2.8

The program finds the true maximum for real*10 , saving MPI_REAL16 . The MPI specification (3.1, p. 628 and 674) is not very clear if MPI_REAL16 matches real*16 or real(real128) if they differ.

In addition, assuming MPI_REAL16 is actually real(real128) , and trying to use it in a program leads to another problem:

 Error: There is no specific subroutine for the generic 'mpi_recv' at (1) Error: There is no specific subroutine for the generic 'mpi_send' at (1)

what does not happen for real*16 . (apart from the fact that you need to be able to transmit any bit diagram, so this check is unnecessary)

What is the correct way to use 16-byte reals? Is the OpenMPI library a bug?

+6

fortran quadruple-precision mpi openmpi

Vladimir F Oct 13 '15 at 17:33

source share

1 answer

Jeff · Answer 1 · 2015-12-11T18:40:58+0000

Although this should work correctly in every MPI implementation, a simple solution is to implement a user-specific abbreviation for this type written in Fortran, so there is no problem implementing it in C (this is how MPICH and OpenMPI try to do everything, so there are problems when C cannot reproduce Fortran behavior).

The following is an attempt to implement this. This is a user-defined abbreviation in Fortran. I'm sure experienced Fortran modern programmers can do it better.

  subroutine sum_real16(iv,iov,n) implicit none integer, intent(in) :: n real*16, intent(in) :: iv(:) real*16, intent(inout) :: iov(:) integer :: i do i = 1,n iov(i) = iov(i) + iv(i) enddo end subroutine sum_real16 subroutine reduce_sum_real16(iv, iov, n, dt) use, intrinsic :: iso_c_binding, only : c_ptr use mpi_f08 implicit none type(c_ptr), value :: iv, iov integer :: n type(MPI_Datatype) :: dt if ( dt .eq. MPI_REAL16 ) then call sum_real16(iv,iov,n) endif end subroutine reduce_sum_real16 program test_reduce_sum_real16 use, intrinsic :: iso_c_binding use mpi_f08 implicit none integer, parameter :: n = 10 real*16 :: output(n) real*16 :: input(n) real*16 :: error integer :: me, np procedure(MPI_User_function) :: reduce_sum_real16 type(MPI_Op) :: mysum integer :: i call MPI_Init() call MPI_Comm_rank(MPI_COMM_WORLD,me) call MPI_Comm_size(MPI_COMM_WORLD,np) output = 0.0 input = 1.0*me call MPI_Op_create(reduce_sum_real16,.true.,mysum) call MPI_Allreduce(input,output,n,MPI_REAL16,mysum,MPI_COMM_WORLD) error = 0.0 do i = 1,n error = error + (output(i)-1.0*np) enddo if (error.gt.0.0) then print*,'SAD PANDA = ',error call MPI_Abort(MPI_COMM_SELF,1) endif call MPI_Op_free(mysum) call MPI_Finalize() end program test_reduce_sum_real16

This program returns without errors with the Intel 16 Fortran compiler and MPICH 3.2+. Apparently, I am not using I / O correctly, so the confidence in the correctness of this program is not as high as if I could write all the results to stdout.

Strange MPI_AllReduce result for a 16-byte real

More articles: