Fatal Error in PMPI_Wait?
Posted: Mon Dec 19, 2016 12:17 pm
I've been getting the following error over the past 3 days in a model run that we've been using daily with no issues for a couple of months now.
The cluster still passes MPI_check, and I can provide the rest of the logs, but I'm curious whether anyone has an idea what is causing this. I did notice that the NASA SPoRT database changed its FTP location the other day, so my gribinfo file was incorrect, but I'm still at a loss.
Code:
taskid: 9 hostname:node1 Fatal error in PMPI_Wait: Unknown error class, error stack:
PMPI_Wait(203)..................: MPI_Wait(request)=0x4e52440, status=0x7ffff288ffc0) failed
MPIR_Wait_impl(100)
MPIDU_Complete_posted_with_error(1149):Process failed
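Since "Process failed" suggests another rank died rather than rank 9 itself, it may help to check whether the same node shows up every time across the full logs. Below is a minimal Python sketch for that, assuming every failing rank prints the same "taskid: N hostname:NAME" prefix as in the line above. The function names `failing_ranks` and `count_by_host` are made up for illustration and are not part of any MPI or modeling tool.

```python
import re
from collections import Counter

# Matches the "taskid: <n> hostname:<name>" prefix seen in the log above.
# Assumption: every failing rank prints this prefix in the same form.
PREFIX = re.compile(r"taskid:\s*(\d+)\s+hostname:\s*(\S+)")


def failing_ranks(log_text):
    """Return (taskid, hostname) pairs for lines that report a fatal MPI error."""
    pairs = []
    for line in log_text.splitlines():
        if "Fatal error in" not in line:
            continue  # skip lines that are not the start of an error report
        match = PREFIX.search(line)
        if match:
            pairs.append((int(match.group(1)), match.group(2)))
    return pairs


def count_by_host(pairs):
    """Count how many fatal-error reports came from each hostname."""
    return Counter(host for _, host in pairs)


if __name__ == "__main__":
    sample = (
        "taskid: 9 hostname:node1 Fatal error in PMPI_Wait: "
        "Unknown error class, error stack:"
    )
    pairs = failing_ranks(sample)
    print(pairs)
    print(count_by_host(pairs))
```

If one hostname dominates the counts over several days of logs, that node would be the first place I would look. An even spread would point more toward the input data or the run configuration.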