We have been facing some issue, with below error, any assistance would be appreciated much and helpful.
Backing up 178404 files..............................
129874 of 178404 files (72%).........
178404 of 178404 files (100%)
0512-038 savevg: Backup Completed Successfully.
0042-001 nim: processing error encountered on "master":
0042-006 m_mkbosi: (From_Master) connect Error 0
poll: setup failure
and we noticed FIN_WAIT_1 at client side and LAST_ACK is hanging at NIM server side
NIM server end:-
============
# netstat -Aan |grep 1022
f1001000003e63b8 tcp4 0 0 10.231.128.141.1022 10.231.128.10.3902 LAST_ACK
# netstat -Aan |grep 1022
f1001000003e63b8 tcp4 0 0 10.231.128.141.1022 10.231.128.10.3902 LAST_ACK
# netstat -Aan |grep 1022
f1001000003e63b8 tcp4 0 0 10.231.128.141.1022 10.231.128.10.3902 LAST_ACK
# netstat -Aan |grep 1022
f1001000003e63b8 tcp4 0 0 10.231.128.141.1022 10.231.128.10.3902 LAST_ACK
Client side nimsh.log
================
Thu Oct 3 21:57:32 2019 getting 2nd port
Thu Oct 3 21:57:32 2019 count equals 0
Thu Oct 3 21:57:32 2019 count equals 1
Thu Oct 3 21:57:32 2019 count equals 2
Thu Oct 3 21:57:32 2019 count equals 3
Thu Oct 3 21:57:32 2019 count equals 4
Thu Oct 3 21:57:32 2019 got stderr port 1021
Thu Oct 3 21:57:32 2019 get_auxport: timelag is 1
Thu Oct 3 21:57:33 2019 get_auxport: timelag is 2
Thu Oct 3 21:57:35 2019 get_auxport: timelag is 4
Thu Oct 3 21:57:39 2019 get_auxport: timelag is 8
Thu Oct 3 21:57:47 2019 get_auxport: timelag is 16
Thu Oct 3 21:58:03 2019 cannot bind addr: Address already in use
Thu Oct 3 21:58:03 2019 error: can't get stderr port for 2nd socket
Hi Kunal, these issues are almost always related to host/IP resolution on both/either the NIM client and the master. If you're are sure that the /etc/hosts & DNS entries are correct for both the NIM client and the master, and everything else looks OK, then I would recommend opening a PMR for support. Please also check that /etc/netsvc.conf has a single entry of hosts=local4,bind4. Is there a firewall in the middle? Please refer to the following link for details on NIM comms in a firewall environment http://www-01.ibm.com/support/docview.wss?uid=isg3T1011808. Please check the /var/adm/ras/nimsh.log file on the client, for any useful messages and/or errors. Please refer to the following document on configuring NIMSH with SSL, http://www-01.ibm.com/support/docview.wss?uid=isg3T1024615. Is this issue impacting just one NIM client, or multiple. If it's just this one NIM client that has a problem, try removing and redefining the NIM client definition. Good luck!
Hi Chris
In
my environment /etc/hosts entries in NIM Master and AIX LPAR are
correct. The Client is defined to use nimsh, I can see that nimsh ports
are enabled in /etc/services. Yet when I run niminit command from AIX
LPAR it fails with below error:
# niminit -a name=BLR-Deploy2 -a master=10.23.172.99
0042-042 niminit: unable to communicate with 10.23.172.99
#
And when I try to run nim -o showlog <AIX LPAR> it fails as well
# nim -o showlog BLR-Deploy2
warning: warning: 0042-001 m_showlog: processing error encountered on "BLR-Deploy2":
0042-006 m_showlog: (From_Master) connect A remote host did not respond within the timeout period.
nconn: connect() failed, errno is 78
BLR-Deploy2: A remote host did not respond within the timeout period.
#
Any idea what could be the possible reason.
If the client is configured for rsh, make sure that the client rshd daemon is running and that you can connect to the client from the master using rsh as root .e.g. # rsh client1 date. Check the /.rhosts file configuration and permissions is correct. If the client is configured for nimsh, make sure that the client nimsh daemon is running. Try communicating with the client from the master using 'nim -o showlog'. Check the /var/adm/ras/nimsh.log file for errors on the client.
Hi b2b, could be lot's of reasons. Is this over rsh or nimsh? Check that the NIM master and client can resolve each others IP addresses and hostnames correctly. Double-check the entries in the /etc/hosts file to ensure they are correct on both the master & the client. Check for duplicate entries.
HI Chris, I find your blogs extremely useful. I am facing one issue on my NIM server while running command [root@<nimserver>]/export/nim/backup_images=> nim -o showlog <nimclienthostname> 0042-001 nim: processing error encountered on "master": 0042-006 m_showlog: (From_Master) connect Error 0 Permission denied. Do you know what could be the possible root cause?</nimclienthostname></nimserver>