Cannot connect to MJS with parcluster

I'm setting up a cluster environment, but I can't get the client to communicate with the mdce service. When I try to validate the cluster profile, parcluster fails to contact the cluster and reports the error below. Both sides are running MATLAB R2013a. There is no firewall blocking traffic between the hosts, and nodestatus.bat seems to work fine. What can I do to get this working?
Error using parcluster (line 53)
Could not contact an MJS lookup service on host '10.236.95.1'. Possible reasons for this problem are:
1. MJS has not been started, has crashed, or has been shut down.
2. A firewall is blocking communication between this computer and '10.236.95.1'.
3. This computer cannot resolve the hostname of '10.236.95.1', it resolves it to an incorrect IP address.
4. '10.236.95.1' resolves its own hostname to an incorrect IP address.
5. Network routers are unable to route traffic from this computer to '10.236.95.1'.
The hostname, 10.236.95.1, corresponds to the fully qualified hostname 10.236.95.1.
This computer resolves it to the IP address 10.236.95.1.
nodestatus.bat reports the following:
C:\Program Files\MATLAB\R2013a\toolbox\distcomp\bin>nodestatus.bat -remotehost 10.236.95.1 -infolevel 3
MDCE service:
    Security level                                         0
    Version                                                6.2
    MATLAB                                                 /cm/shared/apps/MATLAB/R2013a
    mdce_def File                                          /cm/shared/apps/MATLAB/R2013a/toolbox/distcomp/bin/mdce_def.sh
    Platform                                               glnxa64
Job manager lookup process:
    Status                                                 Running
Job manager:
    Name                                                   <censored>
    Running on host                                        <censored>
    Number of workers                                      14
    Worker names and host names                            node001_worker01, node001
                                                           node001_worker02, node001
                                                           node002_worker01, node002
                                                           node002_worker02, node002
                                                           node003_worker01, node003
                                                           node003_worker02, node003
                                                           node004_worker01, node004
                                                           node004_worker02, node004
                                                           node005_worker01, node005
                                                           node005_worker02, node005
                                                           node006_worker01, node006
                                                           node006_worker02, node006
                                                           node007_worker01, node007
                                                           node007_worker02, node007
    Start time                                             Tue Jun 04 15:51:27 CEST 2013
    Port                                                   27355
    Requested job manager lookup processes                 <censored>:27350
    Registered with job manager lookup processes on hosts  <censored>:27350
    Database size in bytes                                 722065
    VM heap size in bytes                                  28835840
    Database item cache size                               0
    Worker lease timeout in milliseconds                   60000
    Network addresses of host                              10.236.95.1
                                                           127.0.0.1
Summary:
The mdce service on <censored> manages the following processes:
    Job manager lookup processes                           1
    Job managers                                           1
    Workers                                                0
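For anyone wanting to double-check the "no firewall" assumption from the client side, here is a minimal sketch in Python that tries to open plain TCP connections to the two ports shown in the nodestatus output (27350 for the lookup process, 27355 for the job manager). The function names `port_is_reachable` and `probe_ports` are made up for illustration. Note that a successful TCP connect only shows that the port is open; it does not prove that MJS communication as a whole works, and MJS may use further ports that are not listed above.

```python
import socket


def port_is_reachable(host, port, timeout=3.0):
    """Return True if a TCP connection to (host, port) succeeds within `timeout` seconds.

    Hypothetical helper for illustration; it only tests basic TCP reachability.
    """
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


def probe_ports(host, ports, timeout=3.0):
    """Return a dict mapping each port to True/False depending on reachability."""
    return {port: port_is_reachable(host, port, timeout) for port in ports}
```

Run on the client, `probe_ports("10.236.95.1", [27350, 27355])` returns a dictionary with one True/False entry per port. A False entry would point towards reasons 2 or 5 in the error message (firewall or routing) rather than a problem with the MJS processes themselves.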

Accepted Answer

Thomas Ibbotson on 11 Jul 2013
I can't see anything obviously wrong there. I suggest you contact technical support; we have support engineers who can help you set up MDCS. http://www.mathworks.com/company/aboutus/contact_us/
