Why am I unable to validate my Job Manager configuration in the Parallel Computing Toolbox?

5 views (last 30 days)
I have MATLAB Parallel Server set up on a cluster using the MathWorks Job Manager. When I attempt to validate the cluster configuration it fails. How can I resolve this issue?

Accepted Answer

MathWorks Support Team
MathWorks Support Team on 26 Mar 2023
Edited: MathWorks Support Team on 19 Apr 2023
There are several issues that can prevent the validation of the cluster. Run the following tests below to make sure that your configuration is setup properly. If at any point you receive an error message, you can submit a request to Installation support using the link at the bottom of the page. When submitting a request, be sure to include the following:
- Your license number
- The release of MATLAB on the client and the cluster
- The output of your validation (click details to get the full information)
- The results of the tests below
Also when submitting a request please reference Solution 1-BJRNU9.
1) Test the licensing of MATLAB Parallel Server
The first step is to ensure that the licensing for MATLAB Parallel Server works on your cluster. This will also test to see if MATLAB is crashing on startup on your cluster. To test this, go to one of the cluster nodes and open up a Windows Command Prompt by clicking on the Start Menu and go to All Programs, Accessories, and click on Command Prompt. In the command prompt, run the following commands:
cd $MATLAB\bin (where $MATLAB is the installation folder for MATLAB on the cluster)
matlab.exe -dmlworker -nodisplay -logfile C:\output.txt -r "ver;exit"
This will generate an output.txt file in C:\ that contains the ver output on the cluster. If the log file contains a license manager error, this is the issue. In that case, check the support site for the license manager error number and take the appropriate action to resolve the license error before proceeding.
2) Check the releases of MATLAB on the cluster and the client where you validated
If you get the output of the "ver" command in the log file, check the releases of all the products in the list. The release of each product should match for all the products. Additionally, the release should match the release that is installed on the client where you ran the validation. To check the release on the client, run the ver command in MATLAB's command window. If the release of Parallel Computing Toolbox and MATLAB do not match the release of MATLAB and MATLAB Parallel Server on the cluster, you will not be able to use this configuration until the installations are at the same release.
3) Use the Admin Center to confirm that the cluster is configured properly
As of MATLAB R2009a, the MathWorks Job Manager has a new tool called the Admin Center which can be used to verify the cluster has been setup properly. This tool will allow you to verify the cluster configuration as well as run some tests on the cluster.
Verify your cluster using the Admin Center. For the instructions on how to use the Admin Center, see the documentation here:
NOTE: If the cluster fails the Admin Center tests, save the session and send the .mdcs file to Installation Support along with the other information requested above.
4) Manually set the client's return hostname
If there are multiple network configurations on the client computer, the client may be reporting the wrong return address. To change this, restart MATLAB and run the following command:
pctconfig('hostname','<ip address>')
where <ip address> refers to the external IP address of the client for which the cluster can contact. Once complete, retry validation to see if the return address was incorrect. If it validates, you can try reconfiguring the computer's network settings or create a startup.m file that calls the pctconfig each time MATLAB is launched. For more information about startup.m see the documentation here:
5) Check to ensure you have correctly configured the client configuration
In your client MATLAB, go to the Parallel menu to Manage Configurations. Right click on your jobmanager configuration and select Properties. For the Job Manager you must set the appropriate values for "LookupURL" (hostname of the Job Manager) and "Name" (the Job Manager name).  
6) If automatic validation does not assist the troubleshooting process, you can run a manual validation test by submitting sample jobs to MATLAB Parallel Server.  To do so, please reference the below article:
How do I manually validate my MATLAB cluster?
http://www.mathworks.com/matlabcentral/answers/130554
If the manual tests passed, your configuration is working and you should be able to submit jobs.
If you are still having an issue, contact Installation support here:
NOTE
: Starting in R2019a the following name changes occurred:
  • MATLAB Distributed Computing Server was renamed to MATLAB Parallel Server 
  • mdce_def was renamed to mjs_def
  • mdce binary was renamed to mjs

More Answers (0)

Categories

Find more on Startup and Shutdown in Help Center and File Exchange

Tags

No tags entered yet.

Products


Release

R2009b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!