As of Dec. 2007, the Hoffman2 Cluster has:
Since each node has 4 cores, the queues have been set up to run 4 processes or jobs per node.
The Hoffman2 Cluster has the following queues:
Each research group that has contributed nodes to the Hoffman2 Cluster has a queue named by the name of the group.
Each research group queue consists of either:
Queue properties:
The purpose of this queue is to both harvest unused cycles and allow members of research groups that have contributed nodes to run the extended shared Hoffman2 Cluster.
The 24 hour queue has access to ATS-contributed nodes/cores from the Base Shared Cluster and research group equivalent cores that are not currently running jobs.
Only those research groups that have contributed nodes to the shared Hoffman2 Cluster can take advantage of the 24 Hour Queue. This queue is not open to users of the Campus General Purpose Cluster.
Queue Properties:
The campus queue is intended for parallel jobs submitted by those members of the UCLA community who have access to the Campus General Purpose Cluster. It is limited to the number of processors in that part of the Cluster.
Queue Properties:
If your program, for some reason, absolutely requires more than 24 hours to run and cannot be stoped and restarted in the 24 hour time frame, you can make a special request to have it run for up to either 3 or 5 days. Send your request by email to atshpc@ucla.edu. Include the following in your request:
The application queue is intended for serial jobs and those jobs which run those licensed applications which ATS has purchased for campus use. The nodes of the appication cluster differ from those of the rest of the shared Hoffman2 Cluster in the number of cores they have per node and the fact that they do not have InfiniBand interconnects. They are Intel 64-bit nodes where as all the other nodes are AMD 64-bit nodes.
Queue Properties:
The Sun Grid Engine (SGE) is the job management system used on the Hoffman2 Cluster to ensure balanced use of resources by matching job needs to available resources. SGE serves as the job scheduler. SGE knows which users are in which groups and enforces the queueing policies. When you specify the time limit for the job, SGE will place your job in the correct queue even if you do not specify it.
When you submit a job using any of the methods: from the UCLA Grid Portal, via the queue scripts or the qsub command just request the number of wall clock hours of execution required and any needed applicaitons. Your job will automatically be assigned to a queue as follows:
| Is the number of cores requested by the job > the number contributed by the research group to the shared Hoffman2 Cluster? | Number of Hours Requested | |
| <=24 | >24 | |
| No | The queue (research group or 24 hour) in which this job will start first. | reasearchGroupName.q |
| Yes | the 24 hour queue | This job can never run |
| Is this job asking for a licensed application that the research group does not have licenses for? | Number of Hours Requested | |
| <=24 | >24 | |
| Yes | the application queue | This job can never run |
| Is this job asking for a licensed application ATS is providing? | Queue |
| No | the campus queue |
| Yes | the application queue |
Programs that require more than 24 hours to complete and which have to be run queues limited to 24 hours, should checkpoint before 24 hours is up so they can be continued later.