Supriya Ghosh (Editor)

TORQUE

Development status: Active
Size: 5 MB
Website: TORQUE website
Developer(s): Adaptive Computing
Written in: ANSI C
Platform: Cross-platform
Available in: English
Operating system: Unix-like
Initial release: 2003
Type: distributed resource manager
License: OpenPBS version 2.3 (non-free in DFSG), or TORQUE v2.5+ Software License v1.1
Stable release: 6.1.0 / 10 November 2016



The Terascale Open-source Resource and QUEue Manager (TORQUE) is a distributed resource manager providing control over batch jobs and distributed compute nodes. TORQUE can integrate with the non-commercial Maui Cluster Scheduler or the commercial Moab Workload Manager to improve overall utilization, scheduling and administration on a cluster.
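To make "control over batch jobs" concrete, here is a minimal sketch of how a job description might be assembled and handed to the resource manager. It assumes the PBS-style `#PBS` directives and the `qsub` command that TORQUE inherits from OpenPBS; the helper names `build_job_script` and `submit`, the job name, and the command are hypothetical and chosen only for illustration.

```python
import subprocess
import tempfile


def build_job_script(name, nodes, ppn, walltime, command, queue=None):
    """Return the text of a PBS-style batch script (hypothetical helper)."""
    lines = [
        "#!/bin/sh",
        f"#PBS -N {name}",                   # job name
        f"#PBS -l nodes={nodes}:ppn={ppn}",  # nodes and processors per node
        f"#PBS -l walltime={walltime}",      # wall-clock limit, HH:MM:SS
    ]
    if queue is not None:
        lines.append(f"#PBS -q {queue}")     # destination queue, if any
    # Run from the directory the job was submitted from.
    lines.append('cd "$PBS_O_WORKDIR"')
    lines.append(command)
    return "\n".join(lines) + "\n"


def submit(script_text):
    """Write the script to a temporary file and pass it to qsub.

    Assumes a qsub client is on PATH; returns whatever qsub prints,
    which is normally the new job identifier.
    """
    with tempfile.NamedTemporaryFile("w", suffix=".pbs", delete=False) as handle:
        handle.write(script_text)
        path = handle.name
    result = subprocess.run(
        ["qsub", path], capture_output=True, text=True, check=True
    )
    return result.stdout.strip()
```

Calling `build_job_script("demo", 2, 8, "01:00:00", "./run_simulation")` yields a script requesting two nodes with eight processors each for one hour. Whether and when the job actually runs is decided by the scheduler (Maui or Moab in the setups described above), not by the submission itself.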

The TORQUE community has extended the original PBS to improve scalability, fault tolerance, and functionality. Contributors include NCSA, OSC, USC, the US DOE, Sandia, PNNL, UB, TeraGrid, and other HPC organizations. Its developers describe TORQUE as open-source software released under the OpenPBS version 2.3 license, but it is classed as non-free under the Debian Free Software Guidelines because of licensing issues.

Feature Set

TORQUE provides enhancements over standard OpenPBS in the following areas:

  • Fault Tolerance
      • Additional failure conditions checked/handled
      • Node health check script support
  • Scheduling Interface
      • Extended query interface providing the scheduler with additional and more accurate information
      • Extended control interface allowing the scheduler increased control over job behavior and attributes
      • Allows the collection of statistics for completed jobs
  • Scalability
      • Significantly improved server to MOM communication model
      • Ability to handle larger clusters (over 15 TF/2,500 processors)
      • Ability to handle larger jobs (over 2000 processors)
      • Ability to support larger server messages
  • Usability
      • Extensive logging additions
      • More human readable logging (i.e. no more 'error 15038 on command 42')
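
The node health check support listed above can be illustrated with a small script of the kind a compute node might run periodically. This is a sketch under assumptions, not TORQUE's own code: it assumes the convention, as commonly documented for the `$node_check_script` setting of the MOM daemon, that a failing check prints a line beginning with `ERROR`. The thresholds and the function names `check_node` and `format_report` are hypothetical and should be checked against the administrator guide for the installed version.

```python
import os
import shutil


def check_node(scratch_dir="/tmp", min_free_bytes=1 << 30, max_load_per_cpu=4.0):
    """Return a list of problem descriptions; an empty list means healthy."""
    problems = []

    # Check 1: enough free space in the scratch directory.
    free = shutil.disk_usage(scratch_dir).free
    if free < min_free_bytes:
        problems.append(f"low disk space on {scratch_dir}: {free} bytes free")

    # Check 2: one-minute load average per CPU is not excessive.
    try:
        load1 = os.getloadavg()[0]
    except OSError:
        load1 = 0.0  # load average unavailable; skip this check
    cpus = os.cpu_count() or 1
    if load1 / cpus > max_load_per_cpu:
        problems.append(f"high load: {load1:.2f} on {cpus} CPUs")

    return problems


def format_report(problems):
    """Format problems as a single line starting with ERROR (assumed convention)."""
    if not problems:
        return ""
    return "ERROR " + "; ".join(problems)


if __name__ == "__main__":
    report = format_report(check_node())
    if report:
        print(report)
```

A healthy node prints nothing. When a check fails, the single `ERROR ...` line gives the resource manager and scheduler a message they can report and act on, for example by keeping new jobs off that node.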