[Title] [Prev] [Contents] [Next] [End]


Contents


Contents

Figures

Tables

Preface

Audience
Using This Guide
Related Documents
Technical Assistance

1 Introduction

What is LSF?
LSF Features
Host Resources
Interactive Processing
Batch Processing
Clusters
Fault Tolerance
Structure of LSF
Load Information Manager
Remote Execution Server
Load Sharing Library
Load Sharing Utility Programs
Applications
Structure of LSF Batch

2 Getting Started

Getting Cluster Information
Displaying the Cluster and Master Names
Displaying Available Resources
Getting Host Information
Displaying Static Host Information
Displaying Load Information
Running Jobs
Running Jobs on Remote Hosts
Load Sharing Commands With lstcsh
Parallel Processing With lsmake
Batch Processing
Listing Hosts
Submitting a Job
Selecting a Job Queue
Tracking Batch Jobs
xbsub and xlsbatch GUI Applications

3 Cluster Information

Finding the Master
Listing Resources
Listing Hosts
Displaying the Load
Graphical Load Display

4 Resources

Introduction to Resources
Load Indices
Static Resources
Boolean Resources
Dedicated Resources
Listing Resources
Resource Requirement Strings
Selection String
Order String
Resource Usage String
Job Spanning String
Configuring Resource Requirements
Task List
Examining and Changing Task Lists
Using Resource Requirements

5 Using LSF Batch

Batch Jobs
Scheduling Policy
Host Partition Fairshare Scheduling
Queue-Level Fairshare Scheduling
Preemptive and Preemptable Scheduling
Exclusive Scheduling
Interactive Batch Scheduling
Scheduling Parameters
Load Thresholds
Resource Requirement Parameters
Run and Dispatch Windows for Queues and Hosts
Run Windows
Dispatch Windows
Batch Queues
Finding Out What Queues Are Available
Detailed Queue Information
Automatic Queue Selection
Specifying Default Queues
Queue Selection Mechanism
Choosing a Queue
Batch Users
Batch Hosts
User and Host Groups
Configuration Parameters
User Controlled Account Mapping

6 Submitting Batch Jobs

Input and Output
Resource Requirements
Dynamic Resource Requirements
Specifying Resource Reservation
Viewing Reserved Resources
Host Selection
Host Preference
Resource Limits
Pre-execution Commands
Job Dependencies
Remote File Access
Start and Termination Time
Parallel Jobs
Minimum and Maximum Number of Processors
Specifying Locality
Processor Reservation
Re-initializing Job Environment on the Execution Host
Other bsub Options
Job Scripts
Embedded Submission Options
Running a Job Under a Particular Shell
Submitting Jobs Using xbsub

7 Tracking Batch Jobs

Displaying Job Status
Finding Pending or Suspension Reasons
Monitoring Resource Consumption of Jobs
Displaying Job History
Checking Partial Job Output
Displaying Queue and Host Status
Killing Jobs
Suspending and Resuming Jobs
Moving Jobs Within and Between Queues
Job Parameter Modification
Job Tracking and Manipulation Using xlsbatch

8 Running Interactive Jobs

Shared Files and User IDs
Running Remote Jobs with lsrun
Running Parallel Jobs with lsgrun
Load Sharing Interactive Sessions
Load Sharing Login
Load Sharing X Sessions
Job Starter
Interactive Batch Job Support
Shell Mode for Remote Execution

9 Using lstcsh

Starting lstcsh
Using lstcsh as Your Login Shell
Automatic Remote Execution
Host Redirection
Job Control
Built-in Commands
The lsmode Command
The connect Command
Modes of Operation
Differences from Other Shells
Writing Shell Scripts in lstcsh
Limitations

10 Using lsmake

Parallel Execution
Invoking lsmake
Specifying the Number of Processors
File Server Load
Tuning Your Makefile
Building in Subdirectories
Running lsmake as a Batch Job
Differences from Other Versions of make

11 Checkpointing and Migration

Approaches to Checkpointing
Kernel-level Checkpointing
User-level Checkpointing
Application-level Checkpointing
Checkpoint Directory
Uniform Checkpointing Interface
The echkpnt Command
The erestart Command
Submitting Checkpointable Jobs
Checkpointing a Job
Restarting a Checkpointed Job
Job Migration
Queues and Hosts for Automatic Job Migration
Automatically Rerunning and Restarting Jobs
Submitting a Job for Automatic Migration
Building Checkpointable Jobs
The Checkpoint Library
The Checkpoint Startup Routine
Linking
Limitations

12 Customizing Batch Jobs for LSF

Environment Variables
Parallel Jobs
Getting the Host List
Starting Parallel Tasks With lstools
Using lsmake to Run Parallel Batch Jobs
Submitting PVM Jobs to LSF Batch
Submitting MPI Jobs to LSF Batch
Submitting POE Jobs to LSF Batch
Using a Job Starter for Parallel Jobs

13 Using LSF MultiCluster

What is LSF MultiCluster?
Getting Remote Cluster Information
Running Batch Jobs across Clusters
Running Interactive Jobs on Remote Clusters
User-Level Account Mapping between Clusters

14 Interoperation with NQS

Choosing an LSF Batch Queue
Submitting a Job from LSF to NQS
Controlling Jobs Running on NQS
Forwarding of Output Files

A Customizing xlsbatch Menu Items

B New Features in LSF 3.0

Host Preference
Interactive Batch Job Support
Shell Mode for Remote Execution
Job Starter
Resource Consumption Monitoring of Running Jobs
Job-Level Resource Limits
Resource Reservation
More Specific Job Pending Reasons
Enhanced Parallel Job Support
Enhanced Checkpointing Interface
Using Hostname in the Resource Requirement String
Customizing Menu Items of xlsbatch
New Option for bjobs Command


[Title] [Prev] [Contents] [Next] [End]

doc@platform.com

Copyright © 1994-1997 Platform Computing Corporation.
All rights reserved.