Tivoli Workload Scheduler/AWS Administrator
- *Description
The Digital Modernization Sector has an opening for an AWS Administrator/Tivoli Workload Scheduler engineer to support a large healthcare contract.
We are seeking an AWS Administrator/Tivoli Workload Scheduler engineer to support the day-to-day operations, security, and performance of our AWS cloud infrastructure while managing enterprise-level workload scheduling and automation through Tivoli/IBM Workload Scheduler (TWS/IWS). This role is hands-on and operationally focused, requiring strong troubleshooting skills, disciplined execution, and the ability to support business-critical batch and cloud-native workflows. The ideal candidate has strong AWS administration experience, a clear grasp of workload automation platforms, and the ability to operate effectively in a production enterprise environment with minimal supervision.
- *Core Purpose
Maintain the health, security, and performance of AWS environments while ensuring reliable execution of automated job scheduling workflows using TWS.
- *AWS Infrastructure Management
- Provision, configure, and maintain AWS resources including EC2, S3, IAM, and VPCs in alignment with the AWS Well-Architected Framework.
- Perform routine system maintenance, patching, and configuration updates across cloud environments.
- *Workload Automation & Scheduling
- Operate and support the TWS/IWS platform, including job stream monitoring, dependency management, and agent health checks.
- Ensure reliable execution of daily and intraday production workloads.
- Monitor batch workloads and ensure successful completion of processes.
- Implement scheduling solutions for application teams.
- Follow enterprise change management processes and procedures.
- Troubleshoot and resolve job failures, scheduling conflicts, and dependency issues.
- Assist with workload automation platform maintenance activities.
- *Security & Compliance
- Enforce least-privilege access controls using AWS IAM.
- Monitor and remediate security findings using tools such as AWS Security Hub.
- Support audit and compliance requirements related to cloud infrastructure and automation platforms.
- *Incident Response & Operations Support
- Triage, diagnose, and resolve incidents involving AWS infrastructure and failed or delayed scheduled batch jobs.
- Escalate complex issues appropriately and participate in root-cause analysis efforts.
- *Performance & Cost Optimization
- Identify performance bottlenecks and inefficiencies related to cloud resource usage and job throughput.
- Implement auto-scaling, scheduling adjustments, or script improvements to improve performance and control costs.
- *Backup & Disaster Recovery
- Maintain, test, and document backup and disaster recovery strategies for AWS resources and TWS databases.
- Participate in disaster recovery exercises and validate recovery procedures.
- *Basic Qualifications
- *Education
- Bachelor’s degree in Computer Science, Information Technology, or a related field. Additional years of experience may be substituted in lieu of a degree.
- *Experience
- 4–6 years of experience in IT operations or infrastructure support roles.
- At least 1 year of hands-on AWS administration experience with limited supervision.
- Understanding of workload automation concepts, including job dependencies, workstations, and job streams.
- Ability to analyze logs and diagnose issues.
- Must be able to obtain and maintain a public trust clearance.
- All candidates supporting CMS programs must have lived in the United States for at least three (3) of the last five (5) years to be considered.
- *Certifications
- AWS Certified SysOps Administrator – Associate
- *Tivoli / IBM Workload Scheduler (TWS) Knowledge
- *Core Architecture
- Understanding of TWS distributed architecture, including:
- Master Domain Manager (MDM): Central scheduling authority and database.
- Dynamic Workload Console (DWC): Web-based interface for job design and monitoring.
- Agents: Including Fault-Tolerant Agents (FTA) capable of running jobs during temporary MDM communication outages.
- *AWS Integration
- Deploy and manage TWS agents on EC2 instances to execute scripts and applications.
- Support cloud-native workflow triggers using AWS Lambda or Step Functions integrated with TWS.
- Manage file-based dependencies leveraging Amazon S3.
- *Daily Operations & Maintenance
- Create, modify, and maintain job definitions, calendars, and scheduling resources.
- Troubleshoot job failures and delays by analyzing