Zewura workload log
Here you can find the data set generated from TORQUE traces during the first five months of the year 2012.
The data set contains:
- job descriptions (17,256 jobs)
Usage rules
The Zewura workload log was graciously provided by the Czech National Grid Infrastructure
MetaCentrum.
If you use this log in your work,
please use a similar acknowledgment.
File format
Job description
The job log is in Standard Workload Format (SWF).
The log can be obtained at:
job descriptions [1.2 MB, SWF file]
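Reading an SWF log can be sketched as follows. This is a minimal example, assuming the usual SWF layout: header lines start with `;` and each job line holds 18 whitespace-separated fields. The `SwfJob` type, the `parse_swf` function and the `SAMPLE` line are hypothetical names and data made up for illustration; they are not taken from the Zewura log.

```python
from typing import Iterable, Iterator, NamedTuple


class SwfJob(NamedTuple):
    """A subset of the 18 SWF fields (hypothetical helper type)."""
    job_number: int            # SWF field 1
    submit_time: int           # SWF field 2, seconds
    wait_time: int             # SWF field 3, seconds
    run_time: int              # SWF field 4, seconds
    allocated_processors: int  # SWF field 5
    status: int                # SWF field 11
    user_id: int               # SWF field 12
    queue_number: int          # SWF field 15


def parse_swf(lines: Iterable[str]) -> Iterator[SwfJob]:
    """Yield one SwfJob per job line, skipping ';' header lines and blanks."""
    for line in lines:
        line = line.strip()
        if not line or line.startswith(";"):
            continue
        fields = line.split()
        if len(fields) != 18:
            raise ValueError(f"expected 18 SWF fields, got {len(fields)}")
        # int(float(...)) tolerates values written as "5.0" in some logs.
        v = [int(float(x)) for x in fields]
        yield SwfJob(v[0], v[1], v[2], v[3], v[4], v[10], v[11], v[14])


# Made-up sample, NOT a line from the real log.
SAMPLE = """; Version: 2.2
1 0 5 92810 16 -1 2989520 16 -1 -1 1 43 -1 -1 2 -1 -1 -1
"""

jobs = list(parse_swf(SAMPLE.splitlines()))
print(jobs[0].run_time, jobs[0].allocated_processors)
```

For the real file, pass an open file handle instead of `SAMPLE.splitlines()`.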
MetaCentrum data sets
Here you can find the data sets generated from PBSpro traces during the first five months of the year 2009.
The data sets contain:
- job descriptions (103,656 jobs)
- node descriptions (14 clusters, 806 CPUs)
- list of queues
- descriptions of machines in maintenance (failures & upgrades)
- descriptions of dedicated and reserved machines
- SPEC CPU2006 benchmark results of MetaCentrum clusters
Usage rules
The MetaCentrum workload log was graciously provided by the Czech National Grid Infrastructure
MetaCentrum.
If you use this log in your work,
please use a similar acknowledgment.
File format
Job description
| value | type | meaning | example |
|---|---|---|---|
| job_id | (int) | job id | 123 |
| user | (String) | job owner | user_43 |
| queue_name | (String) | queue where the job was originally submitted | q2 |
| used_CPUs | (int) | number of used CPUs | 16 |
| used_nodes | (int) | number of used nodes | 2 |
| required_properties | [String;String;...] | required properties that must be available on the target machine | [p1;p2;p14] |
| used_main_memory(kB) | (int) | used main memory in kB | 2989520 |
| arrival_time | (int) | job arrival time, in epoch format | 1230675195 |
| start_time | (int) | job start time, in epoch format | 1230675200 |
| end_time | (int) | job end time, in epoch format | 1230768010 |
| duration | (int) | job duration, in seconds | 92810 |
| exit_status | (int) | usually the exit status of the shell executing the job script | 1 |
| id(s)_of_assigned_CPUs | list of ints, separated by spaces | CPUs where the job was executed (see Node descriptions) | 16 17 18 19 20 21 |
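The example values in the table are consistent with each other: the duration equals end_time minus start_time. The sketch below checks this and decodes the two list-valued fields. The helper names `parse_properties` and `parse_cpu_ids` are hypothetical, and the separator between the fields of a job record is not stated here, so the code works on single field values only.

```python
def parse_properties(field: str) -> list:
    """Decode a value such as '[p1;p2;p14]' into ['p1', 'p2', 'p14']."""
    inner = field.strip().strip("[]")
    return [p for p in inner.split(";") if p]


def parse_cpu_ids(field: str) -> list:
    """Decode a space-separated list such as '16 17 18' into ints."""
    return [int(x) for x in field.split()]


# Example values copied from the table above.
arrival_time = 1230675195
start_time = 1230675200
end_time = 1230768010

wait_time = start_time - arrival_time  # seconds spent waiting in the queue
duration = end_time - start_time       # seconds of execution

properties = parse_properties("[p1;p2;p14]")
cpu_ids = parse_cpu_ids("16 17 18 19 20 21")

print(wait_time, duration)  # 5 92810
print(properties, cpu_ids)
```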
Node descriptions
| value | type | meaning | example |
|---|---|---|---|
| node_id | (int) | id of the node | 11 |
| node_name | (String) | node name | cluster_9 |
| cpu_speed(MHz) | (int) | CPU speed in MHz | 2400 |
| main_memory_size(kB) | (int) | RAM size in kB of one machine of the node | 4000000 |
| CPU_type | (String) | CPU type | Xeon |
| operating_system | (String) | OS | linux |
| list_of_supported_properties | (list of Strings separated by commas) | list of supported properties | p2,p12,p25,p10,p22,p9 |
| number_of_machines | (int) | the number of machines within this node | 3 |
| total_number_of_CPUs | (int) | the number of CPUs within this node | 8 |
| list_of_number_of_CPUs_on_each_machine | (list of ints separated by commas) | each int is the number of CPUs of one machine of the node | 2,2,4 |
| list_of_CPUs_ids | (list of ints separated by commas) | CPU ids corresponding to the ids stored in the job description file | 16,17,18,19,20,21,22,23 |
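The last two fields can be combined to see which CPU ids belong to which machine of a node. The sketch below is a guess at that mapping: it assumes that the ids in list_of_CPUs_ids are handed out to the machines in the order given by list_of_number_of_CPUs_on_each_machine, which the table does not state explicitly. `group_cpus_by_machine` is a hypothetical helper name.

```python
def group_cpus_by_machine(cpus_per_machine: str, cpu_ids: str) -> list:
    """Split the node's CPU ids into one list per machine.

    ASSUMPTION: ids are assigned to machines in list order.
    """
    counts = [int(c) for c in cpus_per_machine.split(",")]
    ids = [int(c) for c in cpu_ids.split(",")]
    if sum(counts) != len(ids):
        raise ValueError("CPU counts do not add up to the number of CPU ids")
    machines, offset = [], 0
    for n in counts:
        machines.append(ids[offset:offset + n])
        offset += n
    return machines


# Example values copied from the table above.
machines = group_cpus_by_machine("2,2,4", "16,17,18,19,20,21,22,23")
print(machines)  # [[16, 17], [18, 19], [20, 21, 22, 23]]

# CPUs 16..21 from the job description example all belong to this node.
node_cpus = {cpu for machine in machines for cpu in machine}
job_cpus = [16, 17, 18, 19, 20, 21]
print(all(cpu in node_cpus for cpu in job_cpus))  # True
```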
List of queues
| value | type | meaning | example |
|---|---|---|---|
| queue_name | (String) | queue name as it appears in the job description | q3 |
| queue_priority | (int) | priority of this queue, used by the PBSpro scheduler to define the order of queue selection during the scheduling process (higher number = queue is chosen sooner) | 50 |
| time limit | (int) | job time limit in hours. If the execution time of a job submitted into this queue exceeds this limit, the job is killed. | 24 |
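The two rules described in the table (queues with a higher priority are chosen sooner, and a job running longer than the queue's time limit is killed) can be sketched as below. Only the q3 values (priority 50, limit 24 hours) come from the table; the other two queues and all names (`Queue`, `selection_order`, `exceeds_limit`) are hypothetical and serve only to illustrate the ordering.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Queue:
    name: str
    priority: int
    time_limit_hours: int


def selection_order(queues):
    """Queues in the order they would be considered: highest priority first."""
    return sorted(queues, key=lambda q: q.priority, reverse=True)


def exceeds_limit(queue: Queue, duration_seconds: int) -> bool:
    """True if a job of this duration would run past the queue's time limit."""
    return duration_seconds > queue.time_limit_hours * 3600


queues = [
    Queue("q3", 50, 24),       # values from the table above
    Queue("q_hypo_a", 70, 2),  # made-up queue, for illustration only
    Queue("q_hypo_b", 30, 720),  # made-up queue, for illustration only
]

order = [q.name for q in selection_order(queues)]
print(order)  # ['q_hypo_a', 'q3', 'q_hypo_b']

# A 92810 s job is longer than 24 h (86400 s).
print(exceeds_limit(queues[0], 92810))  # True
```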
Maintenance & reservations & dedicated machines
| value | type | meaning | example |
|---|---|---|---|
| start_time | (int) | time when the machines failed or became reserved / dedicated / unavailable (in epoch format) | 1231771012 |
| node_name | (String) | name of the node | cluster_8 |
| duration | (int) | the duration of the failure / reservation / ... (in seconds) | 11231 |
| affected_machines | list of machine ids (ints) separated by spaces | machines of the node that failed / were reserved / ... | 2 3 4 |
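A record of this kind describes a time window during which some machines of a node cannot be used. The sketch below turns the example record into such a window and tests a machine against it. The `Outage` type and `is_unavailable` function are hypothetical names, and treating the window as half-open (start included, end excluded) is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Outage:
    start_time: int           # epoch seconds
    node_name: str
    duration: int             # seconds
    affected_machines: tuple  # machine ids within the node

    @property
    def end_time(self) -> int:
        return self.start_time + self.duration


def is_unavailable(outage: Outage, node_name: str, machine_id: int, t: int) -> bool:
    """True if the machine is covered by the outage at epoch time t.

    ASSUMPTION: the window is [start_time, end_time).
    """
    return (
        node_name == outage.node_name
        and machine_id in outage.affected_machines
        and outage.start_time <= t < outage.end_time
    )


# Example values copied from the table above.
outage = Outage(
    start_time=1231771012,
    node_name="cluster_8",
    duration=11231,
    affected_machines=tuple(int(m) for m in "2 3 4".split()),
)

print(outage.end_time)  # 1231782243
print(is_unavailable(outage, "cluster_8", 3, 1231775000))  # True
print(is_unavailable(outage, "cluster_8", 1, 1231775000))  # False
```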
Distribution of jobs into queues
| queue name | # of jobs |
|---|---|
| q3 | 78183 |
| q2 | 13059 |
| q4 | 5845 |
| q1 | 4629 |
| q5 | 720 |
| q6 | 696 |
| q9 | 282 |
| q10 | 123 |
| q7 | 113 |
| q11 | 6 |
Job parallelism = 14.06% (14,578 of the 103,656 jobs used more than one CPU)
| used CPUs | # of jobs |
|---|---|
| 1 | 89078 |
| 4 | 6330 |
| 2 | 4234 |
| 8 | 1689 |
| 3 | 963 |
| 16 | 671 |
| 12 | 187 |
| 6 | 124 |
| 32 | 98 |
| 20 | 86 |
| 24 | 41 |
| 5 | 40 |
| 10 | 38 |
| 28 | 28 |
| 15 | 12 |
| 14 | 10 |
| 7 | 8 |
| 13 | 4 |
| 21 | 3 |
| 18 | 3 |
| 60 | 2 |
| 11 | 2 |
| 30 | 2 |
| 31 | 1 |
| 22 | 1 |
| 9 | 1 |
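The job parallelism figure can be reproduced from the two tables. The sketch below reads it as the share of jobs that used more than one CPU, which is an interpretation, although it matches the stated value. All counts are copied from the tables; the variable names are arbitrary.

```python
# used CPUs -> number of jobs, copied from the table above
cpu_histogram = {
    1: 89078, 4: 6330, 2: 4234, 8: 1689, 3: 963, 16: 671, 12: 187,
    6: 124, 32: 98, 20: 86, 24: 41, 5: 40, 10: 38, 28: 28, 15: 12,
    14: 10, 7: 8, 13: 4, 21: 3, 18: 3, 60: 2, 11: 2, 30: 2, 31: 1,
    22: 1, 9: 1,
}

# queue name -> number of jobs, copied from the queue distribution table
queue_histogram = {
    "q3": 78183, "q2": 13059, "q4": 5845, "q1": 4629, "q5": 720,
    "q6": 696, "q9": 282, "q10": 123, "q7": 113, "q11": 6,
}

total_jobs = sum(cpu_histogram.values())
parallel_jobs = total_jobs - cpu_histogram[1]  # jobs with more than one CPU
parallelism_percent = 100 * parallel_jobs / total_jobs

print(total_jobs)                      # 103656
print(parallel_jobs)                   # 14578
print(round(parallelism_percent, 2))   # 14.06

# Both tables account for the same number of jobs.
print(sum(queue_histogram.values()) == total_jobs)  # True
```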
SPEC CPU2006 benchmarks of MetaCentrum clusters
Coming soon.
Download
job descriptions [2.05 MB, zip archive]
node descriptions [txt file]
queue descriptions [txt file]
maintenance descriptions [txt file]
dedicated machines descriptions [txt file]
reserved machines descriptions [txt file]
If you have any questions, please contact me via the e-mail below.