- Notifications
You must be signed in to change notification settings - Fork9
Tracknodes keeps a history of node state and comment changes. It allows system administrators of HPC systems to determine when nodes were down and discover trends such as recurring issues. Supports Torque, PBSpro and SLURM.
License
NREL/tracknodes
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Tracknodes keeps a history of node state and comment changes. It allows system administrators of HPC systems to determine when nodes were down and discover trends such as recurring issues. Supports Torque, PBSpro and has limited support for SLURM.
$ pip install tracknodes
or
$ easy_install tracknodes
Setup a cronjob on an admin node. This step is required for node state changes to be tracked.
$ crontab -u root -e# Track Node State Every Minute***** (/usr/bin/tracknodes --update>/dev/null2>&1)
Use the below command to see the history of node changes.
$ tracknodesHistory of Nodes=========n101| 2016-11-28 21:30:01| online|''n101| 2016-11-28 20:30:01| offline,down|'Hardware issue bad DIMM'n092| 2016-11-27 19:30:01| online|''n092| 2016-11-27 12:00:01| offline|'Hardware issue failed disk'n021| 2016-11-27 09:00:01| online|''n021| 2016-11-26 19:00:01| offline,down|'DIMM Configuration Error'-- --
You can setup the configuration file for tracknodes to change the database location or the command to get node status. Use the below as an example.
$ cat /etc/tracknodes.conf---dbfile:"/opt/tracknodes.db"cmd:"/opt/pbsnodes"
Tracknodes uses a sqlite database to store the node history, you can determine what database its using with the -v argument.
$ tracknodes -vResource Manager Detected as torquecmd: /opt/pbsnodesdbfile:~/.tracknodes.db...
For usage information you can use --help.
$ tracknodes --helpUsage: tracknodes [options]Options: -h, --help show thishelp message andexit -U, --update Update Database From Current Node States -f DBFILE, --dbfile=DBFILE Database File -c CMD, --cmd=CMD Location ofcommand to show node state, example: /opt/pbsnodes, /opt/sinfo -v, --verbose Verbose Output
tracknodes is released under theGPLv3 License.
About
Tracknodes keeps a history of node state and comment changes. It allows system administrators of HPC systems to determine when nodes were down and discover trends such as recurring issues. Supports Torque, PBSpro and SLURM.