Feature #979

Add GPU Temperature to xymon

Added by bobmyth over 9 years ago. Updated over 9 years ago.

Status:Closed Start date:10/14/2014
Priority:Low Due date:
Assignee:- % Done:

100%

Category:- Spent time: 3.00 hours
Target version:8.2

Description

I've created and attached a script for monitoring GPU temperature in xymon. This only works for an NVidia based card with the nvidia drivers installed and the nvidia-smi utility. The script will exit silently if these conditions are not met.

The script attempts to identify the model of the video card and then determine the maximum temperature NVidia recommends.

gputemp (1.7 kB) bobmyth, 10/14/2014 08:28 pm

gputemp (1.8 kB) bobmyth, 10/20/2014 07:41 pm

gputemp (1.9 kB) bobmyth, 10/24/2014 03:41 pm

Associated revisions

Revision 525bfed2
Added by jams over 9 years ago

xymon: add gputemp.sh, fix permissions, and add some extra try/except to hbnotes
refs #979

History

Updated by jams over 9 years ago

This version will not work with LinHES 8.2 because of a change in location for proc nvidia.
Any reason that nvidia-smi wasn't used to determine the model?

Updated by brfransen over 9 years ago

  • Status changed from New to Feedback

Updated by bobmyth over 9 years ago

Good feedback, I've attached a newer version that does not rely on /proc anymore and uses just nvidia-smi. I let it run overnight and it appears to be working well.

Updated by bobmyth over 9 years ago

I just installed a GeForce GT 630, which is not recognized by the NVidia driver in LinHes 8.1. This case causing no temperature to be logged even though the card would report a temperature. This has been fixed.

Updated by jams over 9 years ago

  • % Done changed from 0 to 90
  • Status changed from Feedback to In Progress
  • Target version set to 8.2

Updated by jams over 9 years ago

  • Status changed from In Progress to Closed

Made a few minor modifications and added it to 8.2

Updated by jams over 9 years ago

  • % Done changed from 90 to 100

Also available in: Atom PDF