ROCM-SMI(1) | User Commands | ROCM-SMI(1) |
NAME
rocm-smi - a tool to monitor AMD accelerators and GPUs
DESCRIPTION
usage: rocm-smi [-h] [-V] [-d DEVICE [DEVICE ...]] [--alldevices] [--showhw] [-a] [-i] [-v] [-e [EVENT ...]]
    [--showdriverversion] [--showtempgraph] [--showfwinfo [BLOCK ...]] [--showmclkrange] [--showmemvendor]
    [--showsclkrange] [--showproductname] [--showserial] [--showuniqueid] [--showvoltagerange] [--showbus]
    [--showpagesinfo] [--showpendingpages] [--showretiredpages] [--showunreservablepages]
    [-f] [-P] [-t] [-u] [--showmemuse] [--showvoltage] [-b] [-c] [-g] [-l] [-M] [-m] [-o] [-p] [-S] [-s]
    [--showmeminfo TYPE [TYPE ...]] [--showpids [VERBOSE]] [--showpidgpus [SHOWPIDGPUS ...]] [--showreplaycount]
    [--showrasinfo [SHOWRASINFO ...]] [--showvc] [--showxgmierr] [--showtopo] [--showtopoaccess] [--showtopoweight]
    [--showtopohops] [--showtopotype] [--showtoponuma] [--showenergycounter] [--shownodesbw]
    [--showcomputepartition] [--showmemorypartition]
    [-r] [--resetfans] [--resetprofile] [--resetpoweroverdrive] [--resetxgmierr] [--resetperfdeterminism]
    [--resetcomputepartition] [--resetmemorypartition]
    [--setclock TYPE LEVEL] [--setsclk LEVEL [LEVEL ...]] [--setmclk LEVEL [LEVEL ...]] [--setpcie LEVEL [LEVEL ...]]
    [--setslevel SCLKLEVEL SCLK SVOLT] [--setmlevel MCLKLEVEL MCLK MVOLT] [--setvc POINT SCLK SVOLT]
    [--setsrange SCLKMIN SCLKMAX] [--setextremum min|max sclk|mclk CLK] [--setmrange MCLKMIN MCLKMAX]
    [--setfan LEVEL] [--setperflevel LEVEL] [--setoverdrive %] [--setmemoverdrive %] [--setpoweroverdrive WATTS]
    [--setprofile SETPROFILE] [--setperfdeterminism SCLK]
    [--setcomputepartition {CPX,SPX,DPX,TPX,QPX,cpx,spx,dpx,tpx,qpx}]
    [--setmemorypartition {NPS1,NPS2,NPS4,NPS8,nps1,nps2,nps4,nps8}]
    [--rasenable BLOCK ERRTYPE] [--rasdisable BLOCK ERRTYPE] [--rasinject BLOCK] [--gpureset]
    [--load FILE | --save FILE] [--autorespond RESPONSE] [--loglevel LEVEL] [--json] [--csv]
AMD ROCm System Management Interface | ROCM-SMI version: 2.2.0
options:
- -h, --help
- show this help message and exit
- --gpureset
- Reset specified GPU (One GPU must be specified)
- --load FILE
- Load Clock, Fan, Performance and Profile settings from FILE
- --save FILE
- Save Clock, Fan, Performance and Profile settings to FILE
- -V, --version
- Show version information
- -d DEVICE [DEVICE ...], --device DEVICE [DEVICE ...]
- Execute command on specified device
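The general options above can be combined from a script. The following is a minimal Python sketch, not part of rocm-smi itself: the helper names `rocm_smi_argv` and `run_if_available` and the file name `gpu-settings.txt` are hypothetical. It only assembles argument lists from flags documented on this page and runs them if `rocm-smi` is found on `PATH`.

```python
import shutil
import subprocess


def rocm_smi_argv(*options, devices=()):
    """Return an argument list of the form: rocm-smi [-d DEVICE ...] OPTIONS."""
    argv = ["rocm-smi"]
    if devices:
        # Per the usage line, -d accepts one or more device indices.
        argv += ["-d", *(str(d) for d in devices)]
    argv += [str(o) for o in options]
    return argv


def run_if_available(argv):
    """Run argv only when the program is on PATH; otherwise return None."""
    if shutil.which(argv[0]) is None:
        return None
    return subprocess.run(argv, capture_output=True, text=True, check=False)


# Hypothetical file name. The usage line shows --load and --save as
# alternatives ([--load FILE | --save FILE]), so each gets its own call.
save_cmd = rocm_smi_argv("--save", "gpu-settings.txt", devices=[0])
load_cmd = rocm_smi_argv("--load", "gpu-settings.txt", devices=[0])
```

Passing `save_cmd` to `run_if_available` would save the settings of device 0; nothing is executed on a machine without rocm-smi.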
Display Options:
- --alldevices
- --showhw
- Show Hardware details
- -a, --showallinfo
- Show Temperature, Fan and Clock values
Topology:
- -i, --showid
- Show DEVICE IDs
- -v, --showvbios
- Show VBIOS version
- -e [EVENT ...], --showevents [EVENT ...]
- Show event list
- --showdriverversion
- Show kernel driver version
- --showtempgraph
- Show Temperature Graph
- --showfwinfo [BLOCK ...]
- Show FW information
- --showmclkrange
- Show mclk range
- --showmemvendor
- Show GPU memory vendor
- --showsclkrange
- Show sclk range
- --showproductname
- Show product details
- --showserial
- Show GPU's Serial Number
- --showuniqueid
- Show GPU's Unique ID
- --showvoltagerange
- Show voltage range
- --showbus
- Show PCI bus number
Pages information:
- --showpagesinfo
- Show retired, pending and unreservable pages
- --showpendingpages
- Show pending retired pages
- --showretiredpages
- Show retired pages
- --showunreservablepages
- Show unreservable pages
Hardware-related information:
- -f, --showfan
- Show current fan speed
- -P, --showpower
- Show current average or instant socket graphics package power consumption
- -t, --showtemp
- Show current temperature
- -u, --showuse
- Show current GPU use
- --showmemuse
- Show current GPU memory used
- --showvoltage
- Show current GPU voltage
Software-related/controlled information:
- -b, --showbw
- Show estimated PCIe use
- -c, --showclocks
- Show current clock frequencies
- -g, --showgpuclocks
- Show current GPU clock frequencies
- -l, --showprofile
- Show Compute Profile attributes
- -M, --showmaxpower
- Show maximum graphics package power this GPU will consume
- -m, --showmemoverdrive
- Show current GPU Memory Clock OverDrive level
- -o, --showoverdrive
- Show current GPU Clock OverDrive level
- -p, --showperflevel
- Show current DPM Performance Level
- -S, --showclkvolt
- Show supported GPU and Memory Clocks and Voltages
- -s, --showclkfrq
- Show supported GPU and Memory Clock
- --showmeminfo TYPE [TYPE ...]
- Show Memory usage information for given block(s) TYPE
- --showpids [VERBOSE]
- Show current running KFD PIDs (pass details to VERBOSE for detailed information)
- --showpidgpus [SHOWPIDGPUS ...]
- Show GPUs used by specified KFD PIDs (all if no arg given)
- --showreplaycount
- Show PCIe Replay Count
- --showrasinfo [SHOWRASINFO ...]
- Show RAS enablement information and error counts for the specified block(s) (all if no arg given)
- --showvc
- Show voltage curve
- --showxgmierr
- Show XGMI error information since last read
- --showtopo
- Show hardware topology information
- --showtopoaccess
- Shows the link accessibility between GPUs
- --showtopoweight
- Shows the relative weight between GPUs
- --showtopohops
- Shows the number of hops between GPUs
- --showtopotype
- Shows the link type between GPUs
- --showtoponuma
- Shows the NUMA nodes
- --showenergycounter
- Show the energy accumulator, which stores the amount of energy consumed
- --shownodesbw
- Shows the bandwidth between nodes
- --showcomputepartition
- Shows current compute partitioning
- --showmemorypartition
- Shows current memory partition
Set options:
- --setclock TYPE LEVEL
- Set Clock Frequency Level(s) for specified clock (requires manual Perf level)
- --setsclk LEVEL [LEVEL ...]
- Set GPU Clock Frequency Level(s) (requires manual Perf level)
- --setmclk LEVEL [LEVEL ...]
- Set GPU Memory Clock Frequency Level(s) (requires manual Perf level)
- --setpcie LEVEL [LEVEL ...]
- Set PCIe Clock Frequency Level(s) (requires manual Perf level)
- --setslevel SCLKLEVEL SCLK SVOLT
- Change GPU Clock frequency (MHz) and Voltage (mV) for a specific Level
- --setmlevel MCLKLEVEL MCLK MVOLT
- Change GPU Memory clock frequency (MHz) and Voltage (mV) for a specific Level
- --setvc POINT SCLK SVOLT
- Change SCLK Voltage Curve (MHz mV) for a specific point
- --setsrange SCLKMIN SCLKMAX
- Set min and max SCLK speed
- --setextremum min|max sclk|mclk CLK
- Set min/max of SCLK/MCLK speed
- --setmrange MCLKMIN MCLKMAX
- Set min and max MCLK speed
- --setfan LEVEL
- Set GPU Fan Speed (Level or %)
- --setperflevel LEVEL
- Set Performance Level
- --setoverdrive %
- Set GPU OverDrive level (requires manual|high Perf level)
- --setmemoverdrive %
- Set GPU Memory Overclock OverDrive level (requires manual|high Perf level)
- --setpoweroverdrive WATTS
- Set the maximum GPU power using Power OverDrive in Watts
- --setprofile SETPROFILE
- Specify Power Profile level (#) or a quoted string of CUSTOM Profile attributes "# # # #..." (requires manual Perf level)
- --setperfdeterminism SCLK
- Set clock frequency limit to get minimal performance variation
- --setcomputepartition {CPX,SPX,DPX,TPX,QPX,cpx,spx,dpx,tpx,qpx}
- Set compute partition
- --setmemorypartition {NPS1,NPS2,NPS4,NPS8,nps1,nps2,nps4,nps8}
- Set memory partition
- --rasenable BLOCK ERRTYPE
- Enable RAS for specified block and error type
- --rasdisable BLOCK ERRTYPE
- Disable RAS for specified block and error type
- --rasinject BLOCK
- Inject RAS poison for specified block (ONLY WORKS ON UNSECURED BOARDS)
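Several set options note that they require the manual Perf level. A minimal Python sketch of that ordering follows. It assumes the LEVEL value for `--setperflevel` is spelled `manual`, as the wording of the descriptions suggests; the helper name `manual_sclk_commands` and the level numbers `1` and `2` are purely illustrative. The sketch only builds and prints the command lines and does not change any GPU state.

```python
def manual_sclk_commands(device, levels):
    """Build two commands: switch to the manual perf level, then set SCLK levels."""
    if not levels:
        raise ValueError("--setsclk needs at least one LEVEL")
    target = ["rocm-smi", "-d", str(device)]
    return [
        # Assumption: "manual" is the accepted spelling of the perf level.
        target + ["--setperflevel", "manual"],
        target + ["--setsclk", *(str(level) for level in levels)],
    ]


# Illustrative values only: device 0, SCLK levels 1 and 2.
for cmd in manual_sclk_commands(0, [1, 2]):
    print(" ".join(cmd))
```

Which levels are valid depends on the GPU; `-s, --showclkfrq` lists the supported clocks for a device.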
Reset options:
- -r, --resetclocks
- Reset clocks and OverDrive to default
- --resetfans
- Reset fans to automatic (driver) control
- --resetprofile
- Reset Power Profile back to default
- --resetpoweroverdrive
- Set the maximum GPU power back to the device default state
- --resetxgmierr
- Reset XGMI error count
- --resetperfdeterminism
- Disable performance determinism
- --resetcomputepartition
- Resets to boot compute partition state
- --resetmemorypartition
- Resets to boot memory partition state
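After experimenting with the set options, the reset options above can return a device to its defaults. The sketch below, in Python, builds one command per reset flag for a single device. The helper name `reset_commands` and the choice of which flags to include are assumptions for illustration; issuing one flag per invocation is a conservative choice, not a documented requirement.

```python
# Reset flags taken from the list above; adjust the selection as needed.
RESET_FLAGS = (
    "--resetclocks",
    "--resetfans",
    "--resetprofile",
    "--resetpoweroverdrive",
)


def reset_commands(device, flags=RESET_FLAGS):
    """Return one rocm-smi argument list per reset flag for the given device."""
    return [["rocm-smi", "-d", str(device), flag] for flag in flags]


for cmd in reset_commands(0):
    print(" ".join(cmd))
```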
Auto-response options:
- --autorespond RESPONSE
- Response to automatically provide for all prompts (NOT RECOMMENDED)
Output options:
- --loglevel LEVEL
- How much output is printed about what the program is doing; one of debug/info/warning/error/critical
- --json
- Print output in JSON format
- --csv
- Print output in CSV format
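The `--json` option makes the output easier to consume from scripts. The Python sketch below builds a temperature query and parses the result with the standard `json` module. This page does not describe the JSON layout, so the sketch assumes only that the top level is a JSON object. The helper names are hypothetical, and the `sample` string is a made-up placeholder, not real rocm-smi output.

```python
import json


def temp_query_argv(devices=()):
    """Build: rocm-smi [-d DEVICE ...] --showtemp --json."""
    argv = ["rocm-smi"]
    if devices:
        argv += ["-d", *(str(d) for d in devices)]
    return argv + ["--showtemp", "--json"]


def parse_report(text):
    """Parse --json output; assumes (unverified) a JSON object at the top level."""
    report = json.loads(text)
    if not isinstance(report, dict):
        raise ValueError("expected a JSON object at the top level")
    return report


# Made-up placeholder input; real key names depend on the rocm-smi version.
sample = '{"card0": {"example field": "example value"}}'
for name, fields in parse_report(sample).items():
    print(name, fields)
```

In practice the text passed to `parse_report` would be the standard output of the command returned by `temp_query_argv`, for example captured with `subprocess.run(..., capture_output=True, text=True)`.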
SEE ALSO
The full documentation for rocm-smi is maintained as a Texinfo manual. If the info and rocm-smi programs are properly installed at your site, the command
- info rocm-smi
should give you access to the complete manual.
November 2024 | rocm-smi 2.2.0 |