blob: 3c9a8c4a25eb89a87448f18fd7b550b4c5e74c55 [file] [log] [blame]
Andrew Scullb4b6d4a2019-01-02 15:54:55 +00001==========================
2PCIe Device AER statistics
3==========================
4These attributes show up under all the devices that are AER capable. These
5statistical counters indicate the errors "as seen/reported by the device".
6Note that this may mean that if an endpoint is causing problems, the AER
7counters may increment at its link partner (e.g. root port) because the
8errors may be "seen" / reported by the link partner and not the
9problematic endpoint itself (which may report all counters as 0 as it never
10saw any problems).
11
David Brazdil0f672f62019-12-10 10:32:29 +000012What: /sys/bus/pci/devices/<dev>/aer_dev_correctable
Andrew Scullb4b6d4a2019-01-02 15:54:55 +000013Date: July 2018
David Brazdil0f672f62019-12-10 10:32:29 +000014KernelVersion: 4.19.0
Andrew Scullb4b6d4a2019-01-02 15:54:55 +000015Contact: linux-pci@vger.kernel.org, rajatja@google.com
16Description: List of correctable errors seen and reported by this
17 PCI device using ERR_COR. Note that since multiple errors may
18 be reported using a single ERR_COR message, thus
19 TOTAL_ERR_COR at the end of the file may not match the actual
20 total of all the errors in the file. Sample output:
21-------------------------------------------------------------------------
22localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_correctable
23Receiver Error 2
24Bad TLP 0
25Bad DLLP 0
26RELAY_NUM Rollover 0
27Replay Timer Timeout 0
28Advisory Non-Fatal 0
29Corrected Internal Error 0
30Header Log Overflow 0
31TOTAL_ERR_COR 2
32-------------------------------------------------------------------------
33
David Brazdil0f672f62019-12-10 10:32:29 +000034What: /sys/bus/pci/devices/<dev>/aer_dev_fatal
Andrew Scullb4b6d4a2019-01-02 15:54:55 +000035Date: July 2018
David Brazdil0f672f62019-12-10 10:32:29 +000036KernelVersion: 4.19.0
Andrew Scullb4b6d4a2019-01-02 15:54:55 +000037Contact: linux-pci@vger.kernel.org, rajatja@google.com
38Description: List of uncorrectable fatal errors seen and reported by this
39 PCI device using ERR_FATAL. Note that since multiple errors may
40 be reported using a single ERR_FATAL message, thus
41 TOTAL_ERR_FATAL at the end of the file may not match the actual
42 total of all the errors in the file. Sample output:
43-------------------------------------------------------------------------
44localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_fatal
45Undefined 0
46Data Link Protocol 0
47Surprise Down Error 0
48Poisoned TLP 0
49Flow Control Protocol 0
50Completion Timeout 0
51Completer Abort 0
52Unexpected Completion 0
53Receiver Overflow 0
54Malformed TLP 0
55ECRC 0
56Unsupported Request 0
57ACS Violation 0
58Uncorrectable Internal Error 0
59MC Blocked TLP 0
60AtomicOp Egress Blocked 0
61TLP Prefix Blocked Error 0
62TOTAL_ERR_FATAL 0
63-------------------------------------------------------------------------
64
David Brazdil0f672f62019-12-10 10:32:29 +000065What: /sys/bus/pci/devices/<dev>/aer_dev_nonfatal
Andrew Scullb4b6d4a2019-01-02 15:54:55 +000066Date: July 2018
David Brazdil0f672f62019-12-10 10:32:29 +000067KernelVersion: 4.19.0
Andrew Scullb4b6d4a2019-01-02 15:54:55 +000068Contact: linux-pci@vger.kernel.org, rajatja@google.com
69Description: List of uncorrectable nonfatal errors seen and reported by this
70 PCI device using ERR_NONFATAL. Note that since multiple errors
71 may be reported using a single ERR_FATAL message, thus
72 TOTAL_ERR_NONFATAL at the end of the file may not match the
73 actual total of all the errors in the file. Sample output:
74-------------------------------------------------------------------------
75localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_nonfatal
76Undefined 0
77Data Link Protocol 0
78Surprise Down Error 0
79Poisoned TLP 0
80Flow Control Protocol 0
81Completion Timeout 0
82Completer Abort 0
83Unexpected Completion 0
84Receiver Overflow 0
85Malformed TLP 0
86ECRC 0
87Unsupported Request 0
88ACS Violation 0
89Uncorrectable Internal Error 0
90MC Blocked TLP 0
91AtomicOp Egress Blocked 0
92TLP Prefix Blocked Error 0
93TOTAL_ERR_NONFATAL 0
94-------------------------------------------------------------------------
95
96============================
97PCIe Rootport AER statistics
98============================
99These attributes show up under only the rootports (or root complex event
100collectors) that are AER capable. These indicate the number of error messages as
101"reported to" the rootport. Please note that the rootports also transmit
102(internally) the ERR_* messages for errors seen by the internal rootport PCI
103device, so these counters include them and are thus cumulative of all the error
104messages on the PCI hierarchy originating at that root port.
105
David Brazdil0f672f62019-12-10 10:32:29 +0000106What: /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_cor
Andrew Scullb4b6d4a2019-01-02 15:54:55 +0000107Date: July 2018
David Brazdil0f672f62019-12-10 10:32:29 +0000108KernelVersion: 4.19.0
Andrew Scullb4b6d4a2019-01-02 15:54:55 +0000109Contact: linux-pci@vger.kernel.org, rajatja@google.com
110Description: Total number of ERR_COR messages reported to rootport.
111
David Brazdil0f672f62019-12-10 10:32:29 +0000112What: /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_fatal
Andrew Scullb4b6d4a2019-01-02 15:54:55 +0000113Date: July 2018
David Brazdil0f672f62019-12-10 10:32:29 +0000114KernelVersion: 4.19.0
Andrew Scullb4b6d4a2019-01-02 15:54:55 +0000115Contact: linux-pci@vger.kernel.org, rajatja@google.com
116Description: Total number of ERR_FATAL messages reported to rootport.
117
David Brazdil0f672f62019-12-10 10:32:29 +0000118What: /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_nonfatal
Andrew Scullb4b6d4a2019-01-02 15:54:55 +0000119Date: July 2018
David Brazdil0f672f62019-12-10 10:32:29 +0000120KernelVersion: 4.19.0
Andrew Scullb4b6d4a2019-01-02 15:54:55 +0000121Contact: linux-pci@vger.kernel.org, rajatja@google.com
122Description: Total number of ERR_NONFATAL messages reported to rootport.