162306a36Sopenharmony_ciPCIe Device AER statistics 262306a36Sopenharmony_ci-------------------------- 362306a36Sopenharmony_ci 462306a36Sopenharmony_ciThese attributes show up under all the devices that are AER capable. These 562306a36Sopenharmony_cistatistical counters indicate the errors "as seen/reported by the device". 662306a36Sopenharmony_ciNote that this may mean that if an endpoint is causing problems, the AER 762306a36Sopenharmony_cicounters may increment at its link partner (e.g. root port) because the 862306a36Sopenharmony_cierrors may be "seen" / reported by the link partner and not the 962306a36Sopenharmony_ciproblematic endpoint itself (which may report all counters as 0 as it never 1062306a36Sopenharmony_cisaw any problems). 1162306a36Sopenharmony_ci 1262306a36Sopenharmony_ciWhat: /sys/bus/pci/devices/<dev>/aer_dev_correctable 1362306a36Sopenharmony_ciDate: July 2018 1462306a36Sopenharmony_ciKernelVersion: 4.19.0 1562306a36Sopenharmony_ciContact: linux-pci@vger.kernel.org, rajatja@google.com 1662306a36Sopenharmony_ciDescription: List of correctable errors seen and reported by this 1762306a36Sopenharmony_ci PCI device using ERR_COR. Note that since multiple errors may 1862306a36Sopenharmony_ci be reported using a single ERR_COR message, thus 1962306a36Sopenharmony_ci TOTAL_ERR_COR at the end of the file may not match the actual 2062306a36Sopenharmony_ci total of all the errors in the file. Sample output:: 2162306a36Sopenharmony_ci 2262306a36Sopenharmony_ci localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_correctable 2362306a36Sopenharmony_ci Receiver Error 2 2462306a36Sopenharmony_ci Bad TLP 0 2562306a36Sopenharmony_ci Bad DLLP 0 2662306a36Sopenharmony_ci RELAY_NUM Rollover 0 2762306a36Sopenharmony_ci Replay Timer Timeout 0 2862306a36Sopenharmony_ci Advisory Non-Fatal 0 2962306a36Sopenharmony_ci Corrected Internal Error 0 3062306a36Sopenharmony_ci Header Log Overflow 0 3162306a36Sopenharmony_ci TOTAL_ERR_COR 2 3262306a36Sopenharmony_ci 3362306a36Sopenharmony_ciWhat: /sys/bus/pci/devices/<dev>/aer_dev_fatal 3462306a36Sopenharmony_ciDate: July 2018 3562306a36Sopenharmony_ciKernelVersion: 4.19.0 3662306a36Sopenharmony_ciContact: linux-pci@vger.kernel.org, rajatja@google.com 3762306a36Sopenharmony_ciDescription: List of uncorrectable fatal errors seen and reported by this 3862306a36Sopenharmony_ci PCI device using ERR_FATAL. Note that since multiple errors may 3962306a36Sopenharmony_ci be reported using a single ERR_FATAL message, thus 4062306a36Sopenharmony_ci TOTAL_ERR_FATAL at the end of the file may not match the actual 4162306a36Sopenharmony_ci total of all the errors in the file. Sample output:: 4262306a36Sopenharmony_ci 4362306a36Sopenharmony_ci localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_fatal 4462306a36Sopenharmony_ci Undefined 0 4562306a36Sopenharmony_ci Data Link Protocol 0 4662306a36Sopenharmony_ci Surprise Down Error 0 4762306a36Sopenharmony_ci Poisoned TLP 0 4862306a36Sopenharmony_ci Flow Control Protocol 0 4962306a36Sopenharmony_ci Completion Timeout 0 5062306a36Sopenharmony_ci Completer Abort 0 5162306a36Sopenharmony_ci Unexpected Completion 0 5262306a36Sopenharmony_ci Receiver Overflow 0 5362306a36Sopenharmony_ci Malformed TLP 0 5462306a36Sopenharmony_ci ECRC 0 5562306a36Sopenharmony_ci Unsupported Request 0 5662306a36Sopenharmony_ci ACS Violation 0 5762306a36Sopenharmony_ci Uncorrectable Internal Error 0 5862306a36Sopenharmony_ci MC Blocked TLP 0 5962306a36Sopenharmony_ci AtomicOp Egress Blocked 0 6062306a36Sopenharmony_ci TLP Prefix Blocked Error 0 6162306a36Sopenharmony_ci TOTAL_ERR_FATAL 0 6262306a36Sopenharmony_ci 6362306a36Sopenharmony_ciWhat: /sys/bus/pci/devices/<dev>/aer_dev_nonfatal 6462306a36Sopenharmony_ciDate: July 2018 6562306a36Sopenharmony_ciKernelVersion: 4.19.0 6662306a36Sopenharmony_ciContact: linux-pci@vger.kernel.org, rajatja@google.com 6762306a36Sopenharmony_ciDescription: List of uncorrectable nonfatal errors seen and reported by this 6862306a36Sopenharmony_ci PCI device using ERR_NONFATAL. Note that since multiple errors 6962306a36Sopenharmony_ci may be reported using a single ERR_FATAL message, thus 7062306a36Sopenharmony_ci TOTAL_ERR_NONFATAL at the end of the file may not match the 7162306a36Sopenharmony_ci actual total of all the errors in the file. Sample output:: 7262306a36Sopenharmony_ci 7362306a36Sopenharmony_ci localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_nonfatal 7462306a36Sopenharmony_ci Undefined 0 7562306a36Sopenharmony_ci Data Link Protocol 0 7662306a36Sopenharmony_ci Surprise Down Error 0 7762306a36Sopenharmony_ci Poisoned TLP 0 7862306a36Sopenharmony_ci Flow Control Protocol 0 7962306a36Sopenharmony_ci Completion Timeout 0 8062306a36Sopenharmony_ci Completer Abort 0 8162306a36Sopenharmony_ci Unexpected Completion 0 8262306a36Sopenharmony_ci Receiver Overflow 0 8362306a36Sopenharmony_ci Malformed TLP 0 8462306a36Sopenharmony_ci ECRC 0 8562306a36Sopenharmony_ci Unsupported Request 0 8662306a36Sopenharmony_ci ACS Violation 0 8762306a36Sopenharmony_ci Uncorrectable Internal Error 0 8862306a36Sopenharmony_ci MC Blocked TLP 0 8962306a36Sopenharmony_ci AtomicOp Egress Blocked 0 9062306a36Sopenharmony_ci TLP Prefix Blocked Error 0 9162306a36Sopenharmony_ci TOTAL_ERR_NONFATAL 0 9262306a36Sopenharmony_ci 9362306a36Sopenharmony_ciPCIe Rootport AER statistics 9462306a36Sopenharmony_ci---------------------------- 9562306a36Sopenharmony_ci 9662306a36Sopenharmony_ciThese attributes show up under only the rootports (or root complex event 9762306a36Sopenharmony_cicollectors) that are AER capable. These indicate the number of error messages as 9862306a36Sopenharmony_ci"reported to" the rootport. Please note that the rootports also transmit 9962306a36Sopenharmony_ci(internally) the ERR_* messages for errors seen by the internal rootport PCI 10062306a36Sopenharmony_cidevice, so these counters include them and are thus cumulative of all the error 10162306a36Sopenharmony_cimessages on the PCI hierarchy originating at that root port. 10262306a36Sopenharmony_ci 10362306a36Sopenharmony_ciWhat: /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_cor 10462306a36Sopenharmony_ciDate: July 2018 10562306a36Sopenharmony_ciKernelVersion: 4.19.0 10662306a36Sopenharmony_ciContact: linux-pci@vger.kernel.org, rajatja@google.com 10762306a36Sopenharmony_ciDescription: Total number of ERR_COR messages reported to rootport. 10862306a36Sopenharmony_ci 10962306a36Sopenharmony_ciWhat: /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_fatal 11062306a36Sopenharmony_ciDate: July 2018 11162306a36Sopenharmony_ciKernelVersion: 4.19.0 11262306a36Sopenharmony_ciContact: linux-pci@vger.kernel.org, rajatja@google.com 11362306a36Sopenharmony_ciDescription: Total number of ERR_FATAL messages reported to rootport. 11462306a36Sopenharmony_ci 11562306a36Sopenharmony_ciWhat: /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_nonfatal 11662306a36Sopenharmony_ciDate: July 2018 11762306a36Sopenharmony_ciKernelVersion: 4.19.0 11862306a36Sopenharmony_ciContact: linux-pci@vger.kernel.org, rajatja@google.com 11962306a36Sopenharmony_ciDescription: Total number of ERR_NONFATAL messages reported to rootport. 120