sysfs-bus-pci-devices-aer_stats 4.6 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122
  1. ==========================
  2. PCIe Device AER statistics
  3. ==========================
  4. These attributes show up under all the devices that are AER capable. These
  5. statistical counters indicate the errors "as seen/reported by the device".
  6. Note that this may mean that if an endpoint is causing problems, the AER
  7. counters may increment at its link partner (e.g. root port) because the
  8. errors may be "seen" / reported by the link partner and not the
  9. problematic endpoint itself (which may report all counters as 0 as it never
  10. saw any problems).
  11. Where: /sys/bus/pci/devices/<dev>/aer_dev_correctable
  12. Date: July 2018
  13. Kernel Version: 4.19.0
  14. Contact: linux-pci@vger.kernel.org, rajatja@google.com
  15. Description: List of correctable errors seen and reported by this
  16. PCI device using ERR_COR. Note that since multiple errors may
  17. be reported using a single ERR_COR message, thus
  18. TOTAL_ERR_COR at the end of the file may not match the actual
  19. total of all the errors in the file. Sample output:
  20. -------------------------------------------------------------------------
  21. localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_correctable
  22. Receiver Error 2
  23. Bad TLP 0
  24. Bad DLLP 0
  25. RELAY_NUM Rollover 0
  26. Replay Timer Timeout 0
  27. Advisory Non-Fatal 0
  28. Corrected Internal Error 0
  29. Header Log Overflow 0
  30. TOTAL_ERR_COR 2
  31. -------------------------------------------------------------------------
  32. Where: /sys/bus/pci/devices/<dev>/aer_dev_fatal
  33. Date: July 2018
  34. Kernel Version: 4.19.0
  35. Contact: linux-pci@vger.kernel.org, rajatja@google.com
  36. Description: List of uncorrectable fatal errors seen and reported by this
  37. PCI device using ERR_FATAL. Note that since multiple errors may
  38. be reported using a single ERR_FATAL message, thus
  39. TOTAL_ERR_FATAL at the end of the file may not match the actual
  40. total of all the errors in the file. Sample output:
  41. -------------------------------------------------------------------------
  42. localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_fatal
  43. Undefined 0
  44. Data Link Protocol 0
  45. Surprise Down Error 0
  46. Poisoned TLP 0
  47. Flow Control Protocol 0
  48. Completion Timeout 0
  49. Completer Abort 0
  50. Unexpected Completion 0
  51. Receiver Overflow 0
  52. Malformed TLP 0
  53. ECRC 0
  54. Unsupported Request 0
  55. ACS Violation 0
  56. Uncorrectable Internal Error 0
  57. MC Blocked TLP 0
  58. AtomicOp Egress Blocked 0
  59. TLP Prefix Blocked Error 0
  60. TOTAL_ERR_FATAL 0
  61. -------------------------------------------------------------------------
  62. Where: /sys/bus/pci/devices/<dev>/aer_dev_nonfatal
  63. Date: July 2018
  64. Kernel Version: 4.19.0
  65. Contact: linux-pci@vger.kernel.org, rajatja@google.com
  66. Description: List of uncorrectable nonfatal errors seen and reported by this
  67. PCI device using ERR_NONFATAL. Note that since multiple errors
  68. may be reported using a single ERR_FATAL message, thus
  69. TOTAL_ERR_NONFATAL at the end of the file may not match the
  70. actual total of all the errors in the file. Sample output:
  71. -------------------------------------------------------------------------
  72. localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_nonfatal
  73. Undefined 0
  74. Data Link Protocol 0
  75. Surprise Down Error 0
  76. Poisoned TLP 0
  77. Flow Control Protocol 0
  78. Completion Timeout 0
  79. Completer Abort 0
  80. Unexpected Completion 0
  81. Receiver Overflow 0
  82. Malformed TLP 0
  83. ECRC 0
  84. Unsupported Request 0
  85. ACS Violation 0
  86. Uncorrectable Internal Error 0
  87. MC Blocked TLP 0
  88. AtomicOp Egress Blocked 0
  89. TLP Prefix Blocked Error 0
  90. TOTAL_ERR_NONFATAL 0
  91. -------------------------------------------------------------------------
  92. ============================
  93. PCIe Rootport AER statistics
  94. ============================
  95. These attributes show up under only the rootports (or root complex event
  96. collectors) that are AER capable. These indicate the number of error messages as
  97. "reported to" the rootport. Please note that the rootports also transmit
  98. (internally) the ERR_* messages for errors seen by the internal rootport PCI
  99. device, so these counters include them and are thus cumulative of all the error
  100. messages on the PCI hierarchy originating at that root port.
  101. Where: /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_cor
  102. Date: July 2018
  103. Kernel Version: 4.19.0
  104. Contact: linux-pci@vger.kernel.org, rajatja@google.com
  105. Description: Total number of ERR_COR messages reported to rootport.
  106. Where: /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_fatal
  107. Date: July 2018
  108. Kernel Version: 4.19.0
  109. Contact: linux-pci@vger.kernel.org, rajatja@google.com
  110. Description: Total number of ERR_FATAL messages reported to rootport.
  111. Where: /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_nonfatal
  112. Date: July 2018
  113. Kernel Version: 4.19.0
  114. Contact: linux-pci@vger.kernel.org, rajatja@google.com
  115. Description: Total number of ERR_NONFATAL messages reported to rootport.