<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title>PCIe - Tag - Naifan Li's Blog</title><link>https://blog.omagiclee.com/tags/pcie/</link><description>PCIe - Tag - Naifan Li's Blog</description><generator>Hugo -- gohugo.io</generator><language>en-us</language><lastBuildDate>Tue, 27 May 2025 16:15:09 +0800</lastBuildDate><atom:link href="https://blog.omagiclee.com/tags/pcie/" rel="self" type="application/rss+xml"/><item><title>Performance Profile</title><link>https://blog.omagiclee.com/posts/infra/performance/</link><pubDate>Tue, 27 May 2025 16:15:09 +0800</pubDate><author>Naifan Li</author><guid>https://blog.omagiclee.com/posts/infra/performance/</guid><description><![CDATA[<h2 id="确认pcie-型号通道数量带宽">确认PCIe 型号/通道数量/带宽</h2>
<table>
  <thead>
      <tr>
          <th style="text-align: left">PCIe 版本</th>
          <th style="text-align: left">发布时间</th>
          <th style="text-align: left">传输速率</th>
          <th style="text-align: left">编码方式</th>
          <th style="text-align: left">单通道带宽</th>
          <th style="text-align: left">x16 带宽</th>
      </tr>
  </thead>
  <tbody>
      <tr>
          <td style="text-align: left">PCIe 1.0</td>
          <td style="text-align: left">2003年</td>
          <td style="text-align: left">2.5 GT/s</td>
          <td style="text-align: left">8b/10b</td>
          <td style="text-align: left">250 MB/s</td>
          <td style="text-align: left">4 GB/s</td>
      </tr>
      <tr>
          <td style="text-align: left">PCIe 2.0</td>
          <td style="text-align: left">2007年</td>
          <td style="text-align: left">5.0 GT/s</td>
          <td style="text-align: left">8b/10b</td>
          <td style="text-align: left">500 MB/s</td>
          <td style="text-align: left">8 GB/s</td>
      </tr>
      <tr>
          <td style="text-align: left">PCIe 3.0</td>
          <td style="text-align: left">2010年</td>
          <td style="text-align: left">8.0 GT/s</td>
          <td style="text-align: left">128b/130b</td>
          <td style="text-align: left">984.6 MB/s</td>
          <td style="text-align: left">15.75 GB/s</td>
      </tr>
      <tr>
          <td style="text-align: left">PCIe 4.0</td>
          <td style="text-align: left">2017年</td>
          <td style="text-align: left">16.0 GT/s</td>
          <td style="text-align: left">128b/130b</td>
          <td style="text-align: left">1.969 GB/s</td>
          <td style="text-align: left">31.5 GB/s</td>
      </tr>
      <tr>
          <td style="text-align: left">PCIe 5.0</td>
          <td style="text-align: left">2019年</td>
          <td style="text-align: left">32.0 GT/s</td>
          <td style="text-align: left">128b/130b</td>
          <td style="text-align: left">3.938 GB/s</td>
          <td style="text-align: left">63 GB/s</td>
      </tr>
      <tr>
          <td style="text-align: left">PCIe 6.0</td>
          <td style="text-align: left">2022年</td>
          <td style="text-align: left">64.0 GT/s</td>
          <td style="text-align: left">PAM4</td>
          <td style="text-align: left">7.877 GB/s</td>
          <td style="text-align: left">126 GB/s</td>
      </tr>
  </tbody>
</table>
<ul>
<li>GT/s (Giga Transfers per second) 表示每秒传输次数</li>
<li>实际带宽需要考虑编码开销</li>
</ul>
<h2 id="4090-vs-l40s-vs-a100-pcie-规格对比">4090 vs L40S vs A100 PCIe 规格对比</h2>
<table>
  <thead>
      <tr>
          <th style="text-align: left">显卡型号</th>
          <th style="text-align: left">PCIe 版本</th>
          <th style="text-align: left">通道数</th>
          <th style="text-align: left">理论带宽</th>
          <th style="text-align: left">实际带宽</th>
      </tr>
  </thead>
  <tbody>
      <tr>
          <td style="text-align: left">RTX 4090</td>
          <td style="text-align: left">PCIe 4.0</td>
          <td style="text-align: left">x16</td>
          <td style="text-align: left">64 GB/s</td>
          <td style="text-align: left">63 GB/s</td>
      </tr>
      <tr>
          <td style="text-align: left">L40S</td>
          <td style="text-align: left">PCIe 4.0</td>
          <td style="text-align: left">x16</td>
          <td style="text-align: left">64 GB/s</td>
          <td style="text-align: left">63 GB/s</td>
      </tr>
      <tr>
          <td style="text-align: left">A100</td>
          <td style="text-align: left">PCIe 4.0</td>
          <td style="text-align: left">x16</td>
          <td style="text-align: left">64 GB/s</td>
          <td style="text-align: left">63 GB/s</td>
      </tr>
  </tbody>
</table>
<ul>
<li>所有三款显卡都支持 PCIe 4.0 x16，理论带宽相同</li>
<li>实际使用中，带宽利用率受以下因素影响：
<ul>
<li>主板 PCIe 插槽版本</li>
<li>CPU 支持的 PCIe 版本</li>
<li>系统总线带宽</li>
<li>其他 PCIe 设备占用</li>
</ul>
</li>
</ul>
<h2 id="查看实际-pcie-配置">查看实际 PCIe 配置：</h2>
<div class="highlight"><div class="chroma">
<table class="lntable"><tr><td class="lntd">
<pre tabindex="0" class="chroma"><code><span class="lnt">1
</span><span class="lnt">2
</span><span class="lnt">3
</span><span class="lnt">4
</span></code></pre></td>
<td class="lntd">
<pre tabindex="0" class="chroma"><code class="language-bash" data-lang="bash"><span class="line"><span class="cl"><span class="c1"># 查看 PCIe 链路状态</span>
</span></span><span class="line"><span class="cl">nvidia-smi -q <span class="p">|</span> grep -A <span class="m">10</span> <span class="s2">&#34;PCI&#34;</span>
</span></span><span class="line"><span class="cl"><span class="c1"># 或使用</span>
</span></span><span class="line"><span class="cl">lspci -vv <span class="p">|</span> grep -i <span class="s2">&#34;LnkSta&#34;</span>
</span></span></code></pre></td></tr></table>
</div>
</div><div class="highlight"><div class="chroma">
<table class="lntable"><tr><td class="lntd">
<pre tabindex="0" class="chroma"><code><span class="lnt">1
</span><span class="lnt">2
</span><span class="lnt">3
</span><span class="lnt">4
</span></code></pre></td>
<td class="lntd">
<pre tabindex="0" class="chroma"><code class="language-shell" data-lang="shell"><span class="line"><span class="cl"><span class="c1"># nvidia-smi: NVIDIA 显卡管理和监控工具，显示 GPU 状态信息</span>
</span></span><span class="line"><span class="cl"><span class="c1"># -q: Query, 详细显示 NVIDIA GPU 的所有状态和配置信息</span>
</span></span><span class="line"><span class="cl"><span class="c1"># grep -A 10 &#34;PCI&#34;: 查找包含&#34;PCI&#34;关键字的行，并显示该行及其后面10行内容</span>
</span></span><span class="line"><span class="cl">nvidia-smi -q <span class="p">|</span> grep -A <span class="m">10</span> <span class="s2">&#34;PCI&#34;</span>
</span></span></code></pre></td></tr></table>
</div>
</div><div class="highlight"><div class="chroma">
<table class="lntable"><tr><td class="lntd">
<pre tabindex="0" class="chroma"><code><span class="lnt">1
</span><span class="lnt">2
</span><span class="lnt">3
</span></code></pre></td>
<td class="lntd">
<pre tabindex="0" class="chroma"><code class="language-shell" data-lang="shell"><span class="line"><span class="cl">watch -n 0.5 nvidia-smi dmon -s puctmvb
</span></span><span class="line"><span class="cl">nvtop
</span></span><span class="line"><span class="cl">NVIDIA Nsight Systems
</span></span></code></pre></td></tr></table>
</div>
</div><p>nvidia-smi</p>]]></description></item></channel></rss>