Skip to content

Commit

Permalink
[BAAI] update marddown (FlagOpen#651)
Browse files Browse the repository at this point in the history
  • Loading branch information
tianxiao-baai authored Jul 15, 2024
1 parent 2deea1e commit cc9c8a4
Show file tree
Hide file tree
Showing 12 changed files with 24 additions and 35 deletions.
5 changes: 2 additions & 3 deletions operation/benchmarks/bitwise_and/nvidia/A100_40_SXM/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -36,15 +36,14 @@ https://github.com/FlagOpen/FlagGems. Commit ID:9168f2d031ecc1b31a9f658fb66dd673

## 其他评测结果

| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时>延 |
| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时延 |
| ---- | -------------- | -------------- | ------------ | ------------ | -------------- | -------------- | ------------ |
| flaggems | 0.00E+00 | 108940.43us | 106414.08us | 9.18op/s | 9.4op/s | 279049.06us | 98172.16us |
| nativetorch | 0.00E+00 | 100299.41us | 96353.28us | 9.97op/s | 10.38op/s | 270552.12us | 96428.79us |

## 能耗监控结果

| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单
卡TDP |
| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单卡TDP |
| ---- | ------- | ------- | ------- | ----- | ------------ | ------------ | ------------- | ----- |
| nativetorch监控结果 | 1560.0W | 1560.0W | 0.0W | / | 59.96W | 62.0W | 1.89W | 1560.0 |
| flaggems监控结果 | 1560.0W | 1560.0W | 0.0W | / | 60.33W | 62.0W | 1.55W | 1560.0 |
Expand Down
5 changes: 2 additions & 3 deletions operation/benchmarks/bitwise_not/nvidia/A100_40_SXM/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -36,15 +36,14 @@ https://github.com/FlagOpen/FlagGems. Commit ID:9168f2d031ecc1b31a9f658fb66dd673

## 其他评测结果

| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时>延 |
| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时延 |
| ---- | -------------- | -------------- | ------------ | ------------ | -------------- | -------------- | ------------ |
| flaggems | 0.00E+00 | 108143.29us | 100317.18us | 9.25op/s | 9.97op/s | 39373.23us | 104317.95us |
| nativetorch | 0.00E+00 | 107053.46us | 102887.42us | 9.34op/s | 9.72op/s | 43383.28us | 98195.36us |

## 能耗监控结果

| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单
卡TDP |
| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单卡TDP |
| ---- | ------- | ------- | ------- | ----- | ------------ | ------------ | ------------- | ----- |
| nativetorch监控结果 | 1482.0W | 1482.0W | 0.0W | / | 60.39W | 62.0W | 1.5W | 1482.0 |
| flaggems监控结果 | 1560.0W | 1560.0W | 0.0W | / | 60.36W | 62.0W | 1.55W | 1560.0 |
Expand Down
5 changes: 2 additions & 3 deletions operation/benchmarks/bitwise_or/nvidia/A100_40_SXM/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -36,15 +36,14 @@ https://github.com/FlagOpen/FlagGems. Commit ID:9168f2d031ecc1b31a9f658fb66dd673

## 其他评测结果

| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时>延 |
| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时延 |
| ---- | -------------- | -------------- | ------------ | ------------ | -------------- | -------------- | ------------ |
| flaggems | 0.00E+00 | 104786.08us | 100961.28us | 9.54op/s | 9.9op/s | 298346.76us | 98256.06us |
| nativetorch | 0.00E+00 | 107694.18us | 99507.2us | 9.29op/s | 10.05op/s | 269314.33us | 98708.36us |

## 能耗监控结果

| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单
卡TDP |
| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单卡TDP |
| ---- | ------- | ------- | ------- | ----- | ------------ | ------------ | ------------- | ----- |
| nativetorch监控结果 | 1560.0W | 1560.0W | 0.0W | / | 60.15W | 62.0W | 1.68W | 1560.0 |
| flaggems监控结果 | 1560.0W | 1560.0W | 0.0W | / | 60.45W | 62.0W | 1.5W | 1560.0 |
Expand Down
5 changes: 2 additions & 3 deletions operation/benchmarks/gelu/nvidia/A100_40_SXM/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -36,15 +36,14 @@ https://github.com/FlagOpen/FlagGems. Commit ID:9168f2d031ecc1b31a9f658fb66dd673

## 其他评测结果

| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时>延 |
| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时延 |
| ---- | -------------- | -------------- | ------------ | ------------ | -------------- | -------------- | ------------ |
| flaggems | 3.87E-07 | 6139.86us | 6145.02us | 162.87op/s | 162.73op/s | 314473.94us | 6205.19us |
| nativetorch | 4.20E-07 | 6138.4us | 6141.95us | 162.91op/s | 162.81op/s | 10541.9us | 6156.06us |

## 能耗监控结果

| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单
卡TDP |
| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单卡TDP |
| ---- | ------- | ------- | ------- | ----- | ------------ | ------------ | ------------- | ----- |
| nativetorch监控结果 | 1716.0W | 1716.0W | 0.0W | / | 325.77W | 329.0W | 5.39W | 1716.0 |
| flaggems监控结果 | 1794.0W | 1794.0W | 0.0W | / | 373.03W | 379.0W | 4.64W | 1794.0 |
Expand Down
5 changes: 2 additions & 3 deletions operation/benchmarks/max/nvidia/A100_40_SXM/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -37,15 +37,14 @@ https://github.com/FlagOpen/FlagGems. Commit ID:982781081f5d62856064ae986e8927a3

## 其他评测结果

| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时>延 |
| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时延 |
| ---- | -------------- | -------------- | ------------ | ------------ | -------------- | -------------- | ------------ |
| flaggems | 0.00E+00 | 2872.97us | 2894.85us | 348.07op/s | 345.44op/s | 715679.85us | 2927.86us |
| nativetorch | 0.00E+00 | 2961.24us | 2982.91us | 337.7op/s | 335.24op/s | 3189.87us | 2980.2us |

## 能耗监控结果

| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单
卡TDP |
| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单卡TDP |
| ---- | ------- | ------- | ------- | ----- | ------------ | ------------ | ------------- | ----- |
| nativetorch监控结果 | 1716.0W | 1716.0W | 0.0W | / | 274.77W | 280.0W | 3.77W | 1716.0 |
| flaggems监控结果 | 1716.0W | 1716.0W | 0.0W | / | 276.03W | 278.0W | 3.02W | 1716.0 |
Expand Down
5 changes: 2 additions & 3 deletions operation/benchmarks/min/nvidia/A100_40_SXM/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -37,15 +37,14 @@ https://github.com/FlagOpen/FlagGems. Commit ID:982781081f5d62856064ae986e8927a3

## 其他评测结果

| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时>延 |
| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时延 |
| ---- | -------------- | -------------- | ------------ | ------------ | -------------- | -------------- | ------------ |
| flaggems | 0.00E+00 | 2873.03us | 2895.87us | 348.06op/s | 345.32op/s | 1925186.13us | 2939.79us |
| nativetorch | 0.00E+00 | 2961.08us | 2981.89us | 337.71op/s | 335.36op/s | 3209.14us | 2987.49us |

## 能耗监控结果

| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单
卡TDP |
| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单卡TDP |
| ---- | ------- | ------- | ------- | ----- | ------------ | ------------ | ------------- | ----- |
| nativetorch监控结果 | 1716.0W | 1716.0W | 0.0W | / | 276.03W | 278.0W | 2.76W | 1716.0 |
| flaggems监控结果 | 1716.0W | 1716.0W | 0.0W | / | 276.66W | 281.0W | 2.56W | 1716.0 |
Expand Down
2 changes: 1 addition & 1 deletion operation/benchmarks/mm/nvidia/A100_40_SXM/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -50,7 +50,7 @@ https://github.com/FlagOpen/FlagGems. Commit ID:982781081f5d62856064ae986e8927a3

## 其他重要监控结果

| 监控项 | 系统平均CPU占用 | 系统平均内存占用 | 单卡平均温度 | 单卡平均显存占用 |
| 监控项 | 系统平均CPU占用 | 系统平均内存占用 | 单卡平均温度 | 单卡最大显存占用 |
| ---- | --------- | -------- | ------------ | -------------- |
| nativetorch监控结果 | 1.021% | 1.228% | 63.70°C | 0.262% |
| flaggems监控结果 | 1.058% | 1.192% | 64.05°C | 0.338% |
7 changes: 3 additions & 4 deletions operation/benchmarks/prod/nvidia/A100_40_SXM/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -43,11 +43,10 @@ https://github.com/FlagOpen/FlagGems. Commit ID:9168f2d031ecc1b31a9f658fb66dd673

## 能耗监控结果

| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单
卡TDP |
| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单卡TDP |
| ---- | ------- | ------- | ------- | ----- | ------------ | ------------ | ------------- | ----- |
| nativetorch监控结果 | 1638.0W | 1638.0W | 0.0W | / | 267.55W | 270.0W | 3.29W | 1638.0 |
| flaggems监控结果 | 1716.0W | 1716.0W | 0.0W | / | 275.55W | 278.0W | 3.63W | 1716.0 |
| nativetorch监控结果 | 1638.0W | 1638.0W | 0.0W| / | 267.55W | 270.0W | 3.29W | 1638.0 |
| flaggems监控结果 | 1716.0W | 1716.0W | 0.0W | / | 275.55W | 278.0W | 3.63W | 1716.0 |

## 其他重要监控结果

Expand Down
5 changes: 2 additions & 3 deletions operation/benchmarks/relu/nvidia/A100_40_SXM/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -36,15 +36,14 @@ https://github.com/FlagOpen/FlagGems. Commit ID:9168f2d031ecc1b31a9f658fb66dd673

## 其他评测结果

| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时>延 |
| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时延 |
| ---- | -------------- | -------------- | ------------ | ------------ | -------------- | -------------- | ------------ |
| flaggems | 0.00E+00 | 6191.06us | 6194.18us | 161.52op/s | 161.44op/s | 410813.62us | 6272.68us |
| nativetorch | 0.00E+00 | 6194.68us | 6198.27us | 161.43op/s | 161.34op/s | 10396.25us | 6263.84us |

## 能耗监控结果

| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单
卡TDP |
| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单卡TDP |
| ---- | ------- | ------- | ------- | ----- | ------------ | ------------ | ------------- | ----- |
| nativetorch监控结果 | 1638.0W | 1638.0W | 0.0W | / | 254.82W | 259.0W | 4.5W | 1638.0 |
| flaggems监控结果 | 1677.0W | 1716.0W | 39.0W | / | 289.42W | 295.0W | 3.19W | 1677.0 |
Expand Down
5 changes: 2 additions & 3 deletions operation/benchmarks/rsqrt/nvidia/A100_40_SXM/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -36,15 +36,14 @@ https://github.com/FlagOpen/FlagGems. Commit ID:9168f2d031ecc1b31a9f658fb66dd673

## 其他评测结果

| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时>延 |
| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时延 |
| ---- | -------------- | -------------- | ------------ | ------------ | -------------- | -------------- | ------------ |
| flaggems | 8.18E-10 | 6184.4us | 6190.08us | 161.7op/s | 161.55op/s | 185962.49us | 6251.48us |
| nativetorch | 5.91E-10 | 6195.47us | 6197.25us | 161.41op/s | 161.36op/s | 6267.83us | 6237.22us |

## 能耗监控结果

| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单
卡TDP |
| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单卡TDP |
| ---- | ------- | ------- | ------- | ----- | ------------ | ------------ | ------------- | ----- |
| nativetorch监控结果 | 1638.0W | 1638.0W | 0.0W | / | 258.61W | 263.0W | 4.54W | 1638.0 |
| flaggems监控结果 | 1716.0W | 1716.0W | 0.0W | / | 308.11W | 312.0W | 3.87W | 1716.0 |
Expand Down
5 changes: 2 additions & 3 deletions operation/benchmarks/sigmoid/nvidia/A100_40_SXM/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -36,15 +36,14 @@ https://github.com/FlagOpen/FlagGems. Commit ID:982781081f5d62856064ae986e8927a3

## 其他评测结果

| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时>延 |
| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时延 |
| ---- | -------------- | -------------- | ------------ | ------------ | -------------- | -------------- | ------------ |
| flaggems | 8.56E-10 | 6184.88us | 6188.03us | 161.68op/s | 161.6op/s | 302165.88us | 6273.36us |
| nativetorch | 7.91E-10 | 6183.92us | 6188.03us | 161.71op/s | 161.6op/s | 701797.94us | 6237.07us |

## 能耗监控结果

| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单
卡TDP |
| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单卡TDP |
| ---- | ------- | ------- | ------- | ----- | ------------ | ------------ | ------------- | ----- |
| nativetorch监控结果 | 1716.0W | 1716.0W | 0.0W | / | 300.44W | 305.0W | 3.32W | 1716.0 |
| flaggems监控结果 | 1716.0W | 1716.0W | 0.0W | / | 313.47W | 318.0W | 3.29W | 1716.0 |
Expand Down
5 changes: 2 additions & 3 deletions operation/benchmarks/tanh/nvidia/A100_40_SXM/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -36,15 +36,14 @@ https://github.com/FlagOpen/FlagGems. Commit ID:982781081f5d62856064ae986e8927a3

## 其他评测结果

| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时>延 |
| 评测项 | 相对误差(with FP64-CPU)标准差 | cputime | kerneltime | cputime吞吐 | kerneltime吞吐 | 无预热时延 | 预热后时延 |
| ---- | -------------- | -------------- | ------------ | ------------ | -------------- | -------------- | ------------ |
| flaggems | 9.52E-10 | 6169.25us | 6171.65us | 162.09op/s | 162.03op/s | 301271.92us | 6246.96us |
| nativetorch | 9.52E-10 | 6188.82us | 6194.18us | 161.58op/s | 161.44op/s | 12297.66us | 6213.04us |

## 能耗监控结果

| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单
卡TDP |
| 监控项 | 系统平均功耗 | 系统最大功耗 | 系统功耗标准差 | 单机TDP | 单卡平均功耗 | 单卡最大功耗 | 单卡功耗标准差 | 单卡TDP |
| ---- | ------- | ------- | ------- | ----- | ------------ | ------------ | ------------- | ----- |
| nativetorch监控结果 | 1716.0W | 1716.0W | 0.0W | / | 303.74W | 308.0W | 4.99W | 1716.0 |
| flaggems监控结果 | 1794.0W | 1794.0W | 0.0W | / | 344.47W | 348.0W | 3.68W | 1794.0 |
Expand Down

0 comments on commit cc9c8a4

Please sign in to comment.