Commit 04b5d2d

jimmy.xj committed

Update README.md

1 parent 12a3c0d, commit 04b5d2d

2 files changed: 6 additions, 4 deletions

`README.md`

Lines changed: 3 additions & 2 deletions

Original file line number	Diff line number	Diff line change
`@@ -19,6 +19,7 @@ DevOps-Eval is a comprehensive evaluation suite specifically designed for founda`
`19`	`19`
`20`	`20`
`21`	`21`	`##🔔 News`
	`22`	`+*[2023.10.30] Add the AIOps Leaderboard.`
`22`	`23`	`*[2023.10.25] Add the AIOps samples, including log parsing, time series anomaly detection, time series classification and root cause analysis.`
`23`	`24`	`*[2023.10.18] Update the initial Leaderboard...`
`24`	`25`	`<br>`
`@@ -38,7 +39,7 @@ DevOps-Eval is a comprehensive evaluation suite specifically designed for founda`
`38`	`39`
`39`	`40`	`##🏆 Leaderboard`
`40`	`41`	`Below are zero-shot and five-shot accuracies from the models that we evaluate in the initial release. We note that five-shot performance is better than zero-shot for many instruction-tuned models.`
`41`		`-###DevOps`
	`42`	`+###👀DevOps`
`42`	`43`	`####Zero Shot`
`43`	`44`
`44`	`45`	`\|ModelName\| plan\| code\| build\| test\| release\| deploy\| operate\| monitor\|AVG\|`
`@@ -78,7 +79,7 @@ Below are zero-shot and five-shot accuracies from the models that we evaluate in`
`78`	`79`	`\| Baichuan2-7B-Chat\| 60.61\| 64.95\| 81.19\| 75.88\| 71.23\| 75.69\| 78.36\| 79.17\| 70.49\|`
`79`	`80`	`\| Internlm-7B-Base\| 62.12\| 65.25\| 77.52\| 80.7\| 74.06\| 78.82\| 79.85\| 75.46\| 69.17\|`
`80`	`81`
`81`		`-###AIOps`
	`82`	`+###🔥AIOps`
`82`	`83`	`####Zero Shot`
`83`	`84`	`\|ModelName\| LogParsing\| RootCauseAnalysis\| TimeSeriesAnomalyDetection\| TimeSeriesClassification\|AVG\|`
`84`	`85`	`\|:-------------------:\|:------------:\|:------------------:\|:---------------------------:\|:-------------------------:\|:-------:\|`

`README_zh.md`

Lines changed: 3 additions & 2 deletions

Original file line number	Diff line number	Diff line change
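`@@ -19,6 +19,7 @@ DevOps-Eval is a comprehensive evaluation dataset designed specifically for large models in the DevOps domain`
`19`	`19`
`20`	`20`
`21`	`21`	`##🔔 Updates`
	`22`	`+*[2023.10.30] Add a leaderboard for AIOps scenarios`
`22`	`23`	`*[2023.10.25] Add AIOps samples, including log parsing, time series anomaly detection, time series classification, and root cause analysis`
`23`	`24`	`*[2023.10.18] DevOps-Eval releases the large model evaluation leaderboard`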
`24`	`25`	`<br>`
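`@@ -39,7 +40,7 @@ DevOps-Eval is a comprehensive evaluation dataset designed specifically for large models in the DevOps domain`
`39`	`40`	`##🏆 Leaderboard`
`40`	`41`	`Below are our initial evaluation results, including zero-shot and five-shot accuracies for several open-source models. We note that for most instruction-tuned models, five-shot accuracy is better than zero-shot.`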
`41`	`42`
`42`		`-###DevOps`
	`43`	`+###👀DevOps`
`43`	`44`	`####Zero Shot`
`44`	`45`
`45`	`46`	`\|Model\| plan\| code\| build\| test\| release\| deploy\| operate\| monitor\|Average\|`
`@@ -80,7 +81,7 @@ DevOps-Eval is a comprehensive evaluation dataset designed specifically for large models in the DevOps domain`
`80`	`81`	`\| Internlm-7B-Base\| 62.12\| 65.25\| 77.52\| 80.7\| 74.06\| 78.82\| 79.85\| 75.46\| 69.17\|`
`81`	`82`
`82`	`83`
`83`		`-###AIOps`
	`84`	`+###🔥AIOps`
`84`	`85`	`####Zero Shot`
`85`	`86`	`\|Model\| Log Parsing\| Root Cause Analysis\| Time Series Anomaly Detection\| Time Series Classification\|Average\|`
`86`	`87`	`\|:-------------------:\|:-----:\|:----:\|:------:\|:----:\|:-------:\|`
