Add Artificial Analysis evaluations for kimi-k2-thinking

#2
Files changed (1) hide show
  1. README.md +54 -3
README.md CHANGED
@@ -1,3 +1,54 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ model-index:
4
+ - name: MackenzieTest
5
+ results:
6
+ - task:
7
+ type: evaluation
8
+ dataset:
9
+ name: Artificial Analysis Benchmarks
10
+ type: artificial_analysis
11
+ metrics:
12
+ - name: Artificial Analysis Intelligence Index
13
+ type: artificial_analysis_intelligence_index
14
+ value: 67
15
+ - name: Artificial Analysis Coding Index
16
+ type: artificial_analysis_coding_index
17
+ value: 52.2
18
+ - name: Artificial Analysis Math Index
19
+ type: artificial_analysis_math_index
20
+ value: 94.7
21
+ - name: Mmlu Pro
22
+ type: mmlu_pro
23
+ value: 0.848
24
+ - name: Gpqa
25
+ type: gpqa
26
+ value: 0.838
27
+ - name: Hle
28
+ type: hle
29
+ value: 0.223
30
+ - name: Livecodebench
31
+ type: livecodebench
32
+ value: 0.853
33
+ - name: Scicode
34
+ type: scicode
35
+ value: 0.424
36
+ - name: Aime 25
37
+ type: aime_25
38
+ value: 0.947
39
+ - name: Ifbench
40
+ type: ifbench
41
+ value: 0.681
42
+ - name: Lcr
43
+ type: lcr
44
+ value: 0.663
45
+ - name: Terminalbench Hard
46
+ type: terminalbench_hard
47
+ value: 0.291
48
+ - name: Tau2
49
+ type: tau2
50
+ value: 0.93
51
+ source:
52
+ name: Artificial Analysis API
53
+ url: https://artificialanalysis.ai
54
+ ---