mackenzietechdocs committed on
Commit 16011c9 · verified · 1 Parent(s): 92703c1

Update README.md

  1. README.md +47 -54
README.md CHANGED
@@ -1,54 +1,47 @@
- ---
- license: mit
- model-index:
- - name: MackenzieTest
-   results:
-   - task:
-       type: evaluation
-     dataset:
-       name: Artificial Analysis Benchmarks
-       type: artificial_analysis
-     metrics:
-     - name: Artificial Analysis Intelligence Index
-       type: artificial_analysis_intelligence_index
-       value: 21.8
-     - name: Artificial Analysis Coding Index
-       type: artificial_analysis_coding_index
-       value: 13
-     - name: Artificial Analysis Math Index
-       type: artificial_analysis_math_index
-       value: 22
-     - name: Mmlu Pro
-       type: mmlu_pro
-       value: 0.524
-     - name: Gpqa
-       type: gpqa
-       value: 0.358
-     - name: Hle
-       type: hle
-       value: 0.053
-     - name: Livecodebench
-       type: livecodebench
-       value: 0.247
-     - name: Scicode
-       type: scicode
-       value: 0.144
-     - name: Aime 25
-       type: aime_25
-       value: 0.22
-     - name: Ifbench
-       type: ifbench
-       value: 0.268
-     - name: Lcr
-       type: lcr
-       value: 0.117
-     - name: Terminalbench Hard
-       type: terminalbench_hard
-       value: 0
-     - name: Tau2
-       type: tau2
-       value: 0.249
-     source:
-       name: Artificial Analysis API
-       url: https://artificialanalysis.ai
- ---
 
+ ---
+ license: mit
+ library_name: transformers
+ pipeline_tag: text-generation
+
+ base_model:
+ - deepseek-ai/DeepSeek-V3.2-Exp-Base
+ base_model_relation: finetune
+
+ model-index:
+ - name: MyPrivateModel
+   results:
+   - task:
+       name: Text Generation
+       type: text-generation
+     dataset:
+       name: DummyEval Benchmark
+       type: dummy-eval-benchmark
+     metrics:
+     - name: accuracy
+       type: accuracy
+       value: 0.0
+ ---
+
+ # MyPrivateModel
+
+ This is a **placeholder model card** for a private repository used for experimentation and evaluation tooling.
+
+ ## Model description
+
+ - **Owner:** my-username
+ - **Base model:** `deepseek-ai/DeepSeek-V3.2-Exp-Base`
+ - **Intended use:** Internal testing of evaluation workflows, HF Skills tasks, and model card metadata.
+
+ ## Evaluation
+
+ The `model-index` block above contains **dummy evaluation data** (0.0 accuracy on a fictitious `DummyEval Benchmark`).
+ This exists only to ensure the model card passes Hugging Face metadata validation and can be used to test:
+
+ - `hf_model_evaluation` tools
+ - `import-aa`, `extract-readme`, and related commands
+ - Leaderboard / evaluation display on the Hub
+
+ ## Limitations
+
+ - The evaluation numbers are **not real** and must not be used as performance claims.
+ - This repository is intended only for private experimentation.
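
The new front matter keeps a `model-index` block mainly so that metadata validation passes. As a rough illustration of what such a check involves, here is a minimal Python sketch that walks a `model-index` structure and reports missing fields. The function name `check_model_index`, the `PLACEHOLDER` constant, and the exact set of required keys are assumptions made for this example. It is a simplified stand-in, not the Hub's actual validator, and it expects the YAML to have been parsed into plain Python data already.

```python
from typing import Any


def check_model_index(model_index: list[dict[str, Any]]) -> list[str]:
    """Return a list of problems found; an empty list means this sketch found none.

    Hypothetical, simplified check. The Hub's real validation may differ.
    """
    problems: list[str] = []
    for i, entry in enumerate(model_index):
        if not entry.get("name"):
            problems.append(f"model-index[{i}]: missing name")
        results = entry.get("results") or []
        if not results:
            problems.append(f"model-index[{i}]: no results")
        for j, result in enumerate(results):
            where = f"model-index[{i}].results[{j}]"
            # Assumed required: task.type, dataset.name, dataset.type.
            if not (result.get("task") or {}).get("type"):
                problems.append(f"{where}: missing task.type")
            dataset = result.get("dataset") or {}
            for key in ("name", "type"):
                if not dataset.get(key):
                    problems.append(f"{where}: missing dataset.{key}")
            metrics = result.get("metrics") or []
            if not metrics:
                problems.append(f"{where}: no metrics")
            for k, metric in enumerate(metrics):
                if not metric.get("type"):
                    problems.append(f"{where}.metrics[{k}]: missing type")
                # Test for key presence, because 0.0 is a legitimate value.
                if "value" not in metric:
                    problems.append(f"{where}.metrics[{k}]: missing value")
    return problems


# The placeholder entry from the new README front matter, written as Python data.
PLACEHOLDER = [
    {
        "name": "MyPrivateModel",
        "results": [
            {
                "task": {"name": "Text Generation", "type": "text-generation"},
                "dataset": {
                    "name": "DummyEval Benchmark",
                    "type": "dummy-eval-benchmark",
                },
                "metrics": [
                    {"name": "accuracy", "type": "accuracy", "value": 0.0}
                ],
            }
        ],
    }
]

if __name__ == "__main__":
    print(check_model_index(PLACEHOLDER))
```

The metric value is checked by key presence rather than truthiness, so the dummy `0.0` accuracy in this commit is not flagged as missing.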