PyTorch
llama
shrango commited on
Commit
2ad02ca
·
verified ·
1 Parent(s): e753070

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +46 -4
README.md CHANGED
@@ -1,6 +1,48 @@
1
- This is the reproduced HASS model and is used as a baseline of the paper **PosS:Position Specialist Generates Better Draft for Speculative Decoding**
 
 
2
 
3
- If the code fails to auto-download the models, you may mannually download the following files.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
4
 
5
- - `pytorch_model.bin`: Model weights
6
- - `config.json`: Model config
 
1
+ ---
2
+ license: cc-by-nc-nd-4.0
3
+ ---
4
 
5
+ # HASS Baseline Checkpoint (Used in PosS)
6
+
7
+ This repository provides a reproduced **HASS** model checkpoint that is used as a **baseline** in **PosS (Position Specialist)** experiments.
8
+
9
+ PosS is a speculative decoding method proposed in the paper:
10
+
11
+ > **PosS: Position Specialist Generates Better Draft for Speculative Decoding**
12
+
13
+ In our experiments, this HASS checkpoint serves as the baseline draft model for comparison with the proposed position-specialized draft models.
14
+
15
+ ---
16
+
17
+ ## 🔗 Code
18
+
19
+ The full implementation of PosS, along with training details and evaluation scripts (including EAGLE-2 and HASS baselines), is available at:
20
+
21
+ 👉 **GitHub:** https://github.com/shrango/PosS
22
+
23
+ ---
24
+
25
+ ## 📦 Files
26
+
27
+ If the model is not automatically downloaded by your framework, you may manually download the following files from this repository:
28
+
29
+ - `pytorch_model.bin` — model weights
30
+ - `config.json` — model configuration
31
+
32
+ ---
33
+
34
+ ## 📖 Citation
35
+
36
+ If you use this checkpoint in the context of PosS or refer to the PosS method, please cite:
37
+
38
+ ```bibtex
39
+ @misc{huang2025posspositionspecialistgenerates,
40
+ title = {POSS: Position Specialist Generates Better Draft for Speculative Decoding},
41
+ author = {Langlin Huang and Chengsong Huang and Jixuan Leng and Di Huang and Jiaxin Huang},
42
+ year = {2025},
43
+ eprint = {2506.03566},
44
+ archivePrefix= {arXiv},
45
+ primaryClass = {cs.CL},
46
+ url = {https://arxiv.org/abs/2506.03566}
47
+ }
48