-
- Downloads
Merging script added alongside with training
Showing
- README.md 1 addition, 3 deletionsREADME.md
- resources/qwen_extra_32000_unigram.tiktoken 36703 additions, 0 deletionsresources/qwen_extra_32000_unigram.tiktoken
- resources/unigram_32000_vocab_frequency.txt 31328 additions, 0 deletionsresources/unigram_32000_vocab_frequency.txt
- src/tokenization/add_merges.py 12 additions, 8 deletionssrc/tokenization/add_merges.py
- src/tokenization/convert_hf_tokenizer_vocab_to_freq_list.py 82 additions, 0 deletionssrc/tokenization/convert_hf_tokenizer_vocab_to_freq_list.py
- src/tokenization/convert_tokenizer_spm_to_hf.py 25 additions, 0 deletionssrc/tokenization/convert_tokenizer_spm_to_hf.py
- src/training/train_deepspeed.py 166 additions, 0 deletionssrc/training/train_deepspeed.py
Loading
Please register or sign in to comment