Explore
Find agent skills by outcome
129,888 skills indexed with the new KISS metadata standard.
How to contribute to TRL?
Everyone is welcome to contribute, and we value everybody's contribution. Code contributions are not the only way to help the community. Answering questions, helping others, and improving the document...
Contributor Covenant Code of Conduct
We as members, contributors, and leaders pledge to make participation in our
repos:
- repo: https://github.com/astral-sh/ruff-pre-commit
*.bak
.gitattributes
Byte-compiled / optimized / DLL files
pycache/
<!---
Copyright 2023 The HuggingFace Team. All rights reserved.
repos:
- repo: https://github.com/astral-sh/ruff-pre-commit
<!---
Copyright 2020 The HuggingFace Team. All rights reserved.
Alpaca Instruction Following Dataset
To enable more open-source research on instruction following large language models, we use generate 52K instruction-followng demonstrations using OpenAI's text-davinci-003 model.
Byte-compiled / optimized / DLL files
pycache/
Untitled Skill
Alpaca Model Card
Organization developing the model
!# LLaMA Factory
repos:
- repo: https://github.com/pre-commit/pre-commit-hooks
!# LLaMA Factory
Auto detect text files and perform LF normalization
* text=auto
Byte-compiled / optimized / DLL files
pycache/
.vscode
.git
Tokenization
注:作为术语的“tokenization”在中文中尚无共识的概念对应,本文档采用英文表达以利说明。
トークン化
Qwen-7B は tiktoken パッケージを使用して、UTF-8 バイトを BPE トークン化します。
Untitled Skill
中文  |  English  |  日本語 |  Français |  Español
Introducing Qwen-7B: Open foundation and human-aligned models (of the state-of-the-arts)
Large language models have recently attracted an extremely large amount of
Tokenization
Qwen-7B uses BPE tokenization on UTF-8 bytes using the tiktoken package.