Explore
Find agent skills by outcome
138,944 skills indexed with the new KISS metadata standard.
*.bak
.gitattributes
repos:
- repo: https://github.com/astral-sh/ruff-pre-commit
Byte-compiled / optimized / DLL files
pycache/
<!---
Copyright 2023 The HuggingFace Team. All rights reserved.
repos:
- repo: https://github.com/astral-sh/ruff-pre-commit
<!---
Copyright 2020 The HuggingFace Team. All rights reserved.
Byte-compiled / optimized / DLL files
pycache/
Alpaca Instruction Following Dataset
To enable more open-source research on instruction following large language models, we use generate 52K instruction-followng demonstrations using OpenAI's text-davinci-003 model.
Untitled Skill
Alpaca Model Card
Organization developing the model
!# LLaMA Factory
repos:
- repo: https://github.com/pre-commit/pre-commit-hooks
!# LLaMA Factory
Byte-compiled / optimized / DLL files
pycache/
.vscode
.git
Auto detect text files and perform LF normalization
* text=auto
Tokenization
注:作为术语的“tokenization”在中文中尚无共识的概念对应,本文档采用英文表达以利说明。
Introducing Qwen-7B: Open foundation and human-aligned models (of the state-of-the-arts)
Large language models have recently attracted an extremely large amount of
Untitled Skill
中文  |  English  |  日本語 |  Français |  Español
トークン化
Qwen-7B は tiktoken パッケージを使用して、UTF-8 バイトを BPE トークン化します。
Tokenization
Qwen-7B uses BPE tokenization on UTF-8 bytes using the tiktoken package.
Untitled Skill
中文  |  English  |  日本語 |  Français |  Español
Untitled Skill
中文  |  English  |  日本語 |  Français |  Español