FAQ
Flash attention is an option for accelerating training and inference. Only NVIDIA GPUs of Turing, Ampere, Ada, and Hopper architecture, e.g., H100, A100, RTX 3090, T4, RTX 2080, can support flash atte...
Explore
125,929 skills indexed with the new KISS metadata standard.
Flash attention is an option for accelerating training and inference. Only NVIDIA GPUs of Turing, Ampere, Ada, and Hopper architecture, e.g., H100, A100, RTX 3090, T4, RTX 2080, can support flash atte...
*.so
*.so
/test
中文README.
In order to make the contribution process as smooth as possible, we have established some
Read this in English.
*.swp
generic skill
pycache/
We are happy to accept your contributions to make this repo better and more awesome! To avoid unnecessary work on either
English | 中文
English | 中文
🇨🇳中文 | 🌐English | 📖文档/Docs | ❓提问/Issues | 💬讨论/Discussions | [⚔️
🇨🇳中文 | 🌐English | 📖文档/Docs | ❓提问/Issues | 💬讨论/Discussions | [⚔️
generic skill
*/.DS_Store
🇨🇳中文 | 🌐English | 📖文档/Docs | ❓提问/Issues | 💬讨论/Discussions | [⚔️竞技场/Ar
🇨🇳中文 | 🌐English | 📖文档/Docs | ❓提问/Issues | 💬讨论/Discussions | [⚔️竞技场/Ar
为了保证文件的完整性,请一定要检查下列文件SHA256值的一致性。